Solutions by Text

Engage, Interact, Transact.

Data Engineer (AI)

Data EngineerData EngineerFull TimeRemoteTeam 51-200Since 2008H1B SponsorCompany SiteLinkedIn

Location

District Of Columbia

Posted

24 days ago

Salary

Not specified

Bachelor Degree9 yrs expEnglishAWSAzurePythonSQL

Job Description

About Data Society Group At Data Society Group, we provide the highest quality, leading-edge, industry-tailored data and AI training and solutions for Fortune 1,000 companies and federal, state, and local governmental organizations. We partner with our clients to educate, equip, and empower their workforces with the skills they need to achieve their goals and expand their impact. We are empowering the workforces of the future, supporting engineers and scientists to train up on the most complex AI solutions and Machine Learning skills. Role Overview We are seeking a capable and resourceful Data Engineer with expertise in cloud-based text-focused AI systems to join our technology and solutions team. In this role, you will be a key individual contributor, applying your expertise to build robust, scalable, and complex data and AI solutions for our external clients. You will work within a cross-functional team, collaborating closely with UX Designers, Engineers, and Project Managers to translate client requirements into high-quality technical deliverables. Responsibilities Design, build, and maintain scalable data pipelines for structured and unstructured data ingestion, transformation, and processing. Architect, build, and deploy LLM-based solutions on cloud platforms, including prompt pipelines, orchestration layers, embeddings, vector databases, and evaluation workflows. Design and implement RAG systems end-to-end including document ingestion, chunking/embedding, indexing, retrieval, grounding, model integration. Architect and enforce data models, governance, cataloging and schema design to support both analytics and AI workloads. Build and optimize cloud-native data architectures to support compute, storage, and orchestration for high-throughput, production-grade AI workloads. Implement reliable and efficient ETL patterns, leveraging best practices for data quality, lineage, versioning, and cataloging. Instrument observability and monitoring for data pipelines, including latency, error rates, and schema drift, with alerting and automated remediation where possible. Implement monitoring, observability, and performance optimization for data and AI systems. Operate effectively within Agile workflows, contribute to sprint planning, estimations, backlog refinement and continuous improvement. Work closely with clients to gather requirements, provide technical guidance and present solutions and implementation plans. Communicate complex technical information to both technical and non-technical stakeholders. Work cross-functionally with UX, engineering, and PM teams to deliver client-facing solutions. Translate complex technical needs into clear development requirements and implementation plans. Stay current with emerging technologies and recommend improvements to our engineering practices, architecture patterns, and cloud ecosystem. Qualifications Hands-on experience deploying LLM-based applications, including RAG or similar retrieval systems. Proven experience deploying systems on AWS or Azure (AWS preferred). Strong understanding of embeddings, chunking strategies, retrieval optimization, and evaluation. 5+ years of data and analytics engineering in cloud environments. Expertise in SQL, Python, and schema design with experience in data cataloging and governance tools. Demonstrated experience building robust and maintainable data architectures, including real-time or steaming pipelines. Experience working in Agile / Scrum development processes. Excellent communication skills and ability to work cross-functionally with non-technical teams. Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without notice. This position will be remote in the US though based out of the Washington, DC area with travel to client sites in DC if needed.

Job Requirements

  • Note:
  • Due to the confidential nature of our federal government Clients, this role requires the ability to pass a United States federal government Public Trust background check and is exclusively open to U.S. Citizens located within the United States.

Related Categories

Related Job Pages

More Data Engineer Jobs

Backend Engineer - Data Infrastructure

Spotify

Passionate music fans. Innovative tech pros. Perfect harmony. Join our band.

Data Engineer24 days ago
Full TimeRemoteTeam 5,001-10,000Since 2008H1B Sponsor

Develop and maintain the data analytics platform and backend services, optimize data infrastructure, and collaborate with teams to enhance analytics capabilities.

BigQueryClickhouseDruidGrafanaJavaKubernetesPinotPrometheusSnowflakeTerraform
New York
$125.6K - $179.4K / year

Solution Engineer - Data Engineering Specialist

Snowflake

Snowflake delivers the AI Data Cloud to help organizations share data, build apps and power their business with AI.

Data Engineer24 days ago
Full TimeRemoteTeam 5,001-10,000Since 2012H1B Sponsor

The Senior Data Platform Architect leads the design and architecture of Snowflake's Cloud Data Platform, collaborates with sales, and engages with customers to demonstrate value and support proof of concepts.

AirflowFivetranFlinkHadoopHiveInformaticaKafkaMatillionPandasPysparkPythonSparkSQL
United States
$165K - $216.6K / year

Manager, Data Engineering

Dandy

Helping dentists achieve more by making the entire lab process digital — and effortless.

Data Engineer24 days ago
Full TimeRemoteTeam 501-1,000Since 2020H1B Sponsor

The Manager of Data Engineering leads a team to ensure high-quality data products, focusing on data modeling, semantic architecture, and performance optimization, while managing stakeholder communication and strategic vision.

AirflowBigQueryDagsterDbtSnowflakeSQL
United States
$199.7K - $249.6K / year

Staff Data Engineer, Analytics Data Engineering

Dropbox

Dropbox is the one place to keep life organized and keep work moving.

Data Engineer24 days ago
Full TimeRemoteTeam 1,001-5,000Since 2007H1B Sponsor

Staff Data Engineer at Dropbox enhancing analytics data engineering

AirflowPythonSparkSQL
United States
$176.8K - $239.2K / year