Rohirrim
The AI-Native Platform Rewriting the Architecture of Modern Acquisitions.
Senior Data Engineer
Location: California
Posted: 16 days ago
Salary: Not specified
Bachelor Degree, 10 yrs exp, English, Airflow, AWS, Azure, Cloud, Docker, Elasticsearch, ETL, Kubernetes, MySQL, PostgreSQL, Python, Redis, SQL
Job Description
• Design, build, and optimize data pipelines and infrastructure for AI products
• Collaborate closely with AI/ML teams, product teams, and security/compliance partners
• Develop and operate ETL/ELT workflows
• Implement and optimize vector database systems and embeddings pipelines
• Architect and manage Azure-based data infrastructure
• Build internal tools for metadata extraction and document parsing
• Monitor and improve pipeline performance and reliability
Job Requirements
- 10+ years in Data Engineering, Software Engineering, or ML/Data Infrastructure roles
- Strong experience with Python, SQL, and modern data engineering tools (Airflow, Dagster, dbt, Prefect, etc.)
- Experience building large-scale document extraction ETL pipelines (OCR, PDF parsing, metadata extraction, NLP preprocessing)
- Proficiency with Kubernetes, Docker, and containerized data pipelines deployed on Azure, AWS, and/or Google Cloud
- Hands-on experience with relational databases (Postgres, SQL Server, MySQL) and non-relational systems such as Elasticsearch, Redis, and graph databases
- Experience with document-heavy or text-heavy data processing (OCR, parsing, NLP preprocessing)
- Strong mindset for data quality, governance, lineage, and validation
- Excellent communicator who can align with ML, engineering, and product teams
Benefits
- Dynamic environment
- Leadership opportunities
- Technical direction
- Mentorship roles