Arbiter
Unifying and automating referral workflows so that every patient reaches the right provider at the right time and cost.
Senior Data Engineer, AI Infrastructure
Location
United States
Posted
109 days ago
Salary
$180K - $240K / year
Bachelor Degree, 8 yrs exp, English, Airflow, BigQuery, Cloud, Distributed Systems, Docker, Google Cloud Platform, Kubernetes, Python
Job Description
• AI/ML Pipeline Development: Design, develop, and maintain robust, scalable data pipelines specifically for our AI models. This includes data ingestion, cleaning, transformation, classification, and tagging to create high-quality, reliable training and evaluation datasets.
• MLOps & Infrastructure: Build and manage the AI infrastructure to support the full machine learning lifecycle. This includes automating model training, versioning, deployment, and monitoring (CI/CD for ML).
• Embedding & Vector Systems: Architect and operate scalable systems for generating, storing, and serving embeddings. Implement and manage vector databases to power retrieval-augmented generation (RAG) and semantic search for our AI agents.
• AI Platform & Tooling: Champion and build core tooling, frameworks, and standards for the AI/ML platform. Develop systems that enable AI engineers to iterate quickly and self-serve for model development and deployment.
• Cross-Functional Collaboration: Partner closely with AI engineers, product managers, and software engineers to understand their needs. Translate complex model requirements into stable, scalable infrastructure and data solutions.
• Mentorship & Growth: Actively participate in mentoring junior engineers, contributing to our team's growth through technical guidance, code reviews, and knowledge sharing.
• Hiring & Onboarding: Play an active role in interviewing and onboarding new team members, helping to build a world-class data engineering organization.
Job Requirements
- 8+ years of deep, hands-on experience in Data Engineering, MLOps, or AI/ML Infrastructure, ideally within a high-growth tech environment.
- Exceptional expertise in data structures, algorithms, and distributed systems.
- Mastery of Python for large-scale data processing and ML applications.
- Extensive experience designing, building, and optimizing complex, fault-tolerant data pipelines specifically for ML models (e.g., feature engineering, training data generation).
- Profound understanding and hands-on experience with cloud-native data and AI platforms, especially Google Cloud Platform (GCP) (e.g., Vertex AI, BigQuery, Dataflow, GKE).
- Strong experience with containerization (Docker) and orchestration (Kubernetes) for deploying and scaling applications.
- Demonstrated experience with modern ML orchestration (e.g., Kubeflow, Airflow), data transformation (dbt), and MLOps principles.
- Intimate knowledge of and ability to implement unit, integration, and functional testing strategies.
- Experience providing technical leadership and guidance, and thinking strategically and analytically to solve problems.
- Friendly communication skills and ability to work well in a diverse team setting.
- Demonstrated experience working with many cross-functional partners.
Benefits
- Highly Competitive Salary & Equity Package: Designed to rival top FAANG compensation, including meaningful equity.
- Generous Paid Time Off (PTO): To ensure a healthy work-life balance.
- Comprehensive Health, Vision, and Dental Insurance: Robust coverage for you and your family.
- Life and Disability Insurance: Providing financial security.
- SIMPLE IRA Matching: To support your long-term financial goals.
- Professional Development Budget: Support for conferences, courses, and certifications to fuel your continuous learning.
- Wellness Programs: Initiatives to support your physical and mental health.