Senior Data Platform & Operations Engineer

Data EngineerData EngineerFull TimeRemote

Location

United States

Posted

8 days ago

Salary

Not specified

PythonSQLPy SparkDuck DBAzureDockerKubernetesTerraform

Job Description

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.

Role Description

This role involves owning critical data pipelines end to end, from ingestion through transformation to production delivery, while also helping manage cloud infrastructure and DevOps operations.

  • Migrate existing PySpark pipelines to Ibis / DuckLake on the Ascend Gen3 platform
  • Build and maintain disease-state-specific measures pipelines
  • Build and maintain the cross-tenant benchmarks pipeline
  • Build and maintain an Echo NLP pipeline that extracts structured data from unstructured echocardiogram narrative text
  • Work across the full pipeline lifecycle: design, implementation, testing, deployment, and monitoring
  • Expand test coverage and improve testing harnesses and related infrastructure
  • Fix noisy and flaky test suites that generate false positives
  • Add new test types, including browser-based tests and full pipeline validation runs
  • Help establish a testing culture where tests are meaningful, not ceremonial
  • Manage Azure cloud infrastructure
  • Interface with the MSP on business and technical issues
  • Implement and maintain Infrastructure as Code using Bicep / Terraform
  • Enforce RBAC and least-privilege access policies
  • Monitor and optimize cloud costs
  • Improve and automate data ingestion, movement, and recovery processes
  • Build toward reliable, rollback-capable release cadences
  • Reduce manual operational steps
  • Make data operations more observable and repeatable
  • Use AI coding tools such as Claude Code, Cursor, or similar in daily workflow
  • Contribute to an AI-forward engineering culture focused on faster iteration, better code quality, and stronger testing

Qualifications

  • 7+ years of data engineering or platform engineering experience
  • Deep Python and SQL proficiency
  • Experience with PySpark
  • Ideally experience with Ibis or other dataframe abstraction layers
  • Experience with DuckDB
  • Experience working at a healthcare SaaS startup
  • Azure cloud experience, or strong AWS / GCP experience with the ability to ramp quickly
  • Experience with Kubernetes and Docker
  • Experience with data quality frameworks such as Great Expectations, dbt tests, or similar
  • Active, proficient use of AI coding tools in daily work
  • Self-directed working style with the ability to identify problems and solve them without constant direction

Requirements

  • Healthcare data experience, including clinical registries, HIPAA, or Safe Harbor de-identification
  • Experience with DuckLake
  • Experience with Ascend.io or similar pipeline orchestration platforms
  • Experience collaborating with offshore engineering teams across time zones
  • Experience with Infrastructure as Code tools such as Terraform, Bicep, or CloudFormation

Benefits

  • Fully remote, US-based role
  • Small team, high impact
  • Work that directly affects how hospitals improve patient care
  • Direct access to the CTO
  • Influence over architectural decisions

Job Requirements

  • 7+ years of data engineering or platform engineering experience
  • Deep Python and SQL proficiency
  • Experience with PySpark
  • Ideally experience with Ibis or other dataframe abstraction layers
  • Experience with DuckDB
  • Experience working at a healthcare SaaS startup
  • Azure cloud experience, or strong AWS / GCP experience with the ability to ramp quickly
  • Experience with Kubernetes and Docker
  • Experience with data quality frameworks such as Great Expectations, dbt tests, or similar
  • Active, proficient use of AI coding tools in daily work
  • Self-directed working style with the ability to identify problems and solve them without constant direction
  • Healthcare data experience, including clinical registries, HIPAA, or Safe Harbor de-identification
  • Experience with DuckLake
  • Experience with Ascend.io or similar pipeline orchestration platforms
  • Experience collaborating with offshore engineering teams across time zones
  • Experience with Infrastructure as Code tools such as Terraform, Bicep, or CloudFormation

Benefits

  • Fully remote, US-based role
  • Small team, high impact
  • Work that directly affects how hospitals improve patient care
  • Direct access to the CTO
  • Influence over architectural decisions

Related Categories

Related Job Pages

More Data Engineer Jobs

Lead Data Analytics Engineer

Jobgether

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best! Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time. #LI-CL1 We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Data Engineer8 days ago
Full TimeRemote

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Data Analytics Professional (Remote). As a key driver in our analytical team, you'll play a vital role in interpreting complex datasets that influence public behavioral health and ...

PythonRSASSQLGitData ManagementStatistical AnalysisDashboard DevelopmentWorkflow Automation
United States

Senior Data Analytics Engineer

Jobgether

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best! Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time. #LI-CL1 We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Data Engineer8 days ago
Full TimeRemote

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Data Analytics Professional (Remote). As a key driver in our analytical team, you'll play a vital role in interpreting complex datasets that influence public behavioral health and ...

PythonRSQLSASGitData ManagementData AnalyticsStatistical AnalysisDashboard DevelopmentWorkflow Automation
United States

CDP Architect

Jobgether

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best! Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time. #LI-CL1 We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Data Engineer8 days ago
ContractRemote

This role provides a high-impact opportunity to architect and implement Customer Data Platform (CDP) solutions that enable clients to leverage their data for business growth and customer engagement. You will work closely with cross-functional teams—including data engineers, analy...

CDPAdobe Experience PlatformAWSGCPAzureSnowflakeRedshiftBigQueryAPIGDPRCCPAData GovernanceData Pipeline
United States

Data Engineer – Pipelines, Structured Markup

Vulcury

Vulcury invests in early stage startups and advises companies of all sizes on strategy, growth, and efficiency

Data Engineer8 days ago
Full TimeRemoteTeam 1-10Since 2022

Data Engineer developing ingestion pipelines for Vulcury's manufacturing intelligence infrastructure

AirflowCloudETLPostgreSQLPythonSQL
United States