Theoria Medical

We don’t meet the standards, we set them.

Senior Data Engineer

Data EngineerData EngineerFull TimeRemoteTeam 1,001-5,000H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

5 days ago

Salary

Not specified

Microsoft FabricApache AirflowMongo DBSQLData ModelingMedallion ArchitecturePythonETLData LakehouseHIPAA

Job Description

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.

Role Description

We’re looking for a Senior Software Engineer responsible for designing, building, and maintaining scalable, reliable data pipelines and data platforms that support enterprise analytics and reporting across the organization. This role will partner closely with analytics, data science, and business teams to ensure healthcare data is well-modeled, governed, and readily available for downstream use. The ideal candidate has deep experience with modern data engineering tools, strong data modeling expertise, and a solid understanding of healthcare data.

Shift Structure: Remote position, Shift (TBA)

Key Responsibilities

  • Design, build, and maintain scalable data pipelines using Microsoft Fabric and Apache Airflow
  • Ingest, transform, and integrate data from a variety of sources, including relational systems, APIs, and MongoDB
  • Implement and manage data solutions aligned to Medallion architecture principles (Bronze, Silver, Gold)
  • Design and maintain analytical data models, including fact and dimension tables, to support reporting and analytics
  • Optimize data storage, performance, and reliability across lakehouse and warehouse environments
  • Ensure data quality, observability, and lineage through validation, monitoring, and documentation
  • Collaborate with data analysts and BI developers to enable performant, well-modeled datasets for Power BI
  • Partner with clinical, operational, and technical stakeholders to understand data requirements and constraints
  • Support data governance, security, and compliance efforts, including HIPAA-related controls
  • Mentor junior data engineers and contribute to engineering standards and best practices

Qualifications

  • 5+ years of experience as a Data Engineer, Senior Data Engineer, or similar role
  • Strong experience with Microsoft Fabric (e.g., Lakehouse, Data Warehouse, pipelines, notebooks)
  • Hands-on experience with Apache Airflow for workflow orchestration and scheduling
  • Experience working with MongoDB and integrating NoSQL data sources into analytical platforms
  • Strong SQL skills and experience building performant analytical queries and transformations
  • Deep understanding of data modeling concepts, including fact and dimension tables
  • Practical experience implementing Medallion architecture in a data lake or lakehouse environment
  • Experience working with healthcare data (e.g., EHR/EMR, claims, clinical, revenue cycle, or operational data)
  • Strong understanding of data engineering best practices around scalability, reliability, and maintainability

Preferred Experience

  • Experience in a healthcare provider, payer, or health technology organization
  • Familiarity with HIPAA and healthcare data privacy and security requirements
  • Experience with CI/CD for data pipelines and infrastructure-as-code concepts
  • Exposure to streaming or near–real-time data processing
  • Experience supporting enterprise BI platforms such as Power BI

Benefits

  • Competitive salary
  • 401(k) with employer match
  • Health, dental, and vision insurance
  • PTO + paid holidays
  • Life insurance coverage
  • Remote flexibility with a national legal scope

Job Requirements

  • 5+ years of experience as a Data Engineer, Senior Data Engineer, or similar role
  • Strong experience with Microsoft Fabric (e.g., Lakehouse, Data Warehouse, pipelines, notebooks)
  • Hands-on experience with Apache Airflow for workflow orchestration and scheduling
  • Experience working with MongoDB and integrating NoSQL data sources into analytical platforms
  • Strong SQL skills and experience building performant analytical queries and transformations
  • Deep understanding of data modeling concepts, including fact and dimension tables
  • Practical experience implementing Medallion architecture in a data lake or lakehouse environment
  • Experience working with healthcare data (e.g., EHR/EMR, claims, clinical, revenue cycle, or operational data)
  • Strong understanding of data engineering best practices around scalability, reliability, and maintainability
  • Preferred Experience
  • Experience in a healthcare provider, payer, or health technology organization
  • Familiarity with HIPAA and healthcare data privacy and security requirements
  • Experience with CI/CD for data pipelines and infrastructure-as-code concepts
  • Exposure to streaming or near–real-time data processing
  • Experience supporting enterprise BI platforms such as Power BI

Benefits

  • Competitive salary
  • 401(k) with employer match
  • Health, dental, and vision insurance
  • PTO + paid holidays
  • Life insurance coverage
  • Remote flexibility with a national legal scope

Related Categories

Related Job Pages

More Data Engineer Jobs

Full TimeRemote

You will join a high-visibility initiative focused on maturing the organization's data landscape. The Outpost project is moving from an "evolving" data environment to a structured, high-performance ecosystem. As a Data Architect, your mission is to solve current structural incons...

SQLData ModelingPythonDatabase Performance Tuning
United States + 24 moreAll locations: United States, Brazil, Colombia, Argentina, Chile, Venezuela, Bolivarian Republic Of, Bolivia, Plurinational State Of, Ecuador, French Guiana, Guyana, Paraguay, Peru, Suriname, Uruguay, Mexico, Costa Rica, El Salvador, Guatemala, Honduras, Nicaragua, Panama, Dominican Republic, Puerto Rico

Lead Data Engineer

C the Signs

C the Signs is a cancer prediction system that identifies patients at risk of cancer at the earliest, most curable stage

Data Engineer5 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

We are seeking a Lead Data Engineer to architect, build, and scale our next-generation healthcare data platform. In this role, you will lead the effort to design robust pipelines, modernize data architecture, and ensure high-quality ingestion and trans...

PythonSQLBigQuerydbtApache AirflowGoogle Cloud PlatformPub/SubDataflowCloud RunCloud ComposerHL7FHIRDICOMETLELTData ModelingAWSHIPAA ComplianceData QualityData Governance
United States

Lead Data Engineer

C the Signs

C the Signs is a cancer prediction system that identifies patients at risk of cancer at the earliest, most curable stage

Data Engineer5 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

Lead Data Engineer architecting healthcare data platform with AI technology

AirflowAmazon RedshiftAWSBigQueryCloudETLGoogle Cloud PlatformPythonSQL
Connecticut + 5 moreAll locations: Connecticut, New Hampshire, New York, Massachusetts, Rhode Island, Wisconsin
Full TimeRemote

Following our remarkable growth in Switzerland, Poland, and Germany, Unit8 is expanding into the US market. We are seeking a Senior Forward Deployed Engineer who will work directly with customers, owning delivery strategy and implementation. You will act as a thought leader and s...

PythonSQLPalantir FoundryData EngineeringData PipelineAgileComputer ScienceMathematicsPre-sales
United States