Aledade, a public benefit corporation, exists to empower the most transformational part of our health care landscape - independent primary care. We were founded in 2014, and since then, we've become the largest network of independent primary care in the country - helping practices, health centers and clinics deliver better care to their patients and thrive in value-based care. Additionally, by creating value-based contracts across a wide variety of health plans, we aim to flip the script on the traditional fee-for-service model. Our work strengthens continuity of care, aligns incentives and ensures primary care physicians are paid for what they do best - keeping patients healthy. If you want to help create a health care system that is good for patients, good for practices and good for society - and if you're eager to join a collaborative, inclusive and remote-first culture - you've come to the right place.
Senior Data Platform Engineer II (Databricks)
Location
United States
Posted
5 days ago
Salary
Not specified
Job Description
As a Senior Data Platform Engineer II, you will architect and manage the high-performance, distributed data environments that power our healthcare analytics. You will move beyond traditional maintenance to ensure our Databricks Lakehouse and Snowflake environments scale indefinitely. You will be responsible for the health, optimization, and security of our data platforms, making complex data accessible and expressive for web applications and AI.
\n- Develop and implement scalable and performant solutions.
- Partner, as a peer, with Engineering Managers, Product Managers, and stakeholders throughout Aledade to develop and execute technical roadmaps using Agile processes.
- Mentor and coach more junior engineers including thorough pull request reviews for other developers and be receptive to critical feedback on your own work.
- BS/BTech (or higher) in Computer Science, Engineering or a related field or equivalent experience.
- 6+ years experience as an engineer building and optimizing highly scalable distributed data systems (e.g., Databricks, Spark, or Snowflake).
- 3+ years of experience working with SQL and data modeling on large multi-table data sets.
- 3+ years of experience acting as a trusted technical decision-maker in a team setting, solving for short-term and long-term business value.
- 3+ years of experience coaching other engineers.
- Databricks & Lakehouse Architecture: Deep expertise in managing Databricks workspaces, including Unity Catalog for data governance, lineage, and fine-grained access control.
- Infrastructure as Code (IaC): Advanced proficiency with Terraform (or similar) to automate the provisioning and scaling of Databricks clusters, cloud resources (AWS preferred), and networking.
- Snowflake Proficiency (Nice-to-Have): Experience managing Snowflake environments, specifically focusing on warehouse cost optimization, security integration, and secure data sharing.
- Modern Database Internals: In-depth knowledge of distributed systems, including partitioning, liquid clustering/Z-Ordering, sharding, and high-availability strategies for petabyte-scale data.
- Observability & Optimization: Proven track record in performance monitoring and query tuning for distributed workloads to ensure system reliability and cost-efficiency.
- Data Engineering Lifecycle: Experience designing and optimizing high-throughput ETL/ELT pipelines and ingestion systems (batch and streaming) using Spark.
- Deployment & Orchestration: Experience building robust CI/CD pipelines for data infrastructure and deploying services using containerization (Docker, Kubernetes).
- Sensitive Data Handling: Expertise in building systems that handle protected information, with specific experience in HIPAA and SOX compliance frameworks.
- Healthcare Data Expertise: Experience navigating health-tech data complexities, such as Electronic Health Records (EHR), clinical data formats (HL7/FHIR), and claims data.
- Sitting for prolonged periods of time. Extensive use of computers and keyboard. Occasional walking and lifting may be required.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
This role involves designing, implementing, and supporting data-intensive software solutions, focusing on complex SQL development, ETL pipelines, and analytics infrastructure for content tooling. Responsibilities include analyzing, designing, programming, debugging, and modifying software enhancements that process and deliver content data for business intelligence and reporting.
The Senior Data Engineer will be responsible for developing, maintaining, and optimizing dbt models across numerous data marts and building/maintaining ETL/ELT pipelines using Airflow for data ingestion from over 40 sources. Key duties also involve managing Snowflake infrastructure, driving cost optimization, and supporting CI/CD pipelines.
As a Senior Data Engineer, you will be instrumental in designing and building the next generation of our data infrastructure. Your work will handle massive volumes of behavioral, operational, and customer data, directly impacting product features like segmentation, personalizatio...
GHX is seeking a Software Engineer III to work on our Content Tooling solution with a focus on data engineering and analytics. This individual will be responsible for the creation, implementation, and support of data-intensive software solutions including complex SQL development,...