Sayari

Science for decision making.

Principal Data Engineer

Data EngineerData EngineerFull TimeRemoteTeam 1-10H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

56 days ago

Salary

$200K - $220K / year

Bachelor Degree8 yrs expEnglishAirflowApacheCassandraCloudElastic SearchSpark

Job Description

• Design and implement complex Spark data logic, focusing on performance optimization, data volume tuning, and robust execution. • Own the architectural design of graph build pipelines, ensuring they are scalable, automated, and highly resilient. • Plan and oversee the strategic re-architecture of data pipelines to meet evolving business needs and scale. • Optimize infrastructure-as-code and schema designs to reduce cloud costs and improve pipeline latency. • Act as a technical consultant for the team, fostering a collaborative and engineer-led approach to design decisions. • Support the development of the engineering team through code reviews, design docs, and architectural best practices. • Ensure the accuracy of mission-critical data outputs.

Job Requirements

  • 8+ years of experience in the big data space, with a proven track record of implementing large-scale features and leading process redesigns.
  • Expert-level mastery of Apache Spark for large-scale data processing.
  • Strong experience with orchestration tools (Airflow) and cloud computing environments.
  • Hands-on experience architecting and managing data flows into databases such as Elasticsearch, Memgraph, and Cassandra.
  • Demonstrated ability in system architecture, including Infrastructure as Code (IaC) and schema design.
  • A "builder" mindset with experience evolving and improving existing architectures to meet new scale requirements.

Benefits

  • 100% fully paid medical, vision, and dental for employees and their dependents
  • Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days
  • Outstanding compensation package; competitive commissions for revenue roles and quarterly bonuses for non-revenue positions
  • A strong commitment to diversity, equity, and inclusion
  • Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave
  • A collaborative and positive culture - your team will be as smart and driven as you
  • Limitless growth and learning opportunities

Related Categories

Related Job Pages

More Data Engineer Jobs

Data Engineer

Spear AI

Artificial Intelligence & Machine Learning for National Security

Data Engineer56 days ago
Full TimeRemoteTeam 11-50Since 2020

We’re seeking a skilled Data Engineer to build the next-generation data management and artificial intelligence platform for maritime domain awareness. What you’ll do: Implement real-time data pipelines with MQTT and Redpanda for stream processing. Implement offline data pipelines...

PythonRustPostgreSQLApache IcebergApache ParquetAmazon S3MQTTRedpandaApache KafkaDagsterApache AirflowProtocol Bufferstime-series data processingbinary message parsingOLTPOLAPdistributed systemsstreaming architecturesbatch processing
United States

Senior Data Engineer

Quanata

Quanata is on a mission to help ensure a better world through context-based insurance solutions. With the full backing of State Farm, we’re powering the insurance industry of tomorrow and helping enable better driving behaviors. Our top tier team of tech-minded professionals comprises data scientists, actuaries, engineers, designers and marketers—many from the best companies in Silicon Valley—and we’re inspired to create the insurance products and experiences of the future. Learn more about us and our work at http://www.quanata.com.

Data Engineer56 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor

Senior Data Engineer delivering data science services and streaming data pipelines

AirflowAWSCloudKafkaPythonSQLTerraform
United States
$215K - $300K / year

Senior Data Engineer

Tekmetric

Simplify Your Life. Supercharge Your Shop.

Data Engineer56 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

Senior Data Engineer responsible for designing data infrastructure for Tekmetric

AirflowApacheETLJavaPythonScalaSparkSQLTableau
United States

Senior Data Engineer – Analytics

Machinify, Inc.

Bending the healthcare cost curve with AI.

Data Engineer57 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

Data Engineer transforming raw external data into powerful datasets at Machinify.

AirflowAWSCloudKafkaPythonSparkSQL
California