Innodata Inc.

Innodata solves your toughest data engineering challenges using artificial intelligence and human expertise.

Senior Data Engineer – Real-Time & Distributed Systems, GCP

Data EngineerData EngineerFull TimeRemoteTeam 1,001-5,000H1B No SponsorCompany SiteLinkedIn

Location

New Jersey

Posted

22 days ago

Salary

Not specified

Bachelor DegreeEnglishAirflowApacheCloudDistributed SystemsNo SQLPythonSpark

Job Description

• Design, build, and optimize scalable data pipelines for batch and real-time processing • Develop and maintain event-driven architectures for high-throughput systems • Ensure data reliability, performance, and low-latency processing across distributed environments • Collaborate with data scientists and application teams to enable analytics and AI use cases • Implement best practices in performance tuning, monitoring, and cost optimization

Job Requirements

  • Advanced proficiency in Python for backend and large-scale data processing
  • Strong experience building and managing big data pipelines in production environments
  • Hands-on expertise with workflow orchestration tools such as Airflow or Google Cloud Composer
  • Proven experience in batch and streaming data processing using: Apache Spark Apache Beam (Dataflow)
  • Experience designing and operating event-driven systems using Pub/Sub
  • Strong understanding of distributed systems architecture and scalability patterns
  • Experience managing globally distributed, low-latency datasets
  • Hands-on experience with NoSQL databases and/or Google Cloud Spanner
  • Strong knowledge of system reliability, fault tolerance, and performance optimization

Related Categories

Related Job Pages

More Data Engineer Jobs

Data Migration Analyst

Mark43

Cloud Native Computer-Aided Dispatch, Records Management, and Analytics

Data Engineer22 days ago
Full TimeRemoteTeam 201-500Since 2012H1B Sponsor

Data Migration Analyst guiding public safety agencies through data transition.

Alabama + 34 moreAll locations: Alabama, Arizona, California, Colorado, Connecticut, Florida, Idaho, Illinois, Iowa, Kansas, Maine, Nebraska, New Hampshire, New Jersey, New Mexico, New York, North Carolina, Ohio, Oklahoma, Oregon, Maryland, Massachusetts, Michigan, Minnesota, Missouri, Pennsylvania, South Carolina, Tennessee, Texas, Utah, Vermont, Virginia, Washington, West Virginia, Wisconsin

Data Center Engineer - Remote

EVOTEK

Today’s Emerging Technology will be Tomorrow’s Competitive Advantage

Data Engineer22 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

Join EVOTEK: North America’s Premier Digital Business EnablerAs North America's premier enabler of secure digital business, we integrate cutting-edge technical expertise across data center, network, security, cloud, and communications domains. By d...

California

Lead Data Engineer

HealthCare.com

Work with us! Now hiring across the globe.

Data Engineer22 days ago
Full TimeRemoteTeam 201-500Since 2014H1B Sponsor

Data Engineer designing and maintaining data pipelines for HealthCare.com

AirflowAWSDynamoDBPythonSQL
Connecticut + 3 moreAll locations: Connecticut, New Jersey, New York, Massachusetts

Data Architect

Great Minds

Creator of Eureka Math, Wit & Wisdom, and PhD Science curricula and Geodes books for emerging readers.

Data Engineer22 days ago
Full TimeRemoteTeam 1,001-5,000Since 2007H1B No Sponsor

Data Architect creating enterprise data architecture for Great Minds' educational tools

AWS
District Of Columbia
$109K - $119K / year