Innodata solves your toughest data engineering challenges using artificial intelligence and human expertise.
Senior Data Engineer – Real-Time & Distributed Systems, GCP
Location
New Jersey
Posted
22 days ago
Salary
Not specified
Job Description
Job Requirements
- Advanced proficiency in Python for backend and large-scale data processing
- Strong experience building and managing big data pipelines in production environments
- Hands-on expertise with workflow orchestration tools such as Airflow or Google Cloud Composer
- Proven experience in batch and streaming data processing using:
  - Apache Spark
  - Apache Beam (Dataflow)
- Experience designing and operating event-driven systems using Pub/Sub
- Strong understanding of distributed systems architecture and scalability patterns
- Experience managing globally distributed, low-latency datasets
- Hands-on experience with NoSQL databases and/or Google Cloud Spanner
- Strong knowledge of system reliability, fault tolerance, and performance optimization