VirtusLab

Reach peak performance | IT consulting and software engineering backed by our expertise in Dev Experience, ML and Scala

Staff Data Engineer

Data EngineerData EngineerFull TimeRemoteTeam 201-500Since 2010H1B No SponsorCompany SiteLinkedIn

Location

New York

Posted

4 days ago

Salary

Not specified

Bachelor DegreeExperience acceptedEnglishAirflowApacheAWSBig QueryCloudDistributed SystemsETLGoogle Cloud PlatformKafkaKubernetes

Job Description

• building web crawling or large-scale data systems from scratch • designing scalable, fault-tolerant distributed systems • leading complex technical initiatives • mentoring engineers and promoting a collaborative culture • operating ETL/ELT pipelines • extracting structured/unstructured web data

Job Requirements

  • Proven experience building web crawling or large-scale data systems from scratch
  • Strong architectural skills in designing scalable, fault-tolerant distributed systems
  • Track record leading complex technical initiatives and driving architecture direction for teams
  • Demonstrated ability to evolve production systems incrementally while maintaining reliability
  • Experience mentoring engineers at all levels and promoting a collaborative culture
  • Deep background in large-scale data engineering (terabytes daily)
  • Hands-on experience with cloud data warehouses (BigQuery, Snowflake)
  • Experience with Apache Kafka, Kubernetes (GKE/EKS), and orchestration tools (Airflow)
  • Familiarity with multi-cloud environments (GCP + AWS)
  • Expertise in designing and operating ETL/ELT pipelines
  • Deep expertise in web crawling technologies and advanced scraping (Scrapy or similar)
  • Experience in extracting structured/unstructured web data and SERP extraction
  • Knowledge of proxy infrastructure management, anti-bot detection, and ethical crawling
  • Familiarity with crawling vendors and AI/LLM-based extraction approaches
  • Support the VirtusLab U.S. and international teams by lending senior technical expertise to client-facing activities, including technical discovery sessions, workshops, and solution architecture
  • Conduct requirements analysis and solution discovery, identifying business and technical needs
  • Provide technical consulting and advisory services, recommending appropriate data architectures aligned with customer goals
  • Prepare and review technical sections of commercial offers, including solution descriptions, statements of work (SoWs), project estimates, timelines, and delivery models

Benefits

  • self-development opportunities
  • good working conditions

Related Categories

Related Job Pages

More Data Engineer Jobs

Relativity Archiving Analyst

Contact Government Services

Contact Review prides itself on finding high-quality, high-accountability, barred attorneys specifically tailored to the needs of our project. Assists with document review, privilege review, expert testimony, legal research, and foreign language translation Fosters a culture where every team member sees themselves as an extension of the project's team Looks for ways to improve efficiency and streamline workflows

Data Engineer4 days ago
Full TimeRemote

CGS is seeking a Relativity Archiving Analyst, who will be responsible for vetting Relativity workspaces and file share folders and archiving or purging them. File shares will be moved to archive locations. Relativity workspaces will be archived using both Relativity ARM and a fl...

United States
$74.7K - $101.4K / year

Lead Microsoft Data Engineer

Procentrix, LLC

Delivering practical solutions to solve complex challenges with an eye on maximizing customers' current IT investments

Data Engineer4 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor

The Lead Azure Data Engineer is responsible for migration, governance, and data integration for the implementation of new Microsoft Cloud based document routing and task management system. They will ensure alignment with the technical, security, and document management requiremen...

United States
$135K - $160K / year

Data Engineer I

Centene Corporation

Transforming the health of the communities we serve, one person at a time.

Data Engineer4 days ago
Full TimeRemoteTeam 10,001+Since 1984H1B No Sponsor

The role involves developing and operationalizing data pipelines for data ingestion, transformation, validation, and optimization to support reporting and advanced analytics consumption. This includes designing and implementing standardized data management procedures and developing high-performance data structures for business intelligence.

United States
$27 - $49 / hour

Contract Data Engineer

Silvur

The only retirement planning platform exclusively built for those over 50. Get your Retirement Score.

Data Engineer4 days ago
ContractRemoteTeam 11-50H1B No Sponsor

Contract Data Engineer optimizing SQL for retirement decision-making tools

PostgreSQLSQL
Remote
$65 / hour