Veeva Systems
The Industry Cloud for Life Sciences
Data Engineer
Location
Kansas, Missouri
Posted
175 days ago
Salary
$75K - $130K / year
3 yrs exp, English, Airflow, AWS, Cloud, Java, Python, Spark, SQL
Job Description
• Take responsibility for the OpenData data processing workflows in the US
• Build and maintain data processing pipelines and tools using state-of-the-art technologies
• Work with Python on Spark-based data pipelines
• Develop algorithms to build complex data relationships
• Build analytical data structures to support reporting
• Build and maintain Data Quality processes
• Collaborate with the Product team to adapt reference data to changing demands in the market
• Take end-to-end ownership of solutions, including reports and data quality assurance
Job Requirements
- 3+ years of experience developing data pipelines using cloud-managed Spark clusters (e.g. AWS EMR, Databricks)
- Fluent in Python or Java and Spark (3+ years of experience)
- Previous experience building tools and libraries to automate and streamline data processing workflows
- Proficient with SQL / SparkSQL
- Hands-on experience working with a Data Lakehouse
- Good verbal and written communication skills, and proven experience working and delivering in an Agile environment
- Experience running data workflows through DevOps pipelines (nice to have)
- Experience developing data pipelines with orchestration tools (e.g. Airflow) (nice to have)
- Experience with AWS services for data processing, such as EMR and MWAA (nice to have)
- Previous experience in the Life Sciences sector (nice to have)
- Veeva is not sponsoring H1B or supporting H1 transfers for this role (no visa sponsorship)
Benefits
- Medical, dental, vision, and basic life insurance
- Flexible PTO and company paid holidays
- Retirement programs
- 1% charitable giving program
- Work Anywhere (flexibility to work from home or in the office)