RefinedScience
Advance care by bringing together the best science, data and minds to discover pathways to life beyond disease.
Data Engineering Intern
Location
United States
Posted
21 days ago
Salary
Not specified
Bachelor Degree0.1 yrs expEnglishAirflowAWSAzureDbtGCPPythonSparkSQL
Job Description
Data Engineering Intern
At RefinedScience, our mission is to advance care by bringing together the best science, data and minds – disease by disease, patient by patient, cell by cell to discover pathways to life beyond disease.
WHAT WE ARE LOOKING FOR
We are seeking a motivated Data Engineering Intern to join our team. This internship is open to undergraduate and graduate students who are interested in building data infrastructure that supports advanced analytics, data science, and AI-driven insights in healthcare and life sciences.
You will work closely with data scientists, bioinformaticians, and engineers to help design, build, and improve data pipelines and platforms that power RefinedScience’s research and analytics initiatives.
KEY ACTIVITIES
Assist in building and maintaining data pipelines for ingesting, transforming, and validating clinical, biological, and real-world data
Support integration of data from multiple sources (e.g., clinical data, analytics outputs, external datasets)
Help develop and optimize ETL/ELT workflows to ensure data quality and reliability
Collaborate with data science and bioinformatics teams to support analytics and machine learning workflows
Contribute to data modeling, documentation, and best practices for data infrastructure
Participate in code reviews, testing, and performance improvements
Participate in Quality Reviews and Troubleshooting
Communicate progress and findings to cross-functional teams
MUST HAVES
Job Requirements
- Currently enrolled in a Bachelor’s, Master’s, or Ph.D.
- program in Data Engineering, Computer Science, Data Science, Software Engineering, or a related field
- Experience with Python and/or SQL through coursework, projects, or internships
- Basic understanding of data pipelines, databases, and data transformation concepts
- Familiarity with version control (e.g., Git)
- Strong analytical thinking and problem-solving skills
- Ability to learn quickly and work collaboratively in a team environment
- NICE TO HAVE
- Exposure to cloud platforms (AWS, GCP, or Azure)
- Familiarity with data tools such as Airflow, dbt, Spark, or similar frameworks
- Experience working with large or complex datasets
- Interest in healthcare, life sciences, or applied AI
- Duration:
- 8 – 10 Weeks
- WHY YOU’LL LOVE REFINED SCIENCE
- Team + Values
- At RefinedScience, we seamlessly integrate top-tier clinical and biological data with expert knowledge to provide unparalleled insights. We maximize patient impact with these unique insights by optimizing clinical trial probability of success and time to actionable results. We work across biopharma and we are a trusted partner in achieving better results, faster – working together to unlock strategic advantage.
- Our Values
- Act with Purpose – We believe in rigor through deliberate and thoughtful actions
- Be Curious – Curiosity is the spark that ignites innovation and growth
- Take Ownership – True ownership leads to pride and commitment in the work we do
- Invest in Relationships – Building strong connections is the foundation for effective collaboration and trust for long term success
- Embrace Agility – We celebrate agile thinking, resilience, and adaptability
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Data Engineer21 days ago
Full TimeRemoteTeam 10,001+Since 1856H1B Sponsor
Senior Data Engineer designing modern data applications in healthcare
Cloud
Washington
Data Engineer21 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor
Data Engineering Team Lead for the University of Maine System overseeing data engineering services
CloudETLSQL
Senior Data Architect
DatavailWe help clients turn data into decisions no matter where it lives-in apps, on-prem, in a hybrid model, or in the cloud.
Data Engineer21 days ago
Full TimeRemoteTeam 1,001-5,000Since 2007H1B Sponsor
Senior Data Architect with expertise in data management and cloud services
BigQueryCloudETLPySparkSQL
United States
Senior Data Engineer – Cloud Data Platform, Snowflake, dbt
AssistRxSpecialty therapy initiation and patient support company delivering informed access and improved outcomes.
Data Engineer22 days ago
Full TimeRemoteTeam 501-1,000Since 2011H1B Sponsor
Senior Data Engineer designing cloud data platforms at AssistRx
CloudSQLVault
Florida