Sayari

Science for decision making.

Associate Data Engineer

Data EngineerData EngineerFull TimeRemoteTeam 1-10H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

30 days ago

Salary

$85K - $100K / year

Bachelor Degree1 yr expEnglishApache AirflowSparkAWSDockerGCPPostgre SQLPythonScalaSQL

Job Description

About Sayari: Sayari is a venture-backed and founder-led global corporate data provider and commercial intelligence platform that serves financial institutions, legal and advisory service providers, multinationals, journalists, and governments. Thousands of analysts and investigators in over 30 countries rely on our products to safely conduct cross-border trade, research front-page news stories, confidently enter new markets, and prevent financial crimes such as corruption and money laundering. Our company culture is defined by a dedication to our mission of using open data to prevent illicit commercial and financial activity, a passion for finding novel approaches to complex problems, and an understanding that diverse perspectives create optimal outcomes. We embrace cross-team collaboration, encourage training and learning opportunities, and reward initiative and innovation. If you like working with supportive, high-performing, and curious teams, Sayari is the place for you. POSITION DESCRIPTION Sayari is looking for an Entry-Level Associate Data Engineer to join our Data team located in Washington, DC. The Data team is an integral part of our Engineering division and works closely with our Software & Product teams, as well as other key stakeholders across the business. JOB RESPONSIBILITIES: Write and deploy crawling scripts to collect source data from the web Write and run data transformers in Scala Spark to standardize bulk data sets Write and run modules in Python to parse entity references and relationships from source data Diagnose and fix bugs reported by internal and external users Analyze and report on internal datasets to answer questions and inform feature work Work collaboratively on and across a team of engineers using basic agile principles Give and receive feedback through code reviews SKILLS & EXPERIENCE Required Skills & Experience Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, or a related technical field — or equivalent hands-on experience Working knowledge of SQL and relational databases (such as Postgres) Experience writing code in Python (e.g., pandas, NumPy, Scrapy) or Java/Scala Familiarity with data processing frameworks like Apache Spark, or strong interest in learning them on the job Understanding of object-oriented programming principles and collaborative development in shared repositories Ability to work closely with data scientists, analysts, and engineers to help solve complex problems across large, diverse datasets Desired Skills & Experience Exposure to workflow orchestration tools such as Apache Airflow and CI/CD pipelines Familiarity with graph, search, or NoSQL databases Experience contributing to data ingestion, transformation, or ETL pipelines Comfort working with containerized applications (e.g., Docker) Experience using cloud-based data tools in AWS or GCP environments Introductory experience or coursework involving machine learning, especially in distributed systems like Spark Awareness of entity resolution concepts or interest in learning how entities are linked across data sources Experience working with international or non-English datasets The target base salary for this position is $85,000-$100,000 plus company bonus and equity. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above. Benefits: 100% fully paid medical, vision, and dental for employees and their dependents Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days Outstanding compensation package; competitive commissions for revenue roles and bonuses for non-revenue positions A strong commitment to diversity, equity, and inclusion Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave A collaborative and positive culture - your team will be as smart and driven as you Limitless growth and learning opportunities Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply. Pay Range $85,000 — $100,000 USD

Related Categories

Related Job Pages

More Data Engineer Jobs

Data Engineer

InterWorks

Exponential growth starts once we connect.

Data Engineer30 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

Data Engineering Consultant guiding clients through data platform decisions at InterWorks

CloudETLSQL
North Carolina + 1 moreAll locations: North Carolina, Oklahoma
$90K - $150K / year

Data Engineer Mid-Level - MLE - AI

IA na iFood

Somos uma empresa brasileira de tecnologia referência na América Latina. Por meio de soluções inovadoras, conectamos milhares de restaurantes a milhões de consumidores diariamente com uma média de 100 milhões de pedidos mensais. Além do delivery de comida, também somos Mercado, Farmácia e Pet. Temos também o iFood Pago, nossa Fintech, que engloba o iFood Benefícios, o vale alimentação e refeição do iFood e o próprio iFood Pago, o banco do restaurante.

Data Engineer30 days ago
Full TimeRemote

Desenvolver e operacionalizar modelos de Machine Learning, garantindo a implementação e manutenção de pipelines eficientes. Realizar deploy de modelos e gerenciar o ciclo de vida de modelos (LCM) dentro da plataforma GenPlat. Colaborar com a equipe para fomentar uma cultura forte...

PythonAWSMLOpsMachine LearningPipelineSoftware EngineeringCloud InfrastructureOpen SourceGenAIGitDockerKubernetesCI/CD
United States + 1 moreAll locations: United States, Canada

Data Engineer II

EverCommerce

Software that Powers the Service Economy

Data Engineer30 days ago
Full TimeRemoteTeam 1,001-5,000Since 2016H1B Sponsor

Data Engineer II designing and scaling data platform for analytics and insights

AirflowApacheAWSCloudEC2KafkaPythonSQL
United States
$120K - $140K / year

Data Engineer

Keller Postman LLC

Clients First. Innovation Always. Excellence in Everything. One of the nation's fastest growing plaintiffs' law firms.

Data Engineer30 days ago
Full TimeRemoteTeam 51-200Since 2016H1B No Sponsor

Data Engineer designing and maintaining scalable data management systems for Keller Postman

AzureCloudETLKafkaPythonSQLTerraform
United States
$130K - $150K / year