Senior Data & AI Platform Engineer

AI Engineer, Machine Learning Engineer, Full Time, Remote

Location

United States

Posted

23 days ago

Salary

Not specified

Python, AWS, Snowflake, API Design, Distributed Systems, OpenAI, Pinecone, Data Modeling, ETL, Performance Optimization, Vector Embeddings, Semantic Search, LLM APIs

Job Description

RevenueBase:

  • We're building the data infrastructure that makes AI agents trustworthy instead of error-prone.

  • We provide continuously refreshed, verified B2B data for autonomous AI agents and GTM workflows.

  • We've tripled growth while maintaining 100% gross dollar retention and staying cash-flow positive.

  • We power AI agents for Clay, ZoomInfo, Dun & Bradstreet, and the next generation of AI GTM tools.

About the Role

We are looking for a Senior Data & AI Platform Engineer to build internal tools and services on top of our large-scale data infrastructure. Your primary focus will be developing systems that leverage vector embeddings, LLM APIs, and semantic search to unlock value from structured and unstructured data.

This is a hands-on engineering role for someone who enjoys building practical AI-powered tools, not just experiments, and shipping them into production in a fast-moving startup environment.

What You’ll Do

  • Design and build data-driven tools that operate on large datasets stored in S3 and Snowflake

  • Implement pipelines that:

    • Extract specific columns or datasets from Snowflake

    • Generate vector embeddings via APIs such as OpenAI

    • Store and manage embeddings in vector databases like Pinecone

    • Enable semantic search and similarity-based retrieval

  • Develop enrichment workflows that:

    • Query structured data

    • Use LLM APIs to generate new derived columns

    • Write enriched results back into Snowflake

  • Build reusable internal services and SDKs around embedding generation, prompt orchestration, and data augmentation

  • Optimize performance and cost across AWS infrastructure

  • Work closely with product and data teams to turn use cases into scalable engineering solutions

  • Ensure reliability, observability, and maintainability of AI-powered pipelines

Example Projects

  • Tool to extract a single Snowflake column, generate embeddings, push to Pinecone, and expose a semantic search API

  • Batch enrichment pipeline that queries records from Snowflake, calls OpenAI APIs for structured enrichment, and writes new columns back

  • Internal framework for LLM-based data transformation and validation

  • Query abstraction layer to make AI-enhanced analytics accessible to non-engineering teams

Required Qualifications

  • 5+ years of software engineering experience

  • Strong backend engineering skills (Python preferred; other modern languages acceptable)

  • Solid experience with:

    • AWS (IAM, Lambda, ECS/EKS, S3, networking, security best practices)

    • Data warehousing (Snowflake preferred)

    • API design and distributed systems

  • Hands-on experience working with LLM APIs (e.g., OpenAI) and embedding workflows

  • Experience with vector databases (Pinecone or similar)

  • Strong understanding of data modeling, ETL/ELT patterns, and performance optimization

  • Production experience in at least one startup environment

  • Ability to operate independently and ship high-impact systems end-to-end

Nice to Have

  • Experience building internal developer platforms or data tooling

  • Familiarity with prompt engineering and evaluation pipelines

  • Experience with orchestration frameworks (Airflow, Prefect, Dagster)

  • Exposure to retrieval-augmented generation (RAG) systems

  • Infrastructure-as-code experience (Terraform, CDK)

  • Experience managing large-scale embedding refresh and re-indexing workflows

What Success Looks Like

  • Engineers and analysts can easily leverage AI-powered data enrichment

  • Embedding-based search works reliably at scale

  • New AI use cases can be implemented quickly using shared internal tooling

  • Systems are robust, observable, and cost-efficient

Why Join Us?

  • Work on practical, production-grade AI systems

  • Direct impact on how data is leveraged across the company

  • Startup speed with real ownership and autonomy

  • Opportunity to define the internal AI platform from the ground up
