The Data Hub Between Lenders and Capital Markets
MLOps Platform Engineer
Location
United States
Posted
30 days ago
Salary
$185K - $200K / year
Job Description
Job Requirements
- A senior cloud and platform engineer: You have 8+ years of experience in cloud infrastructure, DevOps, or platform engineering roles, with deep expertise designing and operating distributed systems in production.
- Experienced with MLOps and agentic platforms: You have direct exposure to ML/GenAIOps practices, such as monitoring, anomaly detection, predictive alerting, or automated remediation, applied to real production systems. 5+ years of MLOps experience is required.
- Strong in cloud-native infrastructure: You are proficient in building and managing cloud environments, Kubernetes, containerized workloads and infrastructure-as-code tools such as Terraform.
- Comfortable supporting AI workloads: You have hands-on experience supporting platforms that host and run deep neural networks, including LLM runtimes (e.g., vLLM, llama.cpp), ML compiler stacks (e.g., LLVM/MLIR), and PyTorch-based production systems.
- Security- and operations-minded: You have a strong understanding of infrastructure security, IAM, secrets management, and operational risk as it relates to AI-enabled systems.
- A platform-focused technical leader: You operate effectively as a technical leader, influencing architecture and standards while remaining hands-on. You communicate clearly, collaborate well cross-functionally, and thrive in ambiguous problem spaces.
- Forward-thinking and pragmatic: You are proactive and innovative, with the ability to introduce emerging agentic patterns while balancing operational maturity and long-term maintainability. You will help design and operate scalable benchmarking and evaluation frameworks for agentic AI systems, enabling quantitative measurement of accuracy, reliability, cost–performance tradeoffs, regression detection, and the impact of model, prompt, or architecture changes (including techniques such as LLM-as-a-judge), with tooling that is reusable and accessible across the organization.
- Nice To Have: Experience with Pulumi, GCP and Cloudflare, GHA and Harness, and Go; experience supporting data engineering platforms; exposure to data warehousing and ETL/ELT tools or operations.
Benefits
- Unlimited PTO. Unplug and rejuvenate, however you want—whether that’s vacationing on the beach or at home on a mental-health day.
- $1,000 Learning & Development Fund. No matter where you are in your career, always invest in your future. We encourage you to attend conferences, take classes, and lead workshops. We also host hackathons, brunch & learns, and other employee-led learning opportunities.
- Remote-First Environment. People thrive in a flexible and supportive environment that best invigorates them. You can work from your home, cafe, or hotel. You decide.
- Health Care and Financial Planning. We offer a comprehensive medical, dental, and vision insurance package for you and your family. We also offer a 401(k) that you can contribute to.
- Stay active your way! Get $138/month to put toward your favorite gym or fitness membership — wherever you like to work out. Prefer to exercise at home? You can also use up to $1,650 per year through our Fitness Fund to purchase workout equipment, gear, or other wellness essentials.
- New Family Bonding. Primary caregivers can take 16 weeks of 100% paid leave, while secondary caregivers can take 4 weeks. Returning to work after bringing home a new child isn’t easy, which is why we’re flexible and empathetic to the needs of new parents.