Pragmatike

Remote first tech projects

Principal AI/ML Engineer

Machine Learning EngineerMachine Learning EngineerFull TimeRemoteTeam 11-50Since 2022Company SiteLinkedIn

Location

Massachusetts

Posted

16 days ago

Salary

Not specified

Bachelor DegreeEnglishAWSAzureCloudDistributed SystemsDockerGoogle Cloud PlatformKubernetesPythonPy TorchType ScriptGo

Job Description

• Architect, build, and scale the end-to-end ML Ops pipeline, including training, fine-tuning, evaluation, rollout, and monitoring. • Design reliable infrastructure for model deployment, versioning, reproducibility, and orchestration across cloud and on-prem GPU clusters. • Optimize compute usage across distributed systems (Kubernetes, autoscaling, caching, GPU allocation, checkpointing workflows). • Lead the implementation of observability for ML systems (monitor drift, performance, throughput, reliability, cost). • Build automated workflows for dataset curation, labeling, feature pipelines, evaluation, and CI/CD for ML models. • Collaborate with researchers to productionize models and accelerate training/inference pipelines. • Establish ML Ops best practices, internal standards, and cross-team tooling. • Mentor engineers and influence architectural direction across the entire AI platform.

Job Requirements

  • Deep hands-on experience designing and operating production ML systems at scale (Staff/Principal-level expected).
  • Strong background in ML Ops, distributed systems, and cloud infrastructure (AWS, GCP, or Azure).
  • Proficiency with Python and familiarity with TypeScript or Go for platform integration.
  • Expertise in ML frameworks: PyTorch, Transformers, vLLM, Llama-factory, Megatron-LM, CUDA / GPU acceleration (practical understanding)
  • Strong experience with containerization and orchestration (Docker, Kubernetes, Helm, autoscaling).
  • Deep understanding of ML lifecycle workflows: training, fine-tuning, evaluation, inference, model registries.
  • Ability to lead technical strategy, collaborate cross-functionally, and operate in fast-paced environments

Benefits

  • Competitive salary & equity options
  • Sign-on bonus
  • Health, Dental, and Vision
  • 401k

Related Job Pages

More Machine Learning Engineer Jobs

Staff Software Engineer, Machine Learning

Smarter Technologies

The Automation and Insights Platform for Healthcare Efficiency

Machine Learning Engineer16 days ago
Full TimeRemoteTeam 10,001+Since 2025

ML Engineer enhancing AI-driven solutions for healthcare efficiency.

AWSCloudPythonTypeScript
Texas
$230K - $280K / year

Principal Machine Learning Engineer, ML Platform

Shippo

We help eCommerce merchants grow by empowering them with the #1 shipping solution tool needed to save time and money.

Machine Learning Engineer16 days ago
Full TimeRemoteTeam 201-500Since 2013

Principal ML Platform Engineer developing scalable ML solutions for Shippo

Distributed SystemsKubernetes
Hawaii + 6 moreAll locations: Hawaii, Nevada, New Mexico, Ohio, Oregon, Virginia, West Virginia
$212K - $287K / year

Machine Learning Engineer

MaxanaPay

Empowering businesses one transaction at a time

Machine Learning Engineer16 days ago
Full TimeRemoteTeam 11-50Since 2019

Machine Learning Engineer developing ML solutions for a Fortune 500 payment platform

PythonPyTorchScikit-LearnSparkSQLTensorflow
United States
$150K - $160K / year

Senior Machine Learning Engineer

MaxanaPay

Empowering businesses one transaction at a time

Machine Learning Engineer16 days ago
Full TimeRemoteTeam 11-50Since 2019

Senior Machine Learning Engineer collaborating in ML infrastructure at Maxana

AirflowDockerKubernetesPythonPyTorchScalaSparkTensorflow
United States
$120K - $160K / year