Pragmatike
Remote first tech projects
Principal AI/ML Engineer
Machine Learning EngineerMachine Learning EngineerFull TimeRemoteTeam 11-50Since 2022Company SiteLinkedIn
Location
Massachusetts
Posted
16 days ago
Salary
Not specified
Bachelor DegreeEnglishAWSAzureCloudDistributed SystemsDockerGoogle Cloud PlatformKubernetesPythonPy TorchType ScriptGo
Job Description
• Architect, build, and scale the end-to-end ML Ops pipeline, including training, fine-tuning, evaluation, rollout, and monitoring.
• Design reliable infrastructure for model deployment, versioning, reproducibility, and orchestration across cloud and on-prem GPU clusters.
• Optimize compute usage across distributed systems (Kubernetes, autoscaling, caching, GPU allocation, checkpointing workflows).
• Lead the implementation of observability for ML systems (monitor drift, performance, throughput, reliability, cost).
• Build automated workflows for dataset curation, labeling, feature pipelines, evaluation, and CI/CD for ML models.
• Collaborate with researchers to productionize models and accelerate training/inference pipelines.
• Establish ML Ops best practices, internal standards, and cross-team tooling.
• Mentor engineers and influence architectural direction across the entire AI platform.
Job Requirements
- Deep hands-on experience designing and operating production ML systems at scale (Staff/Principal-level expected).
- Strong background in ML Ops, distributed systems, and cloud infrastructure (AWS, GCP, or Azure).
- Proficiency with Python and familiarity with TypeScript or Go for platform integration.
- Expertise in ML frameworks: PyTorch, Transformers, vLLM, Llama-factory, Megatron-LM, CUDA / GPU acceleration (practical understanding)
- Strong experience with containerization and orchestration (Docker, Kubernetes, Helm, autoscaling).
- Deep understanding of ML lifecycle workflows: training, fine-tuning, evaluation, inference, model registries.
- Ability to lead technical strategy, collaborate cross-functionally, and operate in fast-paced environments
Benefits
- Competitive salary & equity options
- Sign-on bonus
- Health, Dental, and Vision
- 401k
Related Guides
Related Job Pages
More Machine Learning Engineer Jobs
Staff Software Engineer, Machine Learning
Smarter TechnologiesThe Automation and Insights Platform for Healthcare Efficiency
Machine Learning Engineer16 days ago
Full TimeRemoteTeam 10,001+Since 2025
ML Engineer enhancing AI-driven solutions for healthcare efficiency.
AWSCloudPythonTypeScript
Principal Machine Learning Engineer, ML Platform
ShippoWe help eCommerce merchants grow by empowering them with the #1 shipping solution tool needed to save time and money.
Machine Learning Engineer16 days ago
Full TimeRemoteTeam 201-500Since 2013
Principal ML Platform Engineer developing scalable ML solutions for Shippo
Distributed SystemsKubernetes
Hawaii + 6 moreAll locations: Hawaii, Nevada, New Mexico, Ohio, Oregon, Virginia, West Virginia
$212K - $287K / year
Machine Learning Engineer16 days ago
Full TimeRemoteTeam 11-50Since 2019
Machine Learning Engineer developing ML solutions for a Fortune 500 payment platform
PythonPyTorchScikit-LearnSparkSQLTensorflow
Machine Learning Engineer16 days ago
Full TimeRemoteTeam 11-50Since 2019
Senior Machine Learning Engineer collaborating in ML infrastructure at Maxana
AirflowDockerKubernetesPythonPyTorchScalaSparkTensorflow