PathAI

Improving patient outcomes with AI-powered pathology.

Staff Site Reliability Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteTeam 501-1,000Since 2016H1B SponsorCompany SiteLinkedIn

Location

Massachusetts

Posted

62 days ago

Salary

$165.8K - $224.5K / year

Bachelor Degree8 yrs expEnglishAnsibleAWSCloudGrafanaPrometheusPythonTerraform

Job Description

• Advancing the state of our operations by implementing SRE best practices - focusing on users, monitoring, and automation. • Engineering infrastructure patterns for cloud environments in Amazon Web Services - building in security, reliability and scalability. • Designing, building, and operating our data center to support our rapidly growing Machine Learning team. • Integrating on-premises datacenter environments with existing cloud infrastructure to create a seamless hybrid cloud environment. • Improving the reliability and resilience of our infrastructure through root-cause analysis and reviewing gaps in designs, and implementations of our infrastructure. • Participating in platform on-call rotations and assisting with urgent incident response.

Job Requirements

  • 8+ years of relevant experience.
  • Automation: You work hard to eliminate toil by automating everything through scripting, configuration management tools (Ansible), and code (Python/GoLang).
  • You’ve built monitoring infrastructure with modern observability tools (Datadog/Grafana/Prometheus).
  • You’ve worked with infrastructure as code (Terraform/Cloudformation).
  • You’ve administered physical hardware stacks in production settings (iDRAC/IPMI/Nvidia UFM/Juniper Systems).
  • You’re opinionated on storage solutions and how they can be optimized for high performance workloads (Quobyte/S3/FSx/EFS).
  • Familiarity with modern network designs and comfort operating across network layers.
  • Some experience and opinions on virtualization, containerization, or container orchestration platforms. (EKS/ClusterAPI/KVM).
  • Operations experience: You’ve managed critical production infrastructure and are familiar with incident response, scaling, and rapid growth related challenges.
  • A bachelor's degree in Computer Science or equivalent experience.
  • An insatiable intellectual curiosity and the ability to learn quickly in a complex space.
  • Travel: Willingness to travel up to 25% of the time.

Benefits

  • Not Overtime Eligible
  • Eligible for Equity

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Senior Staff DevOps Engineer

MetaMask

The World’s Leading Web3 Wallet

DevOps Engineer62 days ago
Full TimeRemoteTeam 51-200Since 2016H1B No Sponsor

DevSecOps Engineer at Consensys working on MetaMask and Infura platforms

AndroidAWSAzureCloudCyber SecurityFirewallsiOSJavaScriptKubernetesNode.jsPrometheusPythonTerraformTypeScript
United States
$160K - $218K / year

DevOps Engineer

Impiricus

The future of HCP-Pharma connectivity. Impiricus is the HCP-preferred platform to engage with Pharma.

DevOps Engineer62 days ago
Full TimeRemoteTeam 11-50Since 2020H1B No Sponsor

DevOps Engineer building and scaling cloud infrastructure for healthcare solutions at Impiricus

AWSCloudDockerEC2JenkinsKubernetesPythonRayTerraform
New York
$110K - $130K / year

Junior DevOps Engineer

eSimplicity

An engineering firm that delivers high-quality Healthcare IT, Cybersecurity, and Telecommunication solutions.

DevOps Engineer63 days ago
Full TimeRemoteTeam 51-200Since 2016H1B No Sponsor

Junior DevOps Engineer developing and maintaining CI/CD pipelines for eSimplicity

AnsibleAWSCloudRPATerraformVisualforce
Maryland
$75.2K / year

Reliability Engineer, Controls

Vantage Data Centers

Experience | Scalability | Efficiency By Design

DevOps Engineer63 days ago
Full TimeRemoteTeam 1,001-5,000Since 2010H1B Sponsor

Reliability Engineer controlling critical monitoring and control systems for Vantage Data Centers.

TCP/IP
Arizona