PathAI
Improving patient outcomes with AI-powered pathology.
Staff Site Reliability Engineer
DevOps Engineer, Full Time, Remote, Team 501-1,000, Since 2016, H1B Sponsor
Location
Massachusetts
Posted
62 days ago
Salary
$165.8K - $224.5K / year
Bachelor Degree, 8 yrs exp, English, Ansible, AWS, Cloud, Grafana, Prometheus, Python, Terraform
Job Description
• Advancing the state of our operations by implementing SRE best practices - focusing on users, monitoring, and automation.
• Engineering infrastructure patterns for cloud environments in Amazon Web Services - building in security, reliability and scalability.
• Designing, building, and operating our data center to support our rapidly growing Machine Learning team.
• Integrating on-premises datacenter environments with existing cloud infrastructure to create a seamless hybrid cloud environment.
• Improving the reliability and resilience of our infrastructure through root-cause analysis and by reviewing gaps in its design and implementation.
• Participating in platform on-call rotations and assisting with urgent incident response.
Job Requirements
- 8+ years of relevant experience.
- Automation: You work hard to eliminate toil by automating everything through scripting, configuration management tools (Ansible), and code (Python/GoLang).
- You’ve built monitoring infrastructure with modern observability tools (Datadog/Grafana/Prometheus).
- You’ve worked with infrastructure as code (Terraform/CloudFormation).
- You’ve administered physical hardware stacks in production settings (iDRAC/IPMI/Nvidia UFM/Juniper Systems).
- You’re opinionated on storage solutions and how they can be optimized for high-performance workloads (Quobyte/S3/FSx/EFS).
- Familiarity with modern network designs and comfort operating across network layers.
- Some experience with and opinions on virtualization, containerization, or container orchestration platforms (EKS/Cluster API/KVM).
- Operations experience: You’ve managed critical production infrastructure and are familiar with incident response, scaling, and the challenges of rapid growth.
- A bachelor's degree in Computer Science or equivalent experience.
- An insatiable intellectual curiosity and the ability to learn quickly in a complex space.
- Travel: Willingness to travel up to 25% of the time.
Benefits
- Not Overtime Eligible
- Eligible for Equity