Staff Site Reliability Engineer

DevOps Engineer | Full Time | Remote | Team 501-1,000 | Since 2015 | H1B Sponsor

Location

New York

Posted

37 days ago

Salary

Not specified

Bachelor Degree | 7 yrs exp | English | Ansible | AWS | Azure | Cloud | DNS | Google Cloud Platform | Kubernetes | Linux | Python | TCP/IP | Terraform | Go

Job Description

  • Establish and evolve SRE best practices across the organization.
  • Define and drive observability strategy for system health, performance, and reliability.
  • Design and implement software-driven solutions within the infrastructure domain.
  • Act as a technical leader and force multiplier, helping set priorities and influencing decision-making.
  • Take ownership of large, ambiguous initiatives, driving them from concept to delivery.
  • Combine deep knowledge of software development, infrastructure, and security to improve platform resilience.
  • Proactively identify systemic risks and reliability gaps.
  • Partner with engineering teams to improve developer workflows, tooling, and operational maturity.
  • Provide technical mentorship, architecture guidance, and high-quality design and code reviews.

Job Requirements

  • Bachelor’s or Master’s degree in Computer Science or equivalent practical experience.
  • 7+ years of experience in site reliability engineering, infrastructure engineering, or platform engineering roles, with demonstrated impact at scale.
  • Expert-level, methodical troubleshooting across the entire stack.
  • Strong command-line proficiency and deep expertise in Linux systems and operating system fundamentals.
  • Advanced understanding of networking concepts including load balancing, proxies, DNS, TCP/IP, NAT, and service-to-service communication.
  • Experience working across multiple languages (e.g., Python, Go, Bash).
  • Strong track record of automating repetitive and complex operational work to reduce toil and increase reliability.
  • Ability to design and build internal tools (Python or Go) that standardize and scale engineering practices.
  • Deep experience with cloud platforms (AWS preferred, GCP/Azure acceptable).
  • Strong expertise in Kubernetes and container orchestration (EKS, Helm).
  • Experience designing and maintaining company-wide IaC codebases using tools such as Terraform, Pulumi, CloudFormation, or Ansible.

Benefits

  • Health insurance
  • Professional development
  • Remote work options

More DevOps Engineer Jobs

DevOps Engineer

Arine

Arine optimizes medication to ensure each patient is on the safest, most effective therapy for their unique health needs

DevOps Engineer | 37 days ago
Full Time | Remote | Team 11-50 | H1B No Sponsor

DevOps Engineer automating cloud infrastructure and delivery pipelines for a healthcare tech company

AWS | Cloud | Docker | EC2 | Jenkins | Kubernetes | Linux | Python | SDLC | Shell Scripting | Terraform
United States
$120K - $150K / year
DevOps Engineer | 37 days ago
Full Time | Remote | Team 501-1,000 | H1B No Sponsor

Senior DevOps Engineer managing AWS CI/CD pipelines for federal consulting

AWS | Cloud | Linux | Terraform
Virginia
Full Time | Remote | Team 501-1,000 | H1B No Sponsor

Public Trust Eligibility Required. This is a contingent position, meaning employment is dependent upon the successful award of the associated contract to Aretum and completion of any required background investigation or security clearance verification...

Virginia
DevOps Engineer | 37 days ago
Full Time | Remote | Team 10,001+ | Since 1979 | H1B Sponsor

Senior Software Engineer improving cloud-based systems reliability and performance

AWS | Cloud | Grafana | Prometheus | Python | Terraform
United States