Transcend

Together, we can build the future.

Senior Site Reliability Engineer

Full Time | Remote | Team 51-200 | Since 2015

Location

United States

Posted

8 days ago

Salary

$150K - $167K / year

Bachelor's Degree | 5 yrs exp | English | AWS | Cloud | JavaScript | Python | Terraform | TypeScript

Job Description

  • Lead reliability-focused design and readiness reviews for new and existing services
  • Build, operate, and continuously improve our observability stack
  • Own and evolve incident management practices
  • Plan and execute disaster recovery exercises and game days
  • Perform capacity planning and cost optimization for our cloud infrastructure
  • Identify and drive down systemic reliability risks across application, infrastructure, and process layers
  • Collaborate closely with Developer Experience, Security, and product engineering to embed reliability best practices
  • Participate in and help continuously improve the on-call rotation

Job Requirements

  • 5+ years of experience in Site Reliability Engineering, Production Engineering, Infrastructure Engineering, or a closely related role
  • Strong experience operating modern cloud infrastructure, ideally on AWS
  • Proficiency with at least one programming language used at Transcend (e.g., JavaScript, TypeScript, or Python)
  • Hands-on experience with infrastructure-as-code and CI/CD tooling (e.g., Terraform, CloudFormation)
  • Deep familiarity with observability and monitoring systems (e.g., Datadog or equivalent)
  • Proven track record running incident response and post-incident analysis
  • Excellent communication and collaboration skills
  • Comfort participating in an on-call rotation
  • Minimum level of education: Bachelor’s degree in Computer Science, Engineering, Information Systems, or a related technical field

Benefits

  • Flexible PTO
  • Parental leave
  • 401(k) match
  • Competitive compensation package including employee equity
