Transcend
Together, we can build the future.
Senior Site Reliability Engineer
Location
United States
Posted
8 days ago
Salary
$150K - $167K / year
Bachelor's Degree, 5 yrs exp, English, AWS, Cloud, JavaScript, Python, Terraform, TypeScript
Job Description
• Lead reliability-focused design and readiness reviews for new and existing services
• Build, operate, and continuously improve our observability stack
• Own and evolve incident management practices
• Plan and execute disaster recovery exercises and game days
• Perform capacity planning and cost optimization for our cloud infrastructure
• Identify and drive down systemic reliability risks across application, infrastructure, and process layers
• Collaborate closely with Developer Experience, Security, and product engineering to embed reliability best practices
• Participate in and help continuously improve the on-call rotation
Job Requirements
- 5+ years of experience in Site Reliability Engineering, Production Engineering, Infrastructure Engineering, or a closely related role
- Strong experience operating modern cloud infrastructure, ideally on AWS
- Proficiency with at least one programming language used at Transcend (e.g., JavaScript, TypeScript, or Python)
- Hands-on experience with infrastructure-as-code and CI/CD tooling (e.g., Terraform, CloudFormation)
- Deep familiarity with observability and monitoring systems (e.g., Datadog or equivalent)
- Proven track record running incident response and post-incident analysis
- Excellent communication and collaboration skills
- Comfort participating in an on-call rotation
- Minimum level of education: Bachelor’s degree in Computer Science, Engineering, Information Systems, or a related technical field
Benefits
- Flexible PTO
- Parental leave
- 401(k) match
- Competitive compensation package including employee equity