Replicated
We help software vendors ship their apps to complex customer environments using Kubernetes and Helm.
Senior Customer Reliability Engineer
DevOps EngineerDevOps EngineerFull TimeRemoteTeam 51-200Since 2017H1B No SponsorCompany SiteLinkedIn
Location
United States
Posted
172 days ago
Salary
$149.5K - $192.5K / year
3 yrs expEnglishKubernetesLinuxGo
Job Description
• Provide expert support to customers, resolving issues related to Kubernetes, Linux, and Replicated products, including troubleshooting failures and identifying root causes
• Work proactively with customers to ensure successful deployment, management, and scaling of applications using Replicated, providing guidance, best practices, training, and onboarding assistance
• Collaborate closely with CREs and product engineers to share customer feedback, identify product improvements, and contribute to the product roadmap
• Contribute to tooling and best practices that empower internal teams and vendors; opportunities to develop coding skills and make code contributions over time
• Participate in on-call rotation to provide support coverage for Replicated products
• Build deep expertise in customer-managed deployments, including cluster installation scenarios, and help vendors operationalize Kubernetes applications
• Drive continuous learning and professional growth, leveraging company-provided training, certifications, and curiosity/professional development budgets
• Participate in documentation review, process improvement, and vendor interaction to improve support workflows and product usability
Job Requirements
- Preferably 3 or more years of professional experience
- Experience with Linux system administration and ability to troubleshoot complex system and network issues at an advanced level
- Experience with Kubernetes and Helm, including diagnosing complex issues on bare metal and developing/troubleshooting advanced Helm charts
- Exceptional technical and non-technical communication and interpersonal skills in English
- Strong problem-solving skills and ability to think critically and act quickly under pressure
- Customer-centric mindset and a genuine desire to help others succeed
- Experience working remotely with teams across various time zones
- Willingness to participate in on-call support coverage
- Nice to haves: Experience with CNCF tools
- Nice to haves: Familiarity with Go and ability to debug Go programs
- Nice to haves: Customer-facing experience
- Preferred remote location: Australia or New Zealand (applicants must have legal right to work there)
- Note: Replicated cannot provide US sponsorship at this time (applicants must be legally authorized to work in the United States)
Benefits
- Health/Dental/Vision
- Life/AD&D
- LTD/STD
- FSA
- 401K
- Stock options
- Partner perk programs
- Generous time off, we expect you to take a minimum of 3 weeks of per year
- Laptop+accessories you need to get set up
- Generous home office set up allowance or co-working space allowance - up to $10,000 per year!
- Curiosity Budget to help you keep learning and growing!
- Professional development budget
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior DevOps Engineer
DistantJobRemote Recruitment Agency®. Find your next superstar remote developer in under 3 weeks.
DevOps Engineer172 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor
Senior DevOps Engineer building AWS infrastructure for AI brand-safety contextual intelligence.
AWSCloudDockerGrafanaJenkinsKubernetesPrometheusPythonTerraformGo
United States
Full Stack Developer – DevOps Engineer
NextLink GroupIT services specialists since 1996. We enable success through simplicity, flexibility, and innovation.
DevOps Engineer173 days ago
ContractRemoteTeam 201-500Since 1996H1B No Sponsor
Full Stack Engineer developing and maintaining Azure applications
AnsibleAzureCloud
United States
Site Reliability Engineer
TC IoT SolutionsIoT Solutions is a Telit Cinterion business unit. Mobilogix is a retired brand.
DevOps Engineer175 days ago
Full TimeRemoteTeam 501-1,000Since 1986H1B No Sponsor
SRE ensuring reliability of Telit Cinterion's IoT platforms and critical applications.
AnsibleAWSCloudElasticSearchIoTJavaScriptJenkinsKubernetesLinuxPuppet
Florida