Evidation
Health, powered by you.
Senior DevOps Engineer
Location
California
Posted
82 days ago
Salary
$160K - $200K / year
8 yrs expEnglishAWSCloudDistributed SystemsDockerKubernetesPythonRubyTerraformGo
Job Description
• Design, build, and maintain highly available, scalable infrastructure on AWS using Infrastructure as code.
• Design and operate multi-tenant Kubernetes environments running on EKS, including cluster operations, workload management, autoscaling, and cost-optimized configurations.
• Drive Infrastructure-as-Code (IaC) best practices using Terraform and Pulumi, including modularization, testing, versioning, and safe deployment patterns.
• Contribute to CI/CD ecosystem using GitHub Actions, reusable workflows, and secure secrets management; ensure fast, resilient, and traceable deployment pipelines.
• Build and maintain containerization based software delivery pipeline leveraging Docker, Helm charts, and Github workflows.
• Define and continuously improve monitoring, alerting, dashboards, and logging using Datadog.
• Evaluate operational data to identify performance, stability, and cost-efficiency opportunities.
• Provide advanced support for major incidents, performing root cause analysis, writing clear postmortems, and ensuring long-term corrective actions.
• Apply a security-first mindset to infrastructure architecture, IAM, network boundaries, and workload configurations.
• Implement work in alignment to controls in support of ISO 27001, SOC 2, HIPAA, and other regulated requirements.
• Collaborate with Security to operationalize secure-by-default infrastructure patterns.
• Collaborate with Engineering, Data, and Delivery teams to define requirements, translate technical needs, and deliver scalable solutions.
• Facilitate knowledge sharing through documentation, playbooks, incident reviews, and architectural discussions.
• Identify opportunities to add value beyond immediate requests—improving reliability, simplifying processes, and reducing operational load.
Job Requirements
- 8+ years of DevOps, SRE, Platform Engineering, or relevant experience supporting production cloud systems.
- Expert-level experience with AWS services.
- Expert-level experience managing Kubernetes environments, including Helm, KEDA, cluster lifecycle, and multi-environment deployments.
- Advanced CI/CD experience using GitHub Actions (workflows, reusable workflows, OIDC auth, environments) or similar technology.
- Expert-level containerization skills (Docker, image optimization, registry management).
- Strong proficiency with Terraform and Pulumi for Infrastructure as Code.
- Hands-on experience with AI-assisted development tools (VSCode, GitHub Copilot, code generation workflows).
- Strong proficiency with scripting and coding automation tools.
- Experience in more than one of: Bash, Python, Ruby, or Go.
- Experience building reliable, observable systems using Datadog (metrics, logs, traces, monitors) or similar solution.
- Strong understanding of distributed systems, networking, autoscaling, and operational patterns in cloud-native architectures.
- Strong debugging, problem-solving, and incident response skills across complex, multi-service systems.
Benefits
- salary + bonus + equity + benefits
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevSecOps Engineer
MeridianLinkConnecting You to Better: MeridianLink is the developer of the industry's first multi-channel loan origination system.
DevOps Engineer82 days ago
Full TimeRemoteTeam 501-1,000Since 1998H1B Sponsor
DevSecOps Engineer responsible for managing security in software development
DNSDockerJavaLinuxNGINXPython
DevOps Engineer85 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor
Open this job to view full details and requirements.
AWSAzureCloudDockerGoogle Cloud PlatformJenkinsKubernetesPythonRubyTerraformGo
United States
DevOps Engineer85 days ago
Full TimeRemoteTeam 201-500Since 2012H1B No Sponsor
Staff Site Reliability Engineer leading infrastructure reliability strategy
AWSDistributed SystemsJavaScriptPythonRubyTerraform
DevOps Engineer85 days ago
Full TimeRemoteTeam 201-500Since 2012H1B No Sponsor
Senior Site Reliability Engineer for building and maintaining secure infrastructure at Bugcrowd
AWSDockerGrafanaJavaScriptKotlinPrometheusPythonRubyTerraformGo