BrightHire

Elevating the human side of hiring. We're hiring!

Senior Site Reliability Engineer

DevOps Engineer | Full Time | Remote | Team 11-50 | Since 2019 | H1B Sponsor

Location

United States

Posted

94 days ago

Salary

Not specified

English, Elasticsearch, Grafana, Kubernetes, Prometheus, Python, SQL

Job Description

  • You will own the end-to-end reliability and performance of many of our most critical systems.
  • Working in lockstep with Product and Engineering, you will design, build, and refine the platform that our application and AI features run on, from Kubernetes and databases through CI/CD and observability.
  • You will focus on keeping our systems fast, reliable, and easy for developers to work with.
  • You will work on real infrastructure that supports features people use every day. That work includes:
      • Continuing to improve and iterate on our observability stack, which includes Kibana, Grafana, OTel, and Elastic.
      • Improving database performance by analyzing slow and high-volume queries, tuning indexes, optimizing query patterns and timing, and recommending schema and code changes to keep QPS and latency low.
      • Making Kubernetes improvements and upgrades, including deploying new services, improving resource utilization, tightening security, and standardizing deployment patterns across teams.
      • Improving CI/CD pipelines for both backend and frontend services so engineers can ship quickly and safely, with clear feedback loops, fast build times, and reliable rollbacks.
      • Enhancing the local developer experience so that running and debugging the app locally feels fast, consistent, and representative of production.
      • Helping improve CI/CD and observability for our ML pipeline and models, bringing MLOps best practices into our existing infrastructure.

Job Requirements

  • You have real-world experience running production systems and doing SRE, Platform, or DevOps work for web applications or APIs.
  • You are comfortable working across Kubernetes, CI/CD, databases, and backend services, and you enjoy owning problems end to end.
  • You have strong experience with Kubernetes in production environments, including cluster upgrades, workload deployments, scaling, and debugging.
  • You have experience with observability stacks (such as Elasticsearch and Kibana, Prometheus, Grafana, or similar) and can lead efforts like upgrading Kibana to new major versions and improving logs, metrics, and dashboards.
  • You have worked deeply with relational databases and SQL, know how to profile slow queries, design and tune indexes, and work with engineers to adjust query patterns, timing, and frequency to improve performance.
  • You are comfortable in at least one backend language (e.g., Python) and can read and modify application code to support infra and performance improvements.
  • You have experience improving CI/CD pipelines, including build and test speed, deployment workflows, and release strategies (such as blue/green or canary).
  • You have worked with infrastructure-as-code tools or similar patterns to manage environments in a repeatable way.
  • You think deeply about developer experience and reliability and use both metrics and empathy to guide your decisions.
  • You care about security, resiliency, and cost as integral aspects of the systems you build and manage.
  • You move fast and independently, but you know when to pull in teammates for pairing, reviews, or cross-team alignment.

Benefits

  • Flexible working hours
  • Professional development opportunities
  • Remote work options
  • Strong observability
