Zscaler

We make it easy to secure your cloud transformation. Get fast, secure, and direct access to apps without appliances.

Staff Site Reliability Engineer

DevOps Engineer, Full Time, Remote, Team 5,001-10,000, Since 2008, H1B Sponsor

Location

California

Posted

4 days ago

Salary

$119K - $170K / year

Bachelor Degree, 9 yrs exp, English, Bash, DNS, Firewalls, Grafana, HTTP, ICMP, Load Balancing, Nagios, OSI Model, Prometheus, Python, TCP/IP

Job Description

About Zscaler

Zscaler is a pioneer and global leader in zero trust security. The world’s largest businesses, critical infrastructure organizations, and government agencies rely on Zscaler to secure users, branches, applications, data & devices, and to accelerate digital transformation initiatives. Distributed across more than 160 data centers globally, the Zscaler Zero Trust Exchange platform combined with advanced AI combats billions of cyber threats and policy violations every day and unlocks productivity gains for modern enterprises by reducing costs and complexity.

Here, impact in your role matters more than title and trust is built on results. We believe in transparency and value constructive, honest debate—we’re focused on getting to the best ideas, faster. We build high-performing teams that can make an impact quickly and with high quality. To do this, we are building a culture of execution centered on customer obsession, collaboration, ownership and accountability. 

We champion an “AI Forward, People First” philosophy to help us accelerate and innovate, empowering our people to embrace their potential. If you’re driven by purpose, thrive on solving complex challenges and want to make a positive difference on a global scale, we invite you to bring your talents to Zscaler to help shape the future of cybersecurity.

Role

We are looking for a Staff Site Reliability Engineer to join our team. This role reports to the Senior Manager, Site Reliability Engineering, and can be performed either hybrid (3 days a week) out of San Jose, CA, or fully remote.

As a key member of the Zero Trust Exchange team, you will be responsible for all aspects of the Zscaler production data center services, including servers, operating systems, storage, and supporting systems. You will be an instrumental part of the Site Reliability Engineering team, ensuring the availability, latency, performance, efficiency, and scalability of a cloud that processes tens of billions of transactions daily.

What you’ll do (Role Expectations)

  • Own the reliability of a large-scale cloud service (Linux/BSD, bare metal, Kubernetes, custom load balancing, SD-WAN) by partnering with Engineering and Network teams to define requirements early, conduct operability reviews, and contribute code/design docs for platform resilience
  • Develop and operate end-to-end observability (metrics/logs/traces, dashboards, alerting) and incident tooling to manage SLOs/error budgets, reduce noise, and improve system detection and diagnosis
  • Participate in an on-call rotation to lead full-cycle incident response; perform deep cross-stack troubleshooting (OS, networking, distributed systems, packet captures, core dumps) to drive permanent software fixes and codify learnings into runbooks and tests
  • Build and maintain everything-as-code for fleet and service lifecycle, driving provisioning, configuration, release automation, canary deployments, and complex rollout/rollback workflows
  • Continuously improve platform hygiene through consistent OS/app upgrades, dependency/vulnerability patching, capacity and performance tuning, and strict CI/CD validation prior to production rollouts

Who You Are (Success Profile)

  • You act like an owner. Your passion for the mission fuels your bias for action. You operate with integrity because you genuinely care about the outcome. You adapt to what’s needed, navigating seamlessly between high-level strategy and hands-on execution.
  • You are a problem-solver. You seek out challenges because you are energized by finding solutions, knowing that solving the hard problems delivers the biggest impact.
  • You are a high-trust collaborator. You are ambitious for the team, not just yourself. You embrace our challenge culture by giving and receiving ongoing feedback—knowing that candor delivered with clarity and respect is the truest form of teamwork and the fastest way to earn trust.
  • You operate with urgency. You understand that in a high-growth environment, speed and quality are not mutually exclusive. You have a relentless focus on execution and a bias for action, delivering high-impact results quickly to win for the customer and the team.
  • You think at scale. You connect your day-to-day work to the larger company mission and think globally. You build solutions, processes, and teams that are not just effective today but are built to last and support a high-growth, global organization.

What We’re Looking for (Minimum Qualifications)

  • US citizenship is required (due to the nature of assigned customers), along with 5+ years of industry experience in software engineering, infrastructure software, and/or platform engineering
  • Proficiency in at least one programming language (such as Python, Bash, or Go) with demonstrated ability to write production-quality code (testing, code reviews, CI, maintainable design, scripting for diagnostics)
  • Strong Linux/Unix systems fundamentals (process/memory, filesystems, networking stack basics, debugging/perf troubleshooting) and solid understanding of networking protocols and components (e.g., HTTP, DNS, TCP/IP, ICMP, OSI model, subnetting, and load balancing/traffic concepts)
  • Proven experience operating production services (including incident response, troubleshooting, reducing toil) and ability to participate in on-call rotations and support occasional after-hours or weekend deployments
  • Experience managing BSD in production, with a focus on driving systemic fixes through platform engineering

What Will Make You Stand Out (Preferred Qualifications)

  • Proven expertise in operating Kubernetes at scale
  • Deep experience with the Prometheus/OpenTelemetry ecosystems, including instrumenting golden signals, defining SLOs, and performing alert tuning to ensure high-availability environments


Zscaler’s salary ranges are benchmarked and are determined by role and level. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations and could be higher or lower based on a multitude of factors, including job-related skills, experience, and relevant education or training.

The base salary range listed for this full-time position excludes commission, bonus, and equity (if applicable), as well as benefits.

Base Pay Range

$119,000 - $170,000 USD

At Zscaler, we are committed to building a team that reflects the communities we serve and the customers we work with. We foster an inclusive environment that values all backgrounds and perspectives, emphasizing collaboration and belonging. Join us in our mission to make doing business seamless and secure.

Our Benefits program is one of the most important ways we support our employees. Zscaler proudly offers comprehensive and inclusive benefits to meet the diverse needs of our employees and their families throughout their life stages, including:

  • Various health plans
  • Time off plans for vacation and sick time
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks, and more!

Learn more about Zscaler’s Future of Work strategy, hybrid working model, and benefits here.

By applying for this role, you adhere to applicable laws, regulations, and Zscaler policies, including those related to security and privacy standards and guidelines.

Zscaler is committed to providing equal employment opportunities to all individuals. We strive to create a workplace where employees are treated with respect and have the chance to succeed. All qualified applicants will be considered for employment without regard to race, color, religion, sex (including pregnancy or related medical conditions), age, national origin, sexual orientation, gender identity or expression, genetic information, disability status, protected veteran status, or any other characteristic protected by federal, state, or local laws. See more information by clicking on the Know Your Rights: Workplace Discrimination is Illegal link.

Pay Transparency

Zscaler complies with all applicable federal, state, and local pay transparency rules.

Zscaler is committed to providing reasonable support (called accommodations or adjustments) in our recruiting processes for candidates who are differently abled, have long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support.

Benefits

  • 401(K), 401(K) matching, Adoption Assistance, Company equity, Company sponsored family events, Dedicated diversity and inclusion staff, Dental insurance, Disability insurance, Volunteer in local community, Employee stock purchase plan, Family medical leave, Generous parental leave, Health insurance, Life insurance, Charitable contribution matching, Mentorship program, Open office floor plan, Paid sick days, Onsite office parking, Partners with nonprofits, Performance bonus, Pet insurance, Lunch and learns, Free snacks and drinks, Team based strategic planning, OKR operational model, Tuition reimbursement, Mandated unconscious bias training, Vision insurance, Wellness programs, Some meals provided, Mental health benefits, Diversity employee resource groups, Hiring practices that promote diversity, Fertility benefits, Employee resource groups, Hybrid work model, President's club, Employee awards, Diversity recruitment program, Pension, Transgender health care benefits, Mother's room, Personal development training, Flexible time off, Bereavement leave benefits
