Chess.com

The world’s favorite place to play and learn Chess.

Engineering Lead, Systems Operations

Full-stack EngineerSoftware EngineerFull TimeRemoteTeam 501-1,000Since 2007H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

8 days ago

Salary

Not specified

5 yrs expEnglishAnsibleAWSAzureChefCloudDNSDockerGoogle Cloud PlatformGrafanaKubernetesLinuxNo SQLPrometheusPuppetTcp/ipTerraformUnix

Job Description

• Lead and mentor a team of 5-8 system operations engineers, providing technical guidance, career development, and performance management while demonstrating adaptive leadership styles and fostering a teachable culture of continuous learning • Define and execute the multi-year SysOps strategy with clear prioritization of critical initiatives, including multi-regional infrastructure architecture capable of handling millions of concurrent sessions across global data centers • Own the hybrid cloud migration roadmap, partnering with leadership to integrate bare-metal datacenter resources with cloud services for optimal performance and cost efficiency, delivering value through time-to-market optimization • Establish on-call rotation policies and incident response procedures with strong focus on work-life balance, ensuring rapid resolution of critical system issues while maintaining team health and high availability SLAs • Drive the implementation of monitoring, observability, and alerting systems that reach the right people at the right time, proactively identifying and resolving performance bottlenecks before they impact users and preventing organizational surprises • Partner with engineering leadership to implement infrastructure-as-code practices and establish deployment pipelines that support continuous integration and delivery, emphasizing quality with high first-time-right rates and low rework • Oversee capacity planning, load testing, and resource allocation strategies across distributed computing environments, demonstrating excellent time management and execution velocity while managing infrastructure budget and cost optimization • Champion security protocols and risk assessment procedures for infrastructure components and data protection with unwavering integrity, ensuring compliance with industry standards and earning trust across the organization • Collaborate with product and engineering leaders to design scalable solutions for high-traffic applications, valuing others' time by simplifying cross-team workflows and ensuring clear presentation of technical concepts to varied audiences • Lead automation initiatives that deliver measurable value to both internal and external customers, reducing manual operational overhead and improving system reliability through scripting and configuration management • Build authentic relationships with cross-functional teams and stakeholders, ensuring transparent communication of system health and aligning SysOps priorities with business objectives through excellent listening and presentation skills • Recruit, retain, and develop top engineering talent by understanding individual motivations and aligning team goals with personal drivers, fostering an inclusive culture where growth mindset principles guide decision-making and risk-taking • Demonstrate focus on commitments by managing distractions effectively, maintaining a strong track record of successful execution, and accumulating wins that build credibility and trust across the organization

Job Requirements

  • 5+ years of experience in system operations, DevOps, or infrastructure engineering roles with demonstrated excellence in execution and velocity
  • 2+ years of experience managing technical teams, including hiring, performance management, and career development with proven ability to identify and adapt leadership styles
  • Strong proficiency with UNIX/Linux operating systems and command-line administration
  • Deep experience with cloud platforms (GCP, AWS, or Azure) and infrastructure-as-code tools (Terraform, CloudFormation, or similar)
  • Hands-on experience with configuration management systems (Ansible, Chef, Puppet, or similar)
  • Solid understanding of networking fundamentals, protocols (TCP/IP, HTTP/HTTPS, DNS), and network troubleshooting
  • Experience with containerization and orchestration technologies (Docker, Kubernetes, or similar)
  • Proficiency with monitoring and observability tools (Datadog, Prometheus, Grafana, ELK stack, or similar)
  • Experience with relational and NoSQL databases, including performance optimization and scaling strategies
  • Excellent communication skills with proven ability to reach the right stakeholders, present complex technical concepts clearly, and listen effectively to understand diverse perspectives
  • Strong prioritization and time management skills, with ability to distinguish critical work from nice-to-have initiatives
  • Demonstrated integrity in decision-making, earning respect and trust from peers, direct reports, and senior leadership
  • Proven track record of building and scaling reliable systems and high-performing teams with high-quality outcomes and low maintenance costs
  • Growth mindset with ability to share ideals and risks positively, avoid fixed mindset behaviors, and remain teachable in all situations
  • Ability to understand what motivates individuals and teams, aligning work with intrinsic drivers to maximize engagement.

Benefits

  • 100% remote (always have been, always will be!)

Related Job Pages

More Full-stack Engineer Jobs

Senior Software Engineer

Station A

Station A is the world's first AI-powered clean energy marketplace.

Full-stack Engineer8 days ago
Full TimeRemoteTeam 11-50H1B No Sponsor

Senior Software Engineer owning product slices at Station A

PostgreSQLPythonReactSQLTypeScript
California + 5 moreAll locations: California, Nevada, New York, Oregon, Virginia, Washington
$138K - $155K / year

Software Engineer

Instructure, Inc.

At Instructure, we are dedicated to empowering EdTech providers and educational organizations to unlock their full potential through innovative technology solutions. Our mission is to provide intuitive products and services that simplify learning and personal development, foster meaningful relationships, and inspire progress in education and careers.

Full-stack Engineer8 days ago
Full TimeRemoteTeam 1,001-5,000

The engineer will contribute to designing and building production features across the stack using Rails, TypeScript, and React, while also implementing serverless/edge APIs and jobs on AWS. Responsibilities include collaborating on data modeling, ensuring internationalization support, and instrumenting services with observability tools.

United States
$75K - $109K / year

Software Engineer

Caris Life Sciences

Fulfilling the promise of precision medicine through quality and innovation.

Full-stack Engineer8 days ago
Full TimeRemoteTeam 1,001-5,000Since 2008H1B No Sponsor

The role involves executing the full software development life cycle, monitoring data pipelines and application infrastructures in AWS, and developing/integrating Gen AI solutions to enhance workflows. Responsibilities also include developing production-grade software systems leveraging AI/ML and building/enhancing full-stack applications using Python frameworks and React.

United States
$100K - $120K / year

Staff Software Development Engineer

CVS Health

Bringing our heart to every moment of your health.

Full-stack Engineer8 days ago
Full TimeRemoteTeam 10,001+Since 1963H1B No Sponsor

The Staff Software Development Engineer will lead efforts in developing end-to-end CI/CD pipelines, establishing clean security processes, building applications and CLI tools, and creating an out-of-the-box observability and deployment platform for enterprise teams. A specific focus will be on building a seamless CI/CD experience with a strong security posture and understanding of efficient cloud infrastructure.

United States
$106K - $260K / year