Sezzle
Financially empowering the next generation of consumers.
Principal Site Reliability Engineer
Location
United States
Posted
40 days ago
Salary
$200K - $260K / year
Bachelor Degree, 12 yrs exp, English, AWS, Distributed Systems, Grafana, Kubernetes, Microservices, MySQL, Postgres, Prometheus, RDBMS, SQL, Go
Job Description
• Architect, upgrade, design, and build scalable infrastructure solutions leveraging Kubernetes, AWS, RDS (MySQL/Postgres), and modern distributed patterns.
• Help drive the infrastructure team’s roadmap, leading us to higher levels of reliability, recoverability, and scalability.
• Drive capacity planning, benchmarking, and work with the team to stress test our systems, find bottlenecks, and prepare for further growth in the business.
• Define, maintain and enforce SLAs and alerts across our infrastructure.
• Lead the teams toward stronger signal anomaly detection and better, more flexible alerting.
• Help lead Sezzle’s AI enablement efforts, identifying opportunities to apply AI and automation to enhance infrastructure reliability, developer productivity, and internal tooling.
• Build in consistency and scalability across a distributed microservices architecture while maintaining performance and reliability.
• Establish and evolve engineering best practices for observability, security, and CI/CD across teams.
• Mentor engineers and champion a culture of learning, innovation, and operational excellence.
• Collaborate cross-functionally to translate business goals into technical roadmaps and deliver results that matter.
Job Requirements
- 12+ years of professional software engineering or infrastructure engineering experience, including significant SRE and backend experience.
- Deployed significant changes to a production application or infrastructure configuration in the past 30 days.
- Strong proficiency in Golang, with experience building and maintaining RESTful APIs.
- Expertise with SQL-based RDBMS (MySQL, PostgreSQL) and experience optimizing schema and queries for performance at scale.
- Proficiency in observability tools (Prometheus, Grafana, Datadog, New Relic).
- Solid understanding of distributed systems design patterns (e.g., transactional outbox, event-driven architecture and stream processing, queues).
- Demonstrated ability to bring new ideas forward, influence decisions, and lead complex technical initiatives.
- Bachelor’s degree in Computer Science or equivalent practical experience.
Benefits
- Unlimited PTO, volunteer hours and sabbatical
- Life, STD/LTD, medical, dental and vision insurance
- Highly discounted LifeTime gym membership
- 401k with match
- Collaborative, fun co-workers
- The opportunity to join the fastest-growing FinTech alongside a team of motivated and driven individuals