Granicus

Empowering a Modern Digital Government.

Senior Site Reliability Engineer – AWS, AI/ML, APM

DevOps EngineerDevOps EngineerFull TimeRemoteTeam 501-1,000Since 1999H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

140 days ago

Salary

$80K - $100K / year

Postgraduate Degree5 yrs expEnglishAnsibleAWSAzureChefCloudElastic SearchJavaLinuxLogstashPuppetPythonRubyUnixGo

Job Description

• Provide production support on a shift according to the team on-call roster. • Work on the customer and internal engineering/implementation team raised tickets while not on-call for production support. • Work on SREs backlog items. • Continuously monitor the health and performance of our services, systems, and infrastructure. • Respond to alerts and incidents promptly to ensure high availability. • Develop and maintain automation scripts and tools to streamline operations and reduce manual intervention. • Assist in troubleshooting and resolving incidents, performing root cause analysis, and implementing long-term fixes to prevent recurrence. • Participate in designing and implementing system improvements to enhance reliability, scalability, and performance. • Work closely with software engineers to understand application requirements, provide feedback on design and architecture, and support deployment and release processes. • Create and maintain documentation for processes, procedures, and troubleshooting guides to ensure knowledge sharing within the team. • Assist in capacity planning activities to anticipate future needs and ensure that our infrastructure can handle growth. • Implement and adhere to security best practices to protect our systems and data.

Job Requirements

  • 5+ years in site reliability engineering, system administration, or a similar role, with a proven track record of managing large-scale, high-availability systems.
  • Experience supporting AI/ML infrastructure, including model deployment, inference optimization, and integration with services like AWS Bedrock is highly desirable.
  • Expertise in Linux/Unix systems, and cloud platforms (AWS, Azure, or Google Cloud).
  • Strong proficiency in scripting languages (Python, Bash, Ruby) and programming languages (Go, Java, C++).
  • Familiarity with AI/ML operations, including model lifecycle management, vector databases, and inference performance tuning.
  • Experience with the ELK Stack (Elasticsearch, Logstash, Kibana) for centralized logging, monitoring, and observability.
  • Experience with configuration management tools (Ansible, Chef, Puppet).
  • Exposure to AI/ML toolchains, including AWS Bedrock, SageMaker, and LLMOps frameworks.
  • Relevant certifications such as AWS Certified DevOps Engineer, AWS Certified Machine Learning – Specialty, Google Cloud Professional DevOps Engineer, or similar are a plus.

Benefits

  • Flexible Time Off – Take the time you need to rest, recharge, and live your life.
  • Company-Wide Wellbeing Days – Paid days off to unplug and focus on your mental health.
  • Work From Home Reimbursement – Support a productive home office environment.
  • Multiple Health Plan Options – Including a 100% employer-paid plan.
  • Employer HSA Contributions – When enrolled in a High-Deductible Health Plan.
  • Fitness Reimbursement Program – Stay active, your way.
  • On-Demand Mental Health Support – Access to Headspace and other wellness tools.
  • Paid Parental Leave – For both birthing and non-birthing parents.
  • Traditional & Roth 401(k) – With a generous company match.
  • Life & AD&D Insurance – 100% employer-paid coverage for peace of mind.
  • Online Learning Platforms – Fuel your professional development.
  • Competitive Salary & Bonuses – Your contributions are valued and rewarded.

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Senior Site Reliability Engineer

Circle

The all-in-one community platform for creators and brands. https://circle.so/

DevOps Engineer141 days ago
Full TimeRemoteTeam 51-200Since 2019H1B Sponsor

Senior Site Reliability Engineer ensuring fast, reliable, and secure systems for Circle’s platform

AWSKubernetesMySQLPostgresRedis
California
$130K - $140K / year

Intermediate DevOps Engineer

AbacusNext

Cloud-based tech provider for legal and accounting firms. AbacusLaw, Amicus Attorney, Amicus Cloud, OfficeTools, HotDocs

DevOps Engineer141 days ago
Full TimeRemoteTeam 201-500Since 1983H1B No Sponsor

DevOps Engineer designing and implementing processes at CARET

AnsibleAWSAzureCloudDNSDockerKubernetesMongoDBPythonRedisSQLTerraform
California
$90K - $110K / year
DevOps Engineer143 days ago
Full TimeRemoteTeam 1-10Since 2017H1B No Sponsor

Senior DevOps Engineer managing application deployments in cloud environments

AWSAzureCloudDockerGrafanaKubernetesPrometheusPythonTerraformVault
United States
$10K / month

Web3 Infrastructure DevOps Engineer

Generative AI

Your global hub to Discover, Learn, and Grow with AI

DevOps Engineer143 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

DevOps Engineer managing decentralized infrastructure for Loti AI, Inc.

DockerIPFSKubernetes
United States