Omilia - Conversational Intelligence

Omilia is the leading provider of Natural Language Understanding enabled IVR & natural dialogue interaction solutions.

Senior Site Reliability Engineer

DevOps Engineer | Full Time | Remote | Team 201-500 | Since 2002 | H1B No Sponsor

Location

United States

Posted

16 days ago

Salary

Not specified

English

Job Description

About the Role

We're looking for a Senior Site Reliability Engineer who approaches operational problems as engineering challenges. You won't just monitor dashboards and respond to pages — you'll help define and drive service level objectives, identify reliability risks, and work alongside engineering teams to ensure reliability and performance are first-class concerns from design through to production.

Your mission is not only to keep the platform running but also to make the platform more reliable by default — through better practices, smarter automation, and a culture where every engineer thinks about failure modes.

What You'll Do

Drive Incident Excellence

  • Act as a first responder during incidents; lead root cause analysis and blameless post-mortems.
  • Turn incident learnings into systemic improvements — better tooling, better runbooks, better architecture.
  • Provide input and guidance to squads on troubleshooting documentation and operational runbooks, ensuring they are practical and effective for production support.

Engineer Reliability

  • Define, implement, and iterate on SLIs, SLOs, and error budgets to drive data-informed reliability decisions.
  • Identify and measure operational toil; build software and automation to systematically reduce it.
  • Conduct capacity planning and performance analysis to stay ahead of scaling challenges.

Build Observability

  • Design and evolve observability platforms (metrics, logs, traces, dashboards) that give engineering teams genuine insight into system behaviour — not just noise.
  • Continuously improve alert quality: reduce false positives, increase signal, and ensure every alert is actionable.

Shape Reliability Culture

  • Partner with development teams to embed reliability thinking into the software delivery lifecycle — from design reviews to deployment strategies.
  • Champion practices like chaos engineering, progressive rollouts, and failure injection testing.
  • Mentor engineers across teams on reliability principles and operational best practices.

Participate in On-Call

  • Join on-call rotations and continuously improve the on-call experience for yourself and others.

Job Requirements

Must Have

  • Fluent English, ideally at a native level.
  • Education: Bachelor's or Master's in Computer Science, Engineering, or equivalent practical experience.
  • Demonstrated experience applying SRE principles: SLOs/SLIs, error budgets, toil reduction, and capacity planning.
  • Experience building or significantly evolving observability and monitoring solutions (we use Prometheus, Grafana, and ELK, but we care more about your approach than your tool familiarity).
  • Experience with AWS.
  • Linux systems administration background (RHEL/CentOS).
  • Hands-on experience operating services on container orchestration platforms (Kubernetes preferred).
  • A track record of improving the reliability of production systems at scale — through better automation, observability, and process, not just firefighting.
  • Strong communication skills and the ability to influence engineering culture across teams.
  • An analytical, systems-thinking mindset — you instinctively ask "why did this fail?" and "how do we make sure it can't?"

Nice to Have

  • Infrastructure-as-code and configuration management experience (Terraform, Ansible).
  • Strong scripting and automation skills (Bash, Python, or Go) — you're comfortable writing the glue that keeps systems healthy and eliminates repetitive work.
  • Networking fundamentals (TCP/IP, DNS, load balancing).
  • Database experience — relational (PostgreSQL, MySQL) or NoSQL (Redis).
  • Telephony domain knowledge (SIP, VoIP).
  • Familiarity with chaos engineering tools and practices.

Benefits

  • Fixed compensation;
  • Long-term employment with vacation counted in working days;
  • Professional development opportunities (courses, training, etc.);
  • Being part of successful cutting-edge technology products that are making a global impact in the service industry;
  • Proficient and fun-to-work-with colleagues;
  • Apple gear

Omilia is proud to be an equal opportunity employer and is dedicated to fostering a diverse and inclusive workplace. We believe that embracing diversity in all its forms enriches our workplace and drives our collective success. We are committed to creating an environment where everyone feels welcomed, valued, and empowered to contribute their unique perspectives. Without regard to factors such as race, color, religion, gender, gender identity or expression, sexual orientation, national origin, heredity, disability, age, or veteran status, all eligible candidates will be given consideration for employment.
