Tealium
We connect data so you can connect with your customers.
Senior Observability Engineer
Location
United States
Posted
38 days ago
Salary
$140K - $160K / year
Bachelor Degree4 yrs expEnglishAWSJavaJenkinsKubernetesPrometheusPythonTerraformGo
Job Description
• Participate in rotating on-call approximately 20% of working time.
• Lead end-to-end observability design for all features in production and internal usage.
• Instrument features in Tealium products.
• Implement monitoring and cost tracking.
• Build open telemetry pipelines to track LLM request/response metrics, prompt engineering observability, token usage, hallucination detection, and failover.
Job Requirements
- 4+ years in Site Reliability Engineering and Observability Engineering with focus on production-grade 24X7X365 systems.
- Deep experience instrumenting services and applications for observability.
- Familiarity with prompt engineering, embeddings, vector DBs (Neptune), and RAG-style architectures.
- Hands-on experience with OpenTelemetry, Datadog, Sumologic, Prometheus, or similar.
- Experience integrating observability into AI platforms: e.g., Bedrock, Neptune, LangChain, LlamaIndex, HuggingFace, SageMaker, etc.
- Proficiency with Java, Python, Go, or similar languages.
- Experience with multiple AWS services.
- Strong background in Infrastructure-as-Code (Terraform, ArgoCD) and CI/CD tooling (Jenkins, GitHub Actions).
- Understanding of Kubernetes and container orchestration.
- Excellent collaboration skills and comfort leading across SRE, Data Engineering, and Product/ML teams.
- Experience mentoring or leading technical initiatives.
- Communication skills for explaining complex concepts to non-technical stakeholders.
Benefits
- Employees are eligible to receive an annual bonus and stock options.
- Employees and their families are eligible for medical, dental, vision, life, and disability insurance.
- Employees have the option to enroll in our 401k plan and are eligible to receive contributions for company matching.
- Employees are eligible for flexible paid time-off and extended paid parental leave.
- We offer 11 paid holidays annually.
- We offer 15 hours of paid work time for volunteer activities and programs.
- Our sick leave accrual is the following for our employees: Exempt CA employees (not including San Francisco) including NY : accrue 40 hours each year. Unused sick leave carries over into the next year. Employees cannot exceed 80 hours in a given year. Exempt Non - CA employees (not including NY) including SF: Accrue 1 hour every 30 hours worked. Cannot exceed 180 hours in the calendar year. Non-Exempt: accrue 1 hour every 30 hours worked. Unused carries over to the next year. Not to exceed 108 hours in a calendar year.
Related Guides
Related Categories
Related Job Pages
More Engineer Jobs
Senior Genesys Telecom Engineer
ComPsychThe World’s Largest Provider of Mental Health Services and GuidanceResources® for Life.
Engineer38 days ago
Full TimeRemoteTeam 1,001-5,000Since 1984H1B Sponsor
Senior Telecom Engineer specializing in Genesys Cloud CX and Workforce Management.
Cloud
Engineer38 days ago
Full TimeRemoteTeam 201-500H1B Sponsor
Adobe Real-Time CDP Engineer designing and implementing customer data platform solutions
AWSAzureCloudGoogle Cloud PlatformSQL
United States
Engineer38 days ago
Full TimeRemoteTeam 201-500Since 2017H1B No Sponsor
RF Systems Engineer designing and integrating RF sensing capabilities for Forterra's systems
Energy Engineer I/II – Peak Load Management
Power TakeOffRevolutionizing the way utilities and businesses participate in energy efficiency.
Engineer38 days ago
Full TimeRemoteTeam 51-200Since 2007H1B No Sponsor
Energy Engineer supporting peak load management at Power TakeOff.