Avantos.ai
AI-Driven Onboarding and Servicing Made Smarter
Senior DevOps Engineer
Location
United States
Posted
17 days ago
Salary
Not specified
8 yrs expEnglishAWSCloudDistributed SystemsDockerJava ScriptKafkaKubernetesMicroservicesNext.jsPostgre SQLPythonTerraformGo
Job Description
• Own and evolve our infrastructure, reliability, and deployment practices
• Responsible for building the foundational platform that enables our engineering teams to ship quickly and reliably while maintaining the security and compliance standards required in financial services
• Design, implement, and maintain our AWS cloud infrastructure using infrastructure-as-code principles with Terraform
• Build and optimize CI/CD pipelines to enable rapid, safe deployments across multiple environments
• Own observability strategy—implement comprehensive monitoring, logging, and alerting systems using Datadog and other tooling
• Architect and manage containerized workloads on ECS Fargate and evaluate migration paths to Kubernetes
• Establish and enforce security best practices, working closely with compliance teams on financial services requirements
• Design and implement disaster recovery, backup, and business continuity strategies
• Optimize system performance, cost efficiency, and resource utilization across AWS services
• Collaborate with engineering teams to improve service reliability, reduce toil, and establish SLOs/SLIs
• Participate in incident response and conduct thorough post-mortems to drive continuous improvement
• Mentor engineers on DevOps practices, cloud architecture patterns, and operational excellence
Job Requirements
- 8+ years of experience in DevOps, SRE, or infrastructure engineering roles
- Expert-level proficiency with AWS services including ECS Fargate, ALB, Cognito, S3, SQS, and related services
- Deep hands-on experience with Terraform for managing complex, multi-account AWS environments
- Strong scripting and automation skills in Python and/or Bash
- Proven experience designing and implementing CI/CD pipelines (GitHub Actions, ArgoCD, or similar)
- Solid understanding of containerization technologies (Docker) and orchestration platforms (Kubernetes/ECS)
- Experience with observability and monitoring tools (Datadog, CloudWatch, or equivalent)
- Deep knowledge of networking, security, and AWS best practices
- Strong problem-solving abilities and experience troubleshooting complex distributed systems
- Excellent communication skills and ability to work cross-functionally with engineering teams
- Nice to haves: Experience in financial services or highly regulated industries, Familiarity with event-driven architectures and message queue systems (Kafka, SQS), Experience with PostgreSQL performance tuning and RDS management, Knowledge of microservices architecture patterns and service mesh technologies, Experience with security tooling, vulnerability scanning, and compliance frameworks, Familiarity with our application stack (Golang, Next.js, PostgreSQL), Experience managing AI/ML infrastructure and AWS Bedrock.
Benefits
- Competitive compensation + meaningful equity
- Opportunity to build production infrastructure from the ground up for a rapidly scaling AI platform
- A culture optimized for engineering excellence, focus, deep work, and ownership—not ticket factories
- Remote work flexibility
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Engineer17 days ago
Full TimeRemoteTeam 11-50Since 2023
DevOps/Infrastructure Engineer managing GCP infrastructure for health optimization product
CloudGoogle Cloud PlatformPostgreSQLSQLTerraform
Senior DevOps Engineer
Jump - Advisor AIJump uses AI to help financial managers automatically take notes, stay compliant, update their CRM, and serve clients.
DevOps Engineer17 days ago
Full TimeRemoteTeam 51-200Since 2023
Senior DevOps Engineer building infrastructure for financial advisors at jumpapp.com
CloudDistributed SystemsGoogle Cloud PlatformGrafanaKubernetesPrometheusSDLCSQLTerraform
Site Reliability Engineer
PeachGiving lenders the tools to scale and modernize through integration to our API-first, cloud-native platform.
DevOps Engineer17 days ago
Full TimeRemoteTeam 11-50Since 2018
SRE developing infrastructure and maintaining high reliability at a B2B SaaS company
CloudPythonTerraform
DevOps Engineer17 days ago
Full TimeRemoteTeam 51-200Since 2019
Site Reliability Engineer optimizing deployments and maintaining cloud infrastructure
AWSAzureCloudGoogle Cloud PlatformKubernetesLinuxSubversionUnix
United States