Thinkahead Consultant Psychologist Pty Ltd
We get to the heart of the matter... real people... real solutions
Senior Cloud/DevSecOps Engineer
Location: United States
Posted: 117 days ago
Salary: $170K - $200K / year
Bachelor Degree, English, AWS, Cloud, Docker, Kafka, Kubernetes, Microservices, Prometheus, Redis, Terraform, Vault
Job Description
• Design, implement, and automate multi-cloud infrastructure provisioning and deployments using Terraform, CloudFormation, Kubernetes/EKS, Docker, and serverless cloud functions.
• Architect and maintain robust CI/CD pipelines (Makefile, pytest, Dockerfiles, test spies/mocks) supporting modern agentic microservices and asynchronous event-driven workflows.
• Integrate and operationalize LangChain, LangGraph, LlamaIndex, and Pinecone-powered agent orchestration flows, building secure, monitorable event brokers (Kafka, AWS EventBridge, Redis Streams) and orchestrated job queues (Celery, AWS Batch).
• Champion security best practices: automate secrets/token/certificate management (Vault, AWS Secrets Manager), enforce fine-grained RBAC and token-based authentication (OAuth2), and oversee Private Link and cross-cloud access controls.
• Monitor, manage, and remediate cloud and on-prem security incidents, participate in on-call rotations, and support production outage resolution and root cause analysis.
• Implement comprehensive observability: distributed tracing, logging, metrics, alerting (Prometheus, ELK, OpenTelemetry, Datadog), dashboard visualization, and actionable production feedback loops.
• Collaborate with architects, engineers, and QA to define, document, and maintain event schema contracts, compliance policies, backup/recovery, and SLO/SLA targets.
• Contribute to security audits, compliance reporting, incident and postmortem documentation, and continuous process improvement reviews.
• Lead or participate in sprint planning, backlog grooming, process retrospectives, and cross-team knowledge sharing and onboarding.
Job Requirements
- Demonstrated experience in event-driven cloud infrastructure (AWS EKS, Kubernetes, Terraform, Docker, serverless Lambda/Batch, cross-cloud integration).
- Proficiency in building and optimizing CI/CD pipelines for fast, reliable agentic deployments (Makefile, pytest, Dockerfiles).
- Practical experience implementing security in agent/LLM and microservices environments: Vault, Secrets Manager, token/cert rotation, RBAC, network controls.
- Experience deploying, scaling, and monitoring event brokers (Kafka/EventBridge/Redis Streams) and background worker orchestration platforms (Celery, AWS Batch).
- Deep knowledge of security, compliance, observability, incident response, and SRE best practices.
- Familiarity with LangChain/LangGraph agentic patterns, vector DBs (Pinecone), and event-driven ML/data integrations is highly desirable.
- Excellent communication skills for cross-functional collaboration, agile ceremonies, incident postmortems, documentation, and knowledge transfer.