SageSure Insurance Managers
SageSure is an insurance company and division of Insight Catastrophe Group, a New York-based company that delivers property risk management services. As an empl
Principal DevOps Engineer
Location
United States
Posted
61 days ago
Salary
Not specified
Bachelor Degree8 yrs expEnglishAWSCloudDistributed SystemsDockerKubernetesNGINXPythonGo
Job Description
• Drive the development and continuous improvement of platform tools, emphasizing scalability, reliability, and monitoring capabilities to effectively support engineering teams.
• Design and implement self-service tools and frameworks that empower engineering teams, promoting scalability, efficiency, and reusability across various platforms.
• Provide expert-level technical oversight and mentorship to engineering teams, ensuring platform capabilities are seamlessly integrated into workflows and aligned with organizational goals.
• Establish and maintain comprehensive technical documentation and engineering standards, ensuring platform tools remain understandable, extensible, and accessible to all teams.
• Analyze and resolve complex performance issues within platform tools, identifying root causes, and implementing robust, scalable solutions to enhance efficiency and reliability.
• Proactively research and adopt new technologies, tools, and engineering patterns that elevate developer productivity and improve self-service capabilities.
• Focus extensively on scalability, performance optimization, and sustainable software delivery, ensuring efficient resource utilization and cost effectiveness.
• Actively participate in on-call rotations, providing critical expertise and technical guidance to maintain production environment resilience and high availability.
Job Requirements
- 8+ years of experience building, scaling, and implementing DevOps solutions, with deep expertise in automation and tooling.
- Ability to develop and document reusable, scalable solutions that enable engineering teams to operate efficiently.
- Demonstrated expertise in automation practices, leveraging best practices and methodologies to streamline workflows and seamlessly integrate tooling into engineering processes.
- A platform-focused mindset, capable of developing, documenting, and promoting reusable patterns that significantly enhance engineering productivity.
- Deep knowledge of Kubernetes or similar container orchestration frameworks, along with experience using ecosystem tools such as Helm, Docker, Argo Rollouts, and Ingress NGINX, to build resilient and scalable solutions.
- Hands-on experience with CI/CD methodologies and tools (e.g., GitLab CI, Blue/Green and Canary deployments, API testing), emphasizing modular, scalable, and efficient pipeline design.
- Proficient in automated scripting, including designing, coding, debugging, and maintaining scripts in Python, Bash, or Go, and capable of setting scripting standards and best practices.
- Experience developing solutions on cloud platforms, particularly AWS, with comprehensive knowledge of cloud-native architecture and best practices.
- Strong understanding of monitoring and observability best practices, including the ability to design solutions that deliver actionable insights into distributed systems.
Benefits
- Generous health benefits and perks
- Tuition reimbursement
- Wellness allowance
- Paid volunteer time off
- Matching 401K plan
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Engineer61 days ago
Full TimeRemote
AutoRABIT is looking for a Site Reliability/DevSecOps Engineer to help develop, scale and operate our cloud services. In this role you will be an experienced business professional able to implement and execute best practice operations and improvements across teams by providing vi...
AWSGCPAzureKubernetesDockerTerraformJenkinsAnsiblePythonBashELKGitLab CICI/CDLinuxTCP/IPDNSHTTP
DevOps Engineer61 days ago
Full TimeRemoteTeam 51-200Since 2015H1B Sponsor
DevOps Engineer collaborating on AI-native scientific solutions at TetraScience
AWSCloudDockerJavaKubernetesLinuxMicroservicesPythonTerraformGo
United States
DevOps Engineer61 days ago
Full TimeRemoteTeam 51-200H1B Sponsor
Site Reliability Engineer developing and operating cloud services at AutoRABIT
AnsibleAWSAzureChefCloudDockerGoogle Cloud PlatformJavaJenkinsKubernetesPythonShell ScriptingTerraformGo
DevOps Engineer62 days ago
Full TimeRemoteTeam 10,001+Since 1993H1B Sponsor
Senior Site Reliability Engineer developing scalable managed cloud services at Red Hat
AnsibleAWSAzureChefCloudDistributed SystemsDNSDockerGoogle Cloud PlatformJavaKubernetesLinuxOpenShiftPrometheusPuppetPythonTCP/IPUnixGo