Kentik
The network observability company.
Staff Site Reliability Engineer, Cloud
Location
United States
Posted
58 days ago
Salary
$165K - $200K / year
Bachelor Degree8 yrs expEnglishAnsibleAWSAzureCloudDNSDockerFirewallsGoogle Cloud PlatformGrafanaKubernetesLinuxPrometheusPuppetPythonTcp/ipTerraformGo
Job Description
• Make sure our real-time, scalable, infrastructure is set up for growth and working efficiently. Our infrastructure runs on our own hardware, across multiple locations as well as all major cloud vendors
• Work on tools and processes to better monitor our platform as well as ensuring its stability through our rapid growth
• Deep-diving into diverse topics, from firewalls and IP routing, to database replication strategies or automating build processes
• Collaborate with engineering and infrastructure teams on finding solutions from an operational perspective
• Assist with expanding our cloud deployments across the major cloud providers
• Contribute code, code reviews and tools or patches to all kinds of existing code
• Write design documents or collaborate on colleagues’ docs to introduce new features or changes into our infrastructure
• Provide valuable feedback on team goals, projects, and processes. We believe in continuously improving our team
Job Requirements
- 8+ years of experience in cloud-based Systems Administration, IT and/or SRE related projects
- Expertise in public cloud environments such as AWS, GCP, Azure, or OCI.
- Strong command of containerization and orchestration using Docker and Kubernetes.
- Solid programming and automation skills using Bash, Python, or Go.
- Proficiency with Infrastructure as Code (IaC) and configuration management platforms such as Terraform, Ansible, and Puppet.
- Proficiency in Linux administration and command-line tools (e.g., SSH, grep, awk).
- Detailed understanding of major internet protocols (TCP/IP, DNS, HTTP, TLS)
- Networking administration experience: concepts such as routing, firewalls (iptables), peering sound familiar
- A passion for documenting code, processes, and infrastructure in runbooks and wikis
- Worked with metrics monitoring solutions such as grafana, prometheus, telegraf, and OpenTelemetry
- Experience creating and managing tickets with third party vendors and owning cloud vendor partner relationships.
Benefits
- 100% of premiums are paid by company for health, vision and dental coverage for you and your dependents
- Additionally, an annual Health Reimbursement Account (HRA) of $3,000 for an individual or $4,500 for a family
- Paid family & medical leave
- Open PTO, a quarterly Wellness Day, and a minimum of 10 paid holidays
- 401(k) retirement account
- Home office reimbursement
- Stock options
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Engineer58 days ago
Full TimeRemoteTeam 501-1,000H1B Sponsor
Principal DevOps Engineer at SageSure enhancing software delivery processes
AWSCloudDistributed SystemsDockerKubernetesNGINXPythonGo
United States
DevOps Engineer58 days ago
Full TimeRemote
AutoRABIT is looking for a Site Reliability/DevSecOps Engineer to help develop, scale and operate our cloud services. In this role you will be an experienced business professional able to implement and execute best practice operations and improvements across teams by providing vi...
AWSGCPAzureKubernetesDockerTerraformJenkinsAnsiblePythonBashELKGitLab CICI/CDLinuxTCP/IPDNSHTTP
DevOps Engineer58 days ago
Full TimeRemoteTeam 51-200Since 2015H1B Sponsor
DevOps Engineer collaborating on AI-native scientific solutions at TetraScience
AWSCloudDockerJavaKubernetesLinuxMicroservicesPythonTerraformGo
United States
DevOps Engineer59 days ago
Full TimeRemoteTeam 51-200H1B Sponsor
Site Reliability Engineer developing and operating cloud services at AutoRABIT
AnsibleAWSAzureChefCloudDockerGoogle Cloud PlatformJavaJenkinsKubernetesPythonShell ScriptingTerraformGo