571-463-1810Β ||Β [Upgrade to PRO to see contact]
Β
Hiring SRE or DevOps Engineer || Hybrid β 3x
Only on W2
Role: SRE or DevOps Engineer
Location: Hybrid at Alpharetta, GA
Duration: 6+ month contract
We are seeking a skilled Site Reliability Engineer to join our team and help build, maintain, and scale our cloud-native infrastructure. You will work closely with development and operations teams to ensure our systems are reliable, scalable, and efficient. The ideal candidate is passionate about automation, observability, and infrastructure-as-code, and thrives in a collaborative, fast-paced environment.
Key Responsibilities
** Design, implement, and manage cloud infrastructure on Azure using Terraform and Terragrunt.
** Maintain and optimize Kubernetes clusters on Azure Kubernetes Service (AKS).
** Build and manage CI/CD pipelines using GitHub Actions/Workflows and ArgoCD for GitOps deployments.
** Enhance system reliability by implementing monitoring, alerting, and observability solutions with Grafana.
** Automate operational tasks to reduce toil and improve team efficiency.
** Participate in on-call rotations, incident response, and post-mortem analysis.
** Collaborate with development teams to improve application performance, scalability, and resilience.
** Implement and advocate for SRE best practices, including SLIs, SLOs, and error budgets.
** Continuously improve system performance, cost efficiency, and security.
Β
Required Skills & Qualifications
** BS Degree
** 3+ years of experience in an SRE, DevOps, or cloud infrastructure role.
** Strong experience with Azure cloud services and infrastructure
** Proficiency with Kubernetes (preferably AKS and container orchestration.
** Hands-on experience with java and Terraform for infrastructure-as-code.
** Experience with CI/CD tools, especially GitHub Workflows/Actions and ArgoCD.
Β
Desired Skills:Β
** AKS
** Terragrunt
** Prometheus, Loki, Tempo experience is a plus
** Masterβs Degree Preferred