Senior Site Reliability Engineer
Senior Site Reliability Engineer (SRE) – 90% Remote – Atlanta, GA.
Charles Simon Associates is partnered with a leading global business that is building out a world-class Site Reliability Engineering (SRE) function. We are recruiting on its behalf for an experienced Site Reliability Engineer to join the team in Atlanta, Georgia.
This isn’t a “keep the lights on” position; it’s a high-impact role focused on driving automation, reliability, and performance across distributed, cloud-native systems that power critical services worldwide.
The Offer:
Salary: Up to $140,000 (DOE) + benefits.
Location: Atlanta, GA/Hybrid (onsite once a month).
Start: ASAP.
If you do not meet all three criteria below, do not apply; you will not be successful:
- All candidates MUST be U.S. citizens to apply for this role due to travel requirements.
- All candidates MUST have worked as a Site Reliability Engineer in multiple roles.
- All candidates MUST have extensive Azure experience.
Why join?
- You’ll be part of a forward-thinking engineering culture where automation and reliability come first.
- You’ll work on cutting-edge tech stacks (Azure, AKS, Kubernetes, Terraform, Datadog, IaC).
- You’ll influence how reliability, resilience, and observability are built into systems from the ground up.
- You’ll join a global business that invests heavily in SRE and sees it as business-critical, not a bolt-on.
What you’ll do:
- Define and enforce SLOs, SLIs, and SLAs that set the standard for performance and reliability.
- Manage infrastructure as code (Terraform, Pulumi, CloudFormation) for scalable, auditable deployments.
- Deploy functional apps using Terraform and AKS (previous experience required).
- Deploy and operate Kubernetes & AKS environments at scale.
- Automate everything possible with PowerShell, Python, or Bash.
- Integrate and optimise observability & monitoring (Datadog preferred, but Grafana, Azure App Insights, and Log Analytics are also in play).
- Lead incident response, postmortems, and continuous improvement cycles.
- Optimise cost, capacity, and performance across cloud workloads.
- Drive resilience through chaos engineering and recovery testing.
What we’re looking for:
- Strong hands-on experience as a Site Reliability Engineer.
- Deep expertise with Terraform and IaC in live production environments.
- Proven experience with Kubernetes / AKS.
- Strong scripting background (PowerShell, Python, or Bash).
- Monitoring/observability knowledge with tools like Datadog, Grafana, Azure App Insights, Log Analytics.
- Solid understanding of web applications & distributed systems.
Bonus points if you bring:
- Knowledge of Microservices Architecture.
- Experience working in Kanban environments.
If you’re the type of engineer who thrives on solving complex reliability challenges, loves automation, and wants to shape how SRE is done in a global business, this role is for you.