Cloud Infrastructure - Site Reliability Engineer
Title: Cloud Infrastructure - Site Reliability Engineer
Location: Alpharetta, GA or Berkeley Heights, NJ (5 Days Onsite)
Certifications:
Certified Engineer, DevOps, SRE, CSREF
Job Description:
As a Cloud Infrastructure Site Reliability Engineer (SRE) with expertise across multiple public cloud platforms, you will be responsible for managing and operating cloud infrastructure in alignment with the principles of Google's SRE model. Your role will focus on ensuring the reliability, availability, and performance of our cloud services, while driving automation and continuous improvement across production environments. You will collaborate closely with cross-functional teams to strengthen our cloud reliability posture and streamline operations through innovative automation solutions.
Key Responsibilities:
- Design, build, and maintain highly available, scalable, and secure cloud infrastructure on platforms such as AWS, GCP, or Azure.
- Develop and implement automation for provisioning, monitoring, scaling, and incident response using Infrastructure-as-Code tools (e.g., Terraform, CloudFormation, Ansible). Monitor system reliability, capacity, and performance; proactively detect and address issues before they impact users.
- Respond to production incidents, participate in on-call rotations, and lead post-incident reviews to drive root cause analysis and reliability improvements.
- Collaborate with software engineering and security teams to ensure new services and features are production-ready and meet reliability standards.
- Build and maintain tools for deployment, monitoring, and operations; automate manual processes to reduce toil.
- Document operational processes and system architectures to ensure knowledge sharing and repeatability.
- Continuously evaluate and implement new technologies to improve system reliability, security, and efficiency.
Qualifications :
- Bachelor's degree in computer science, Engineering, or a related technical field, or equivalent practical experience.
- 3+ years of experience in software development with proficiency in at least one programming language (e.g., Python, Go, Java, C++).
- Experience administrating cloud platforms (AWS, GCP, Azure), including networking, security, containerization, storage, data management, and serverless technologies.
- Solid understanding of Linux systems, networking fundamentals, virtualized, and distributed systems, file systems, system processes and configurations.
- Deep understanding of observability (monitoring, alerting, and logging) tools in cloud environments. Ability to set up and maintain monitoring dashboards, alerts, and logs. Familiarity with Continuous Integration/Continuous Deployment (CI/CD) tools for automated testing, deployments, provisioning, and observability.
- Ability to manage and respond to incidents, perform root cause analysis, and implement post-mortem reviews. Understanding of setting, monitoring, and maintaining Service-Level Objectives (SLOs) and Service-Level Agreements (SLAs) for system reliability.
- Additional Qualifications a Plus: Experience working with enterprise-scale financial services or other regulated industries
Recommended Jobs
Nurse Practitioner (NP) / Physician Assistant (PA) - LOCUM TENENS
This is a generalized description of locum PA job requirements. Specific assignment details may vary based on the facility and the PA specialty. General Job Responsibilities: Provide comprehensiv…
Revenue Accounting Manager
Job Description Job Description You could be an accounting manager anywhere. Why Jerry.ai? Join a pre-IPO startup with capital, traction and runway ($240M funded | 60X revenue growth in 5 years…
Project Planner
Job Description Job Description SUMMARY The Project planner will exhibit genuine interest in solving work problems through proactively asking questions, clearly communicating and collaborating…
Executive Personal Assistant
Job Description Job Description Benefits: Bonus based on performance Competitive salary Paid time off Benefits/Perks Competitive Compensation Great Work Environment Career Ad…
Facilities and Small Engine Maintenance Supervisor
Join Our Winning Team! – Small Engine Mechanic & Facility Supervisor Augusta, GA | Mon–Thurs 9AM–5:30PM | Fri 9AM–12PM; flexibility in schedule possible Are you passionate about engines, tools,…
Dental Hygienist
Job Description Job Description Would you like to work in a practice that puts patients first? Where you are greeted by the smell of freshly baked cookies every morning? Are you looking for your …
Experience Southern Charm while Healing in Warner Robins!
Registered Nurse - Telemetry - Travel - (Tele RN) Dive into the heart of Warner Robins as a Telemetry RN, where every shift brings a new chance to heal and thrive! With a mix of adult and geriatric p…
Marketing Lead
This is a remote, individual contributor role , leading the end-to-end development of targeted marketing programs. Job Title: Vertical Marketing Leader - Americas Position Type: Full-Time, E…
Advanced Technician
Your Job The Georgia Pacific facility in Rincon, GA currently recruiting for an Advanced Technician (AT) assigned to the Away From Home department. In this role compensation will be commen…
Construction Project Engineer
Job Description Job Description Eastern Excavating Co. is seeking a full time Project Engineer. The company offers a family atmosphere and is established in the Savannah, GA area for 35+ years. O…