Site Reliability Engineer
Responsibilities - Be available to respond to critical service incidents outside of business hours on a rotating on-call schedule.
- Proactively monitor application health and performance across cloud infrastructure (AWS).
- Troubleshoot and prevent service interruptions in real-time, working closely with development teams to resolve incidents efficiently.
- Lead and participate in disaster recovery drills and security incident simulations.
- Implement Infrastructure as Code (IaC) and maintain scalable deployments using AWS-native tools and services.
- Collaborate with development teams to ensure smooth CI/CD workflows using Git and containerized deployments (Docker).
- Work closely with stakeholders and product teams to ensure technical reliability aligns with business needs.
- Support and improve observability tools, alerting mechanisms, and logging infrastructure to promote transparency and response agility.
- Champion best practices in security, availability, performance, and incident response.
Required Technologies & Tools
- Cloud Infrastructure : Strong proficiency in Amazon Web Services (AWS) with knowledge of services like EC2, ECS, RDS, CloudWatch, and IAM.
- Programming/Scripting : Proficiency in Node.js and scripting for automation and tooling.
- Containerization : Experience with Docker for container-based deployment pipelines.
- Frontend Awareness : Familiarity with React and Ember.js to understand performance implications at the frontend level.
- Backend Stack : Understanding of NestJS and scalable Node-based services.
- Databases : Proficient in MySQL and performance monitoring of relational databases.
- Version Control : Proficiency with Git for collaborative code management and DevOps workflow integration.
Core Competencies
- Incident Response : Calm and focused under pressure with a structured approach to resolving outages and degradation.
- System Design : Ability to contribute to and review architectural designs for scalability and resiliency.
- Collaboration : Strong communication skills to coordinate across developers, QA, and product teams.
- Automation & Efficiency : Passion for automation, repeatability, and continuous improvement.
- Security Mindset : Consistent implementation of security best practices and a strong grasp of data protection standards.
Qualifications
- 3+ years of experience in a Site Reliability, DevOps, or related engineering role.
- Proven track record managing and scaling applications in a production AWS environment.
- Familiarity with full stack environments , particularly those using Node.jss .
- Experience maintaining and deploying databases such as MySQL with performance tuning.
- Experience with container orchestration (e.g., ECS or Kubernetes is a plus).
- Commitment to uptime, performance, and security in fast-moving SaaS environments.
Recommended Jobs
Desktop Support Technician
About the Role: MetroSys is seeking a skilled and customer-focused Desktop Support Technician to provide onsite technical support at a manufacturing client facility in Dublin, Georgia. The ideal…
Warehouse Operations Clerk
Resumen: El Empleado de operaciones de almacén es responsable de garantizar que todos los documentos relacionados con el envío se completen de manera precisa y oportuna. Este puesto reporta al Supervi…
Quality Assurance Tech - 3rd Shift
Job Summary Adhere to and report accurately and completely on all aspects of food safety and quality-related functions l. Review all quality documentation and operations control documents and assi…
Warehouse Operations Manager (Second Shift)
Warehouse Operations Coordinator Job Description IGF Mission To deliver quality service and cultivate community around the table. Operations Mission To deliver quality service through a t…
VP Product Management, Developer & Observability Tools, OCI, NA
**Job Description** Oracle Cloud Infrastructure (OCI) is Oracle's next-generation enterprise cloud platform, delivering high-performance, secure, and scalable compute, networking, AI, storage, and dev…
Stocker - Meat
Position Title: Stocker - Meat Department: Meat Supervisor: Meat Manager FLSA: Full/Part Time, Hourly, 8-10 Hour Shifts, Union Restaurant Depot is a wholesale cash-and-carry foodservice…
Concrete Construction Estimator
Who we are: American Structural Concrete is an industry leader in safety. For over 30 years, our team of seasoned and experienced professionals has built some of the most recognized concrete str…
Full Time Cardiology Job GA
Interventional Cardiologist with PV interest position west of Savannah Job ID# 70847 Job Details BC/BE in Interventional Cardiology Joining a private Cardiology practice ateam of 1Interven…
Data Analyst (Oracle SQL, Tableau, Criminal Justice)
Job Title : Data Analyst (Oracle SQL Tableau Criminal Justice) Location : Atlanta GA (Remote ) We are currently seeking candidates who meet the following qualification Primary Duties …
Sr Project Engineer
Portfolio Business : Huber Engineered Materials J.M. Huber Corporation is one of the largest privately held, family-owned companies in the United States. Established in 1883, we are a diversifie…