Site Reliability Engineer I
Description Summary of This Role
Responsible for availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning. Creates a bridge between development and operations by applying a software engineering mindset to system administration topics. Splits time between operations/on-call duties and developing systems and software that help increase site reliability and performance.
What Part Will You Play?
Chaos engineering: thinking laterally about how systems might fail in theory, designing tests to demonstrate how they behave in practice, and then formulating and implementing remediation plans as appropriate.
Using DevOps and GitOps practices to improve automation and processes so that self-service is possible.
Pushing our systems to their limits, and then coming up with designs for how to get them to the next performance tier.
Safeguarding reliability. Ensuring that our services are highly available, resilient against disasters, self-monitoring, and self-healing.
Running “game days” to test assumptions about reliability and learn what will break before it matters to customers.
Building systems to proactively monitor the health, performance and security of our production and non-production virtualized infrastructure.
Improving our monitoring and alerting systems to make sure engineers get paged when it matters (and don't get paged when it doesn't).
Troubleshooting systems and network issues, alongside our Technical Operations Team.
What Are We Looking For in This Role?
Minimum Qualifications
- BS in Computer Science, Information Technology, Business / Management Information Systems or related field
- No experience required. Typically has basic knowledge of programming in one or more languages, and of Unix/Linux systems internals and administration (e.g. filesystems, inodes, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware, SDN).
Preferred Qualifications
- Basic knowledge of Kubernetes and Docker.
- Good understanding of the Linux operating system. Must be comfortable using the command line and writing bash scripts.
- Must be able to write programs using Python or Go.
- Typically has 1-2 years of experience in IT.
What Are Our Desired Skills and Capabilities?
Skills / Knowledge - Learns to use professional concepts. Applies company policies and procedures to resolve routine issues.
Job Complexity - Works on problems of limited scope. Follows standard practices and procedures in analyzing situations or data from which answers can be readily obtained. Builds stable working relationships internally.
Supervision - Normally receives detailed instructions on all work.