Share this Job

Title:  Site Reliability Engineer


Research Triangle Park, NC, US, 27709

Requisition ID:  31455


Are you data-driven?  We at NetApp believe in the transformative power of data – to expand customer touchpoints, to foster greater innovation, and to optimize operations.  We are designed for simplicity, optimized to protect, created to embrace future opportunity, and open to enrich choice.  We are the data authority for hybrid cloud, and we are helping our customers realize the full potential of their data.


We’ve built a Data Fabric for a data-driven world – to simplify and integrate data management across the resources that are best for the business.  With the Data Fabric, our customers can harness the power of cloud data services, build cloud infrastructures, and modernize storage through data management.


By harnessing the power of hybrid cloud data services, customers gain the freedom of choice to securely manage and move data – anywhere, on any cloud. Only NetApp can help organizations deliver data-rich customer experiences when they rapidly test and deploy new applications that easily use data and services regardless of where they reside or in what form.

Job Summary

Job Summary


An opportunity to work as a Service Reliability Engineer (SRE) for the Cloud Data Services Business Unit of NetApp.  This role is a hybrid of systems engineering and software engineering. You will be responsible for building out the new NetApp Kubernetes-as-a-Service (NKS) cloud-based production environment and be empowered to ensure the availability, security, and performance of the Service through automation.   This team is responsible for Systems Monitoring, Provisioning, Configuration Management, Capacity Management, Deployment and Rollback, Incident Management, and SDLC practices.


The NKS SRE team’s primary responsibility is running the NKS service supporting NetApp customers worldwide.  


You'll work with the latest container technologies in the leading cloud service provider(s) and a full stack of modern programming languages and tools to develop and support this service.  This team will depend on you to advise on design, implementation, process, and scaling of this service while playing the vital role of ensuring service availability, performance and security.

Job Requirements

Required skills:


Understanding of cloud native container technologies like Kubernetes

Experience working in a 24/7 production engineering organization

Understanding of Linux container principles and best practices

Ability to listen, communicate, evaluate, problem solve, multi-task, and prioritize in a high-pressure, mission-critical, and rewarding team environment.


Additional desired skills:


Deep expertise troubleshooting complex distributed systems

Demonstrated experience programming and testing Python, Ruby or Go

Experience using RESTful API services

Experience with creating and improving documented procedures and/or playbooks

Working knowledge of Chef, Puppet, Ansible, or Salt

Understanding of TCP/IP and Unix networking

Typically requires a minimum of 8 years of related experience with a Bachelor’s degree; or 6 years and a Master’s degree; or a PhD with 3 years experience; or equivalent experience.


So get ready to tap into the data visionary within, and join us as we accelerate digital transformation and empower our customers to change the world with data!


If you ask a NetApp employee why they work here, the answer is inevitably the same: the people. At NetApp, our culture is at the heart of what we do. We place importance in trust, integrity, teamwork, and caring above all else. NetApp is a place where people are empowered to make a difference. Empowered to innovate. Empowered to collaborate. Empowered to help ourselves and others be data-driven and change the world. We take care of each other, our customers, our partners, and our communities simply because it’s the right thing to do.


We work hard but also recognize the importance of work-life balance for our employees because what’s important to them is important to us!  Recently we implemented Family First, which encourages employees to take paid time off to bond with a new child (through birth or adoption) or to care for a family member with a serious health condition.  Our volunteer time off program is best in class, offering employees 40 hours of paid time off per year to donate their time with their favorite organizations.  We provide comprehensive medical, dental, wellness and vision plans for you and your family.  We offer educational assistance, legal services, and access to discounts and fitness centers. We also offer financial savings programs to help you plan for your future.  


Join us and see what empowerment can do.



Equal Opportunity Employer Minorities/Women/Vets/Disabled

Nearest Major Market: Durham
Nearest Secondary Market: Raleigh

Job Segment: Medical, Engineer, Data Management, Linux, Healthcare, Engineering, Data, Research, Technology