Share this Job

Title:  Site Reliability Engineer


Waltham, MA, US, 02451

Requisition ID:  32642


Are you data-driven?  We at NetApp believe in the transformative power of data – to expand customer touchpoints, to foster greater innovation, and to optimize operations.  We are designed for simplicity, optimized to protect, created to embrace future opportunity, and open to enrich choice.  We are the data authority for hybrid cloud, and we are helping our customers realize the full potential of their data.


We’ve built a Data Fabric for a data-driven world – to simplify and integrate data management across the resources that are best for the business.  With the Data Fabric, our customers can harness the power of cloud data services, build cloud infrastructures, and modernize storage through data management.


By harnessing the power of hybrid cloud data services, customers gain the freedom of choice to securely manage and move data – anywhere, on any cloud. Only NetApp can help organizations deliver data-rich customer experiences when they rapidly test and deploy new applications that easily use data and services regardless of where they reside or in what form.

Job Summary

The OnCommand Insight Team is one of NetApp’s fastest moving teams. We deliver enterprise software to manage infrastructure and applications. We are seeking Site Reliability Engineers to help us build out our new cloud offering.


As a Site Reliability Engineer, you’ll engage in and improve the lifecycle of cloud services - from design through deployment, operation and refinement. You’ll maintain services by measuring and monitoring availability, latency and overall system health. You’ll play an important role in scaling systems sustainably through automation and evolving them by pushing for changes to improve reliability and velocity.

Job Requirements


  • Work with other SREs and Developers to ensure maximum performance, reliability and automation of our deployments and infrastructure
  • Work with, and consult with, development on new features and software architecture to ensure scalability
  • Develop software, both as components of our solution and outside of the solution, for deployment automation, packaging and monitoring visibility
  • Analyze and improve latency, performance, and availability
  • Resolve critical and high visibility customer issues


  • Strong understanding of systems design and networking with regards to performance and scale
  • Strong experience in Linux system administration and shell scripting
  • Scripting and automation with Python, Go, Ruby, or another programming language
  • Experience with at least one major cloud provider (AWS preferred)
  • Experience or clear understanding of containerization and container orchestration tools such as Kubernetes
  • Experience operating a cloud service infrastructure, including
    • Scaling and high availability patterns
    • Issue trubleshooting and resolution
    • Sftware deployment and CI/CD pipelines
    • Mnitoring
  • Experience with configuration management tools such as Terraform, Ansible, and SaltStack
  • Understanding of microservices architecture and REST interfaces

Typically requires a minimum of 5 years of related experience with a Bachelor’s degree; or 3 years and a Master’s degree; or a PhD without experience; or equivalent work experience.


So get ready to tap into the data visionary within, and join us as we accelerate digital transformation and empower our customers to change the world with data!


If you ask a NetApp employee why they work here, the answer is inevitably the same: the people. At NetApp, our culture is at the heart of what we do. We place importance in trust, integrity, teamwork, and caring above all else. NetApp is a place where people are empowered to make a difference. Empowered to innovate. Empowered to collaborate. Empowered to help ourselves and others be data-driven and change the world. We take care of each other, our customers, our partners, and our communities simply because it’s the right thing to do.


We work hard but also recognize the importance of work-life balance for our employees because what’s important to them is important to us!  Recently we implemented Family First, which encourages employees to take paid time off to bond with a new child (through birth or adoption) or to care for a family member with a serious health condition.  Our volunteer time off program is best in class, offering employees 40 hours of paid time off per year to donate their time with their favorite organizations.  We provide comprehensive medical, dental, wellness and vision plans for you and your family.  We offer educational assistance, legal services, and access to discounts and fitness centers. We also offer financial savings programs to help you plan for your future.  


Join us and see what empowerment can do.



Equal Opportunity Employer Minorities/Women/Vets/Disabled

Nearest Major Market: Waltham
Nearest Secondary Market: Boston

Job Segment: Medical, Engineer, Cloud, Software Engineer, Data Management, Healthcare, Engineering, Technology, Data