Title:  Software Engineer

Location:  Bangalore, Karnataka, IN

Requisition ID:  135334

Job Summary

The Customer Reliability Engineering (CRE) team at Keystone blends software engineering, SRE, and customer experience with a strong customer-first mindset, owning issues end-to-end and driving systemic reliability improvements.

You act as a bridge between customers, support, and engineering, resolving complex cross-system issues and improving reliability, observability, and customer experience across distributed systems such as subscription, activation, telemetry, and billing.

CREs take direct ownership of debugging, fixing, and enhancing services within the subscription lifecycle, resolving issues at the source and involving development teams only for major or architectural changes.

About the Team

The NetApp Keystone team powers NetApp’s storage-as-a-service (STaaS) offering, enabling customers to consume storage across on-prem and cloud environments through a flexible subscription model.

The platform spans multiple distributed components, including Subscription Engine, Activation Workflows, Data Analytics, Processors, Collectors, ASUP, Sphere, and the Keystone Console, working together to deliver a seamless, reliable, and scalable customer experience.

Job Requirements

  • 5–8 years of software development or customer engineering experience, with at least 3 years in backend or technical support engineering roles
  • Strong proficiency in Go or Python (preferably both); ability to debug and contribute to production code
  • Working knowledge of React and TypeScript for diagnosing UI-layer issues
  • Strong understanding of distributed systems, microservices, and event-driven architectures
  • Hands-on experience with Kubernetes and Docker (log analysis, debugging, deployments)
  • Proficiency with REST and gRPC APIs; ability to isolate and debug failures
  • Experience with PostgreSQL and at least one NoSQL database; ability to write diagnostic queries
  • Familiarity with time-series databases (ClickHouse, InfluxDB, TimescaleDB)
  • Experience with Kafka or NATS (consumer lag, offsets, message flow debugging)
  • Hands-on experience with Prometheus, Grafana, and log aggregation tools
  • Working knowledge of CI/CD pipelines and Git workflows
  • Understanding of Agile, Scrum, and Lean methodologies
  • Strong written and verbal communication skills; ability to author clear RCA reports, runbooks, and customer updates

Role & Responsibilities

  • Own end-to-end resolution of customer issues across Keystone systems
  • Perform root cause analysis (RCA) and act as the directly responsible individual (DRI) to drive incident resolution
  • Deliver fixes and enhancements; involve dev teams for major changes
  • Improve reliability, observability, and error handling
  • Build diagnostics and runbooks to reduce MTTR and drive prevention
  • Collaborate across teams and customers to enhance platform stability and experience

Education

  • IC - Typically requires a minimum of 5 years of related experience.
  • Bachelor of Science Degree in Computer Science, Electrical Engineering, or a related field; a Master’s Degree is preferred


Job Segment: Software Engineer, Cloud, Electrical Engineering, Technical Support Engineer, Computer Science, Engineering, Technology
