Principal Site Reliability Engineer - Military veterans preferred

2025-04-21
Nevro
Other

/yr

  employee   contract


Redwood City
California
94065
United States


Principal Site Reliability Engineer

US-CA-Redwood City

Job ID: 2025-4870
Type: Regular Full-Time
# of Openings: 1
Category: Research & Development
HQ

Overview

Build and maintain AWS infrastructure and developer tooling. Write terraform infrastructure modules. Perform Github actions. Support, monitor, and manage cloud infrastructure via Infrastructure as Code (terraform). Use Object oriented Languages, including C# and Java. Contribute to Nevro’s Cloud roadmap and strategy that scales horizontally and provide balance between quality, efficiency, and usability through automation and developer efficiency. Define Site Reliability Engineer (SRE) Handbook and best practices. Define Runbooks and developer processes. Participate in on-call rotation to resolve site incidents and document findings into repeatable procedures. Work cross-functionally with departments such as Marketing, Regulatory and Quality. Guide implementation of SRE practices in total product development cycle. Assist in CI/CD tooling development and guide the software development lifecycle. Prepare reports and presentations and document progress with senior management. Share ownership with Web Services team to create shared responsibility where SRE owns availability of service (SLOs/KPIs) and establishing monitoring and alerting practices. Establish SLOs/KPIs. Define Service Level Objectives to assess release readiness of all services. Lead and define significant/whole portions of planning, developing, coordinating, and directing development across complex products, directing internal and external resources. Contribute software/scripts to enable easier operational support for other SREs and developer using scripting technologies, including Powershell, Python, and bash. Perform networking, load balancing, DNS, and security configurations Identify, document, and help improve performance and operational efficiency challenges. Monitor production systems using tools such as Datadog and Grafana. Provide and support computing infrastructure (Infra-as-Code), including Terraform and AWS CloudFormation. Document developer processes. Configure and manage monitoring using tools like Datadog, Grafana. Create dashboards, monitors, alerts. Position allows for telecommuting from anywhere in US.

#LI-DNI



Responsibilities

Bachelor’s degree in Computer Science, Electronics Engineering, or Computer Applications and 10 years of progressive experience as a development engineer, site reliability engineer, or any occupation in development engineering

Must possess 10 years of experience with:

  • CI/CD tools (including Jenkins, GitHub Actions)
  • Scripting technologies, including Powershell, Python, and bash
  • Object oriented Languages, including C# and Java
  • Networking, load balancing, DNS, and security configurations

Must possess 4 years of experience with:

  • Git
  • Infra-as-Code (including Terraform and AWS CloudFormation)
  • Production systems monitoring using tools such as Datadog and Grafana

AWS Solutions Architect Associate certification

Position allows for telecommuting from anywhere in US.



Nevro offers equal employment opportunity, regardless of race, color, creed, religion, national origin, marital or family status, sex, sexual orientation, gender expression (including religious dress and grooming practices), gender (including pregnancy, childbirth or medical condition related to pregnancy or childbirth), physical or mental condition, disability, age or other characteristics protected by laws.



Equal employment opportunity, including veterans and individuals with disabilities.

PI268362847