Home
/
Comprehensive
/
Site Reliability Engineer
Site Reliability Engineer-March 2024
Santa Clara
Mar 28, 2026
ABOUT NVIDIA
NVIDIA is a computing platform company, innovating at the intersection of graphics, HPC, and AI.
10,000+ employees
Technology
VIEW COMPANY PROFILE >>
About Site Reliability Engineer

  NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Join the team and see how you can make a lasting impact on the world. NVIDIA is looking to hire a deeply technical, creative, and Senior Site Reliability Engineer to build , support and maintain the next generation AI powered enterprise products that improve engineering efficiency, data security, and power our product development. This role will give an opportunity to collaborate with Cloud and AI/ML workforce in a dynamic and agile working environment.

  What you will be doing:

  Collaborate on translating business objectives into actionable plans.

  Address operational challenges, automate processes, and iterate for efficiency.

  Contribute to implementing AI technologies and building/maintaining AI tools.

  Monitor, optimize, and manage system performance and resources.

  Institute validated practices for reliability, remediations, and troubleshooting.

  Design, deploy, and automate production support, documenting essential knowledge.

  Navigate intricate tasks with a deep understanding of SRE principles.

  Lead cross-organizational projects from inception to completion.

  Mentor and train junior engineers for professional development.

  What we need to see:

  6+ years of working experience in cloud, platform or SRE roles

  A Bachelors or Masters Degree in an Engineering or Computer Science or related field or equivalent experience

  Proficient in one or more programming languages: Python, Go, Perl, or Ruby.

  Hands-on experience handling and scaling distributed systems in a public, private, or hybrid cloud environment 24x7x365.

  Hands-on experience in deploying, supporting, and supervising new and existing services, platforms, and application stacks.

  Experience with CI/CD systems such as Jenkins, GitHub Actions, etc.

  Experience with Infrastructure as Code (IaC) methodologies and relevant tools.

  Extensive experience working with MS Windows Server and/or Linux operating systems.

  Solid communication skills, demonstrating the ability to comprehend and articulate technical issues to a non-technical audience

  Serve as a subject matter expert in core team functions.

  Ways to stand out from the crowd:

  Cloud expertise in Azure and AWS.

  Passionate and experienced in AI methodologies.

  Strong background in software design and development.

  Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

  Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com/

  The base salary range is 128,000 USD - 247,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

  You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) . NVIDIA accepts applications on an ongoing basis.

  NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

  NVIDIA is a Learning Machine

  NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and the metaverse is transforming the world's largest industries and profoundly impacting society.

  Learn more about NVIDIA .

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Sous Chef (Sushi) - Market Cafe - Hyatt Regency Qingdao
Description: You will be responsible to assist with the efficient running of the department in line with Hyatt International's Corporate Strategies and brand standards, whilst meeting employee, guest
Assistant General Manager
You are applying for work with a franchisee of Taco Bell, not Taco Bell Corp. or any of its affiliates. If hired, the franchisee will be your only employer. Franchisees are independent business owner
Product Manager II: Proactive Digital Experiences
Overview Come join the QuickBooks Digital Experiences Team and lead the next chapter of digital self-help. The Digital Experiences team works closely with our core product, customer experience, and h
Project Engineer
smart people. smart ideas. smart choice. A thriving environment for learning, innovation and growth. Why do so many people join MTS Systems Corporation and stay for a career? Because this is a place
Speedco Diesel Technician Apprentice
Req ID: 432404 Address: 2689 Sidley Ct Austinburg, OH, 44010 Welcome to Love’s! * * Where People are the Heart of Our Success * * Diesel Technician Apprentice As with Love’s, Our Speedco values go be
Coding Denial Coordinator
Job Summary: The Coding Denial Coordinator is responsible, along with the Assistant Director and Associate Director, for coordinating workflows and assignment of denials to the denials team in Health
Technical Program Manager
Summary: Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed
EH & S ANALYST I
EH & S ANALYST I At TE, you will unleash your potential working with people from diverse backgrounds and industries to create a safer, sustainable and more connected world. Job Overview TE Connec
Materials Handler
This is a Public Notice for position being filled under the Direct Hiring Authority (DHA) for PERM position of Materials Handler (Forklift/Motor Vehicle Operator), WG-6907-06.Minot Civilian Personnel
Housekeeping
Job Number 24014799 Job Category Housekeeping & Laundry Location Courtyard Greenville Haywood Mall, 70 Orchard Park Drive, Greenville, South Carolina, United States Schedule Full-Time Located Rem
Copyright 2023-2026 - www.zdrecruit.com All Rights Reserved