Home
/
Comprehensive
/
Site Reliability Engineer
Site Reliability Engineer-June 2024
Santa Clara
Jun 20, 2025
ABOUT NVIDIA
NVIDIA is a computing platform company, innovating at the intersection of graphics, HPC, and AI.
10,000+ employees
Technology
VIEW COMPANY PROFILE >>
About Site Reliability Engineer

  NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Join the team and see how you can make a lasting impact on the world. NVIDIA is looking to hire a deeply technical, creative, and Senior Site Reliability Engineer to build , support and maintain the next generation AI powered enterprise products that improve engineering efficiency, data security, and power our product development. This role will give an opportunity to collaborate with Cloud and AI/ML workforce in a dynamic and agile working environment.

  What you will be doing:

  Collaborate on translating business objectives into actionable plans.

  Address operational challenges, automate processes, and iterate for efficiency.

  Contribute to implementing AI technologies and building/maintaining AI tools.

  Monitor, optimize, and manage system performance and resources.

  Institute validated practices for reliability, remediations, and troubleshooting.

  Design, deploy, and automate production support, documenting essential knowledge.

  Navigate intricate tasks with a deep understanding of SRE principles.

  Lead cross-organizational projects from inception to completion.

  Mentor and train junior engineers for professional development.

  What we need to see:

  6+ years of working experience in cloud, platform or SRE roles

  A Bachelors or Masters Degree in an Engineering or Computer Science or related field or equivalent experience

  Proficient in one or more programming languages: Python, Go, Perl, or Ruby.

  Hands-on experience handling and scaling distributed systems in a public, private, or hybrid cloud environment 24x7x365.

  Hands-on experience in deploying, supporting, and supervising new and existing services, platforms, and application stacks.

  Experience with CI/CD systems such as Jenkins, GitHub Actions, etc.

  Experience with Infrastructure as Code (IaC) methodologies and relevant tools.

  Extensive experience working with MS Windows Server and/or Linux operating systems.

  Solid communication skills, demonstrating the ability to comprehend and articulate technical issues to a non-technical audience

  Serve as a subject matter expert in core team functions.

  Ways to stand out from the crowd:

  Cloud expertise in Azure and AWS.

  Passionate and experienced in AI methodologies.

  Strong background in software design and development.

  Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

  Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com/

  The base salary range is 128,000 USD - 247,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

  You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) . NVIDIA accepts applications on an ongoing basis.

  NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

  NVIDIA is a Learning Machine

  NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and the metaverse is transforming the world's largest industries and profoundly impacting society.

  Learn more about NVIDIA .

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Network Development Engineer
Description The Amazon Robotics Infrastructure Engineering team is looking for a Network Development Engineer (NDE) to join our team. We build and operate the network and services that enable Amazon
Area Coach
You are applying for work with a franchisee of Taco Bell, not Taco Bell Corp. or any of its affiliates. If hired, the franchisee will be your only employer. Franchisees are independent business owne
2024 Product Manager Intern
Description We’re on the lookout for the curious, those who think big and want to define the world of tomorrow. At Amazon, you will grow into the high impact, visionary person you know you’re ready t
Access Control
Summary Access Control personnel support the MDA Security and Emergency Management Directorate (DSS) in executing multiple Security Operations Services at MDA facilities in the United States.Responsi
Team Member - Food Champion
Work today, get paid today? Yes!! Apply and learn how! Hospitality Restaurant Group (Taco Bell) is looking for Food Champions who love serving customers and want to further their professional careers
Mortgage Loan Officer
Mortgage Loan OfficerJob Locations US-PA-LebanonRequisition ID2023-18581 Location NameDowntown Lebanon CountyLebanon CategoryMortgage Banking Position Type (Portal Searching)Full-Time FLSA StatusNon-
Software Development Engineer, Amazon Stores
Description Come build the future as a Software Development Engineer at Amazon, where you will be inspired working along best-in-class inventors and innovators! You will have the opportunity to creat
Elektroniker / Mechatroniker / Industriemechaniker (m/w/d)
Description Der Schwerpunkt dieser Rolle liegt in der vorbeugenden Wartung und Instandhaltung unserer förder- und gebäudetechnischen Anlagen sowie der Einhaltung aller Sicherheitsvorschriften und –ri
Engineering Operation Technician, InfraOps DCEO
Description Amazon is seeking a collaborative electrical maintenance professional for our Amazon Web Services Team. This engineering operations technician position serves as the on-site maintenance t
Paid Services Account Manager (Shanghai/Hangzhou)
Description Amazon provides enterprises the opportunity to sell their goods on the Amazon platform worldwide, and more than 2 million sellers have been using this Marketplace service today. Amazon is
Copyright 2023-2025 - www.zdrecruit.com All Rights Reserved