Home
/
Comprehensive
/
Deep Learning Performance Architect
Deep Learning Performance Architect-December 2024
Shanghai
Dec 14, 2025
ABOUT NVIDIA
NVIDIA is a computing platform company, innovating at the intersection of graphics, HPC, and AI.
10,000+ employees
Technology
VIEW COMPANY PROFILE >>
About Deep Learning Performance Architect

  We are looking for a first-class DL Performance architect to drive the performance analysis and optimization of the state of art inference network on our GP: identify HW, SW performance limiters of DL networks, prototype the key primitives and guide the design of next generation architecture and DL software optimization.

  What you’ll be doing:

  Establish deep learning applications and use-cases for performance analysis, modelling, and projections

  Analyzing and proposing both SW and HW optimizations for deep learning applications

  Specify hardware/software configurations and metrics to analyze performance, power, accuracy and resiliency in existing and future uni-processor and multiprocessor configurations

  Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, library, and compiler teams

  What we need to see:

  MS or PhD in relevant discipline (CS, EE, Math) or equivalent experience with 2+ years of experience

  Track record of designing architectures to accelerate computational demanding algorithms and applications

  Strong background in computer architecture

  Expert mathematical foundation in machine learning and deep learning

  Strong programming skills in C, C++, Perl, or Python

  Ways to stand out from the crowd:

  Prior experience working on assembly level performance optimization

  Experience working with deep learning frameworks like Caffe, TensorFlow and Torch

  Familiarity with GPU computing (CUDA, OpenCL) and HPC (MPI, OpenMP)

  Background with systems-level performance modeling, profiling, and analysis

  Experience in characterizing and modeling system-level performance, executing comparison studies, and documenting and publishing results

  NVIDIA is a Learning Machine

  NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and the metaverse is transforming the world's largest industries and profoundly impacting society.

  Learn more about NVIDIA .

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
BestChoice-Regional Float ICU RN
BestChoice-Regional Float ICU RN Contingent- 24 hours per month Shift: Days and Nights Available Not ready to apply? Use this here (https://outlook.office365.com/owa/calendar/AlexandraThompsonBlade@h
Radiology Tech Aide- Pool/Per Diem- Mercy Fitzgerald
Employment Type: Part time Shift: Rotating Shift Description: Mercy Fitzgerald, a member of Trinity Health Mid-Atlantic, is looking for a Radiology Tech Aide to join our Radiology team! Employment Ty
994442 - Faculty Non-Tenure Track-9Mo Psychology
Job Title:  Faculty Non-Tenure Track-9 Mo Physical Location:  Trumbull Campus - Trumbull, OH Salary:   Basic Function:   Kent State University at Trumbull invites applications for a full-time, nine-m
Inbound Sales Consultant
Takes inbound sales calls. Processes orders, answer questions and sells products/services. Identifies sales opportunities by learning about the customer’s business and links customer needs to the pro
Sr. Automation Tester with READY API
Cognizant Technology Solutions is looking for Automation Lead to join the team of IT professionals in a permanent role. If you meet our background requirements and skills and are looking for an oppor
Substation Designer P&C
JOB REQUIREMENTS: Substation Designer - P&C Job LocationsUS-WI-Middleton ID 2023-5155 Category Electrical Engineering 2ndCategory Substation Position Type Regular Division Renewables OverviewSubs
Software Development Engineer, CDS (Core Device Software)
Description The Amazon Devices team designs and engineers high-profile consumer electronics, including the best-selling Kindle family of products. We have also produced groundbreaking devices like Fi
Operating Model and Organizational Design Analyst (all genders)
Wie müssen Unternehmen aufgebaut sein, damit sie Ihre Strategien umsetzen und im Wettbewerb den entscheidenden Vorteil erreichen können? Mit welchen Maßnahmen können Unternehmen den Spagat zwischen g
Linux Systems Administrator (Active Top Secret Clearance Required)
Req ID: RQ165309 Type of Requisition: Regular Clearance Level Must Be Able to Obtain: Top Secret SCI + Polygraph Job Family: Systems Administration Skills: Computer Systems,Database Server,Documentat
Farm Worker / Robert Lee Menees
JOB DOES NOT START UNTIL, March 15, 2024.APPLICANT SHOULD RECEIVE A COPY OF THE JOB DESCRIPTION AND JOB REFERRAL BEFORE CALLING EMPLOYER CONTACT. All workers should be physically able to meet and per
Copyright 2023-2025 - www.zdrecruit.com All Rights Reserved