Software Engineer, Systems ML - HPC Specialist
Software Engineer, Systems ML - HPC Specialist-March 2024
Menlo Park
Mar 28, 2026
ABOUT META
We’re building a team as diverse as the communities and billions of people we serve every day. Our teammates don’t need to conform here. Lived experiences are an asset, and we value your unique perspe
10,000+ employees
Social Media, Technology
About Software Engineer, Systems ML - HPC Specialist

  Summary:

  Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI infrastructure topics. The position involves applying these skills to solve some of the most crucial and exciting problems that exist on the web.

  Some aspects of this role as an HPC specialist may include authoring components such as cuBLAS, cuDNN, AITemplate, and FlashAttention, and developing runtimes such as an LLM disaggregated runtime. HPC specialists spend time optimizing programs to reduce accelerator idle time. They also develop debugging tools (such as cuda-gdb) and profilers that make use of accelerated computing hardware (such as the PEs and SFUs in MTIA or the Transformer Engine in H100). They are systems experts who can design, debug, and accelerate AI workloads from single-node scale-up to multi-node scale-out distributed systems. They are also able to influence the next generation of silicon architectures (such as Tensor Cores in V100 and the Transformer Engine in H100) based on evolving AI workload needs.

  We are hiring in multiple locations.

  Required Skills:

  Software Engineer, Systems ML - HPC Specialist Responsibilities:

  Apply relevant AI and machine learning techniques to build and optimize our intelligent systems that improve Meta's products and experiences

  Develop custom/novel architectures, define use cases, and develop methodology & benchmarks to evaluate different approaches

  Apply in-depth knowledge of how the machine learning system interacts with the other systems around it

  Assist in goal setting related to project impact, AI system design, and ML excellence

  Minimum Qualifications:

  Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.

  2+ years of experience in HPC and parallel computing.

  Proficiency in GPU programming using CUDA and familiarity with CUDA libraries (cuBLAS, cuDNN, etc.).

  Proven track record of leading successful HPC projects.

  Proven technical expertise in HPC architectures and technologies.

  Preferred Qualifications:

  PhD in Computer Science, Computer Engineering, or relevant technical field.

  Experience developing AI algorithms or AI-System infrastructure in C/C++ or Python.

  Experience developing AI compilers (such as TorchInductor in PyTorch 2.0).

  Public Compensation:

  $146,994/year to $208,000/year + bonus + equity + benefits

  Industry: Internet

  Equal Opportunity:

  Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

  Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at [email protected].

Copyright 2023-2026 - www.zdrecruit.com All Rights Reserved