Home
/
Comprehensive
/
Software Development Engineer III, ML_AI
Software Development Engineer III, ML_AI-February 2024
Santa Clara
Feb 14, 2026
ABOUT AMAZON
Our mission is to be the world’s most customer-centric company.
10,000+ employees
Technology
VIEW COMPANY PROFILE >>
About Software Development Engineer III, ML_AI

  Description

  At AWS AI, we want to make it easy for our customers to train their deep learning workloads in the cloud. With Amazon SageMaker, AWS is building customer-facing services to empower data scientists and software engineers in their deep learning endeavors. As our customers rapidly adopt LLMs and Generative AI for their business, we’re building the next-generation AI platform to accelerate their development. We’re seeking a dedicated engineering team lead to drive building our next-generation AI compute platform that’s optimized for LLMs and distributed training.

  As an SDE, you will be responsible for designing, developing, testing, and deploying distributed machine learning systems and large-scale solutions for our world-wide customer base. In this, you will collaborate closely with a team of ML scientists and customers to influence our overall strategy and define the team’s roadmap. You'll assist in gathering and analyzing business and functional requirements, and translate requirements into technical specifications for robust, scalable, supportable solutions that work well within the overall system architecture. You will also drive the system architecture, spearhead best practices that enable a quality product, and help coach and develop junior engineers. A successful candidate will have an established background in engineering large scale software systems, a strong technical ability, great communication skills, and a motivation to achieve results in a fast paced environment.

  Key job responsibilities

  About You:

  You are passionate about building platform and products for large scale deep learning model training (100+ billion parameter GPT, 1000s of GPU devices). You have a proven track record of bringing innovative research to customers. You are able to thrive and succeed in an entrepreneurial environment and not be hindered by ambiguity or competing priorities. Ownership, delivering results, thinking big and analytical leadership are essential to success in this role.

  You have solid experience in multi-threaded asynchronous C++/Go development. You have prior experience in one of: resource orchestrators like slurm/kubernetes, high performance computing, building scalable systems, experience in large language model training.

  A successful candidate will possess both technical and customer-facing skills that will allow you to be the technical “face” of AWS within our solution providers’ ecosystem/environment as well as directly to end customers. You will be able to drive discussions with senior technical and management personnel within customers and partners, as well as the technical background that enables them to interact with and give guidance to software developers and applied scientists.

  This is a great team to come to have a huge impact on AWS and the world's customers we serve!

  Every day will bring new and exciting challenges on the job while you:

  Build and improve next-generation AI platform

  Collaborate with internal engineering teams, leading technology companies around the world and open source community - PyTorch, NVIDIA/GPU

  Create innovative products to run at scale on the AI platform, and see them launched in high volume production

  About the team

  Inclusive Team Culture

  Here at AWS, we embrace our differences. We are committed to furthering our culture of inclusion. We have ten employee-led affinity groups, reaching 40,000 employees in over 190 chapters globally. We have innovative benefit offerings, and host annual and ongoing learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences. Amazon’s culture of inclusion is reinforced within our 14 Leadership Principles, which remind team members to seek diverse perspectives, learn and be curious, and earn trust.

  Work/Life Balance

  Our team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfillment. We offer flexibility in working hours and encourage you to find your own balance between your work and personal lives.

  Mentorship & Career Growth

  Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we’re building an environment that celebrates knowledge sharing and mentorship.

  We are open to hiring candidates to work out of one of the following locations:

  Santa Clara, CA, USA

  Basic Qualifications

  5+ years of non-internship professional software development experience

  5+ years of programming with at least one software programming language experience

  5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience

  Experience as a mentor, tech lead or leading an engineering team

  Preferred Qualifications

  5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience

  Bachelor's degree in computer science or equivalent

  Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.

  Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $134,500/year in our lowest geographic market up to $261,500/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. Applicants should apply via our internal or external career site.

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Medicaid Program Manager 3
Medicaid Program Manager 3 Print (https://www.governmentjobs.com/careers/louisiana/jobs/newprint/4358673) Apply  Medicaid Program Manager 3 Salary $6,271.00 - $12,296.00 Monthly Location Baton Rouge
Sr. Software Engineer
Job Description Senior Member of Technical Staff - Security Products Group Oracle Cloud Infrastructure Preferred Qualifications At Oracle Cloud Infrastructure (OCI), we build the future of the cloud
Sr. Manager Author, PDT Global Regulatory Affairs
By clicking the “Apply” button, I understand that my employment application process with Takeda will commence and that the information I provide in my application will be processed in line with Taked
Technology Technical Support Representative-Remote
Overview About TP Teleperformance is a global, digital business services company. We deliver the most advanced, digitally powered business services to help the world’s best brands streamline their bu
Senior Professional Staff Nurse, Peri anesthesia UPMC Carlisle
UPMC is committed to investing in nurses like you –financially, personally, and professionally –starting on day one of your career. From tackling student loans to advancing your career later in life,
Shift Lead
You are applying for work with a franchisee of Taco Bell, not Taco Bell Corp. or any of its affiliates. If hired, the franchisee will be your only employer. Franchisees are independent business owner
Patient Access Representative
Serving the needs of Garfield Heights and the Southeast communities of Cuyahoga County since 1949, Marymount Hospital is a 269-registered bed acute care, faith-based hospital and became the first reg
Construction Account Manager
Gordian is the leading provider of Building Intelligence™ Solutions, delivering unrivaled insights, robust technology and comprehensive expertise that fuel customers’ success during every phase of th
Assistant Store Leader Trainee
If you enjoy working as part of a management team and have previous supervisory experience, we would love the opportunity to talk with you about our Assistant Store Leader Trainee role! We’re hiring
Senior Team Counselor *Internal Candidates Only*
We make a difference - in your community and in your career. Senior Team Counselor - Adult Community Clinical Services (ACCS) Internal Candidates Only ACCS Integrated Teams provide clinical intervent
Copyright 2023-2026 - www.zdrecruit.com All Rights Reserved