Home
/
Software Engineering
/
Codec Avatars Large Scale Experimentation Lead
Codec Avatars Large Scale Experimentation Lead-July 2024
Redmond
Jul 14, 2025
ABOUT META
We’re building a team as diverse as the communities and billions of people we serve every day. Our teammates don’t need to conform here. Lived experiences are an asset, and we value your unique perspe
10,000+ employees
Social Media, Technology
VIEW COMPANY PROFILE >>
About Codec Avatars Large Scale Experimentation Lead

  Meta Reality Lab's Codec Avatar Research team is building technology to enable immersive, photorealistic social presence. Codec Avatars are real-time live-drivable representations that match the appearance of their users. As part of the Lab's Instant Codec Avatar group, you'll work to scale up Codec Avatar technology by modeling the diversity of human appearance and applying that model to the process of rapidly generating new avatars.This role is focused on our Large Scale Experimentation efforts, which both support our new Research Supercluster compute resource and uses that resource to run large-scale machine learning experiments that advance the state-of-the-art in Codec Avatar technology. In this role, you will lead a team of software engineers, research engineers, and research scientists to plan and deliver software systems needed to support large scale model training over thousands of GPUs. These systems ingest, store, and serve some of the largest ML training datasets in the world, and coordinate complex workflows composed from a mixture of traditional graphics and ML algorithms. You'll also plan, design and execute research experiments using those workflows to advance our understanding of how appearance modeling scales over large populations.

  Codec Avatars Large Scale Experimentation Lead Responsibilities:

  Develop and debug machine learning workflows on a large multi-node clusterAutomation of data ingress into clusterImplement compute allocation policy for the clusterDefine and implement strategy for compute environment management and deploymentDevelopment of data read/access layer using proprietary frameworkDefine and communicate cluster software requirements, based on research needsEnabling adoption of the cluster by additional research casesDefinition, design and implementation of automated testingPoint of contact for hardware & software questions regarding cluster capabilitiesReporting on progress, presenting technical risks, challenges and status to executive managementPartner with Data Collection and Asset Generation teams to specify and ingest assets required for large scale trainingPartner with Codec Avatars Universal Avatar Research team to support large scale experimentation based on Python workflowsPartner with Research SuperCluster production engineering team to support reliable operationPartner with Research SuperCluster storage engineering team to support development of features required for Codec Avatars datasetsPartner with security, privacy, and policy teams to ensure workflow compliance with company policyMinimum Qualifications:

  Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.Experience providing technical leadership for teams of 5 or more engineersExperience with multi-node ML training workflows and frameworksExperience developing and debugging distributed systemsExperience operating in a self-directed environment with multiple stakeholders across multiple teamsProven communication skills, including experience driving decision makingExperience working with cross functional teams including hardware, software, network, legal, privacy and securityProven Python experienceProven Linux/shell scripting development experienceExperience developing and supporting reliable multi-stage data pipelinesProven quantitative reasoning skills, analyzing trade-offs of different hardware and software solutionsPreferred Qualifications:

  Experience providing technical leadership for teams of 12 or more engineersMasters or higher degree in Computer Science or related technical field, or equivalent experience8+ years of experience in ML or distributed systemsExperience developing or applying computer graphics algorithmsExperience developing or applying computer vision algorithms5+ years of experience developing workflows for large scale AI trainingUnderstanding of deep neural network trainingExperience with securing sensitive data (encryption, access control, audit logging)Experience with HPC (High Performance Computing)Experience with scheduling systems such as Slurm or KubernetesExperience with large scale object storage services (S3 or similar)Experience in research or converting research to productsExperience using gitExperience using CondaSQL databases experienceModern C++ development experienceAbout Meta:

  

  Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today-beyond the constraints of screens, the limits of distance, and even the rules of physics.

  Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

  Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at [email protected].

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Software Engineer (Hybrid)
Software Engineer - IE08DE We're determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to
Staff Software Engineer - Backend (Growth Data Platform Team)
Hinge Health is creating a new health care system, built around you. Accessible to 26 million members across 1,500 customers, Hinge Health is the #1 digital clinic for joint and muscle pain, deliveri
Senior Software Engineer, Experience Containerization
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers a
Software Engineer - Full Stack
OVERVIEW This position can be based out of San Francisco or New York City We're looking for Full-Stack Software Engineers to join our Engineering team. In this role, you will build innovative payment
Software Engineer - Card Processing and Authorisation
Company Description Checkout.com is one of the most exciting FinTechs in the world. Our mission is to enable businesses and their communities to thrive in the digital economy. We’re the strategic pay
Engineering Manager - Corlu IC
ABOUT UNILEVER With 3.4 billion people in over 190 countries using our products every day, Unilever is a business that makes a real impact on the world. Work on brands that are loved and improve the
Sr. Manager, Analytics Engineer - Biopharma
ROLE SUMMARY: Pfizer is seeking hardworking, passionate and results-oriented individuals to join our Analytics Engineering team to build data foundations and tools to craft the future. You will desig
Lagerleiter*in (d/w/m)
DU BIST MEHR ALS DEIN JOB-TITEL. MEHR ALS ZAHLEN UND BUCHSTABEN IN DEINEM LEBENSLAUF. UND WIR SIND MEHR ALS EIN UNTERNEHMEN. WIE WÄR'S ALSO, WENN WIR UNS EINFACH ZUSAMMENTUN - UND GEMEINSAM NOCH MEHR
Software Developer in Test - Vice President
iCapital is powering the world’s alternative investment marketplace. Our financial technology platform has transformed how advisors, wealth management firms, asset managers, and banks evaluate and re
Site Reliability Engineer
At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join t
Copyright 2023-2025 - www.zdrecruit.com All Rights Reserved