Home
/
Data and Analytics
/
Global Research Postdoctoral Fellow - AI for Patient Stratification and Precision Medicine
Global Research Postdoctoral Fellow - AI for Patient Stratification and Precision Medicine-October 2024
Cambridge
Oct 29, 2025
ABOUT SANOFI U.S.
People at Sanofi are dedicated to making a difference in patients’ daily lives, enabling them to enjoy a healthier life.
10,000+ employees
Biotechnology
VIEW COMPANY PROFILE >>
About Global Research Postdoctoral Fellow - AI for Patient Stratification and Precision Medicine

  Job Title:

  Global Research Postdoctoral Fellow - AI for Patient Stratification and Precision Medicine

  Our Team:

  The Computational Biology Cluster and the Biomarker & Patient Stratification Cluster are part of the Precision Medicine & Computation Biology (PMCB) global research function at Sanofi. This multidisciplinary group has developed a wealth of biomedical and multi-omics data and is at the frontier of developing advanced AI methodologies that influence the next generation of precision medicine. We seek a motivated postdoctoral fellow to work within the two cluster teams on next generation of foundational models for biology.

  Job Description:

  As a postdoctoral fellow, you will play a critical role in advancing target discovery and patient stratification methods through the harnessing of deep learning models that integrate multi-modal data.

  Your responsibilities will include:

  Collaborating closely with interdisciplinary teams to develop and optimize foundational deep learning models for identifying patient subgroups.

  Utilizing large datasets including genetic, transcriptomic, single cell, proteomic, imaging and electronic medical record data to train foundational deep learning models on disease specific context.

  Contribute to enhancing the precision of patient stratification and disease endotyping.

  Contributing to the identification and prioritization of targeted treatments for heterogeneous diseases.

  Developing biomarkers that facilitate the classification of patients into identified endotypes.

  Participating and leading manuscript preparations and submission, and presentation of findings in international forums internally and externally.

  Minimum Required Skills:

  Solid experience in deep learning and artificial intelligence methodologies.

  Knowledge and experience with foundational/generative AI models for omics or medical record data.

  Proficient in machine learning and deep learning frameworks such as TensorFlow or PyTorch.

  Proficiency in programming languages such as Python or R.

  Solid understanding of computational biology, bioinformatics, or a related field.

  Demonstrated ability in data analysis and visualization.

  Strong communication skills and ability to work in a collaborative multidisciplinary research environment.

  Preferred Skills:

  Knowledge of patient stratification and precision medicine concepts.

  Experience working with multi-omics data is a plus.

  Multi-GPU cloud computing experience

  Education:

  Ph.D. in Computational Biology, Bioinformatics, Computer Science, or a closely related field.

  Project Description:

  Patient stratification is a crucial component of precision medicine, as it enables the identification of distinct patient subgroups with different underlying molecular disease profiles, clinical manifestations and response to therapy. Stratification allows for the development of targeted treatments that are more effective, breaking observed efficacy ceilings while reducing adverse events for patients. For heterogenous diseases like Inflammatory Bowel Disease or Parkinson's disease (which has limited disease-modifying treatments available) patient stratification is especially important to identify the right targets for the right patients and thus improve clinical efficacy rates.

  With the availability of large amounts of multi-omics and electronic health record (EHR) data deep learning models can leverage an unprecedented wealth of information to identify patient subgroups that were previously unrecognized. However, one limitation of current patient stratification methods is that they typically consider only one layer of data, leaving a lot of value unused. By using deep learning models that can integrate multiple layers of data, we can improve the accuracy and precision of patient stratification.

  One challenge is that deep multi-omics and EHR data are rarely available together in the same cohort. Another challenge is that developing complex multi-omics biomarker models can be costly and difficult to translate into a clinical setting. To address these challenges, we propose to investigate deep learning models that can derive patient endotypes from multi-omics and real-world data, in disease specific context. We will consider foundational deep learning models that can be used to infer omics layers when they are not available and identify targets associated with disease progression in specific endotypes. Finally, we will develop simple biomarkers that can be used in a clinical setting to classify patients into the identified endotypes and inform on treatment decisions.

  We will explore various foundational models, including autoencoders, recurrent autoencoders, and adversarial neural networks. Foundational autoencoder models have already been developed for single cell multi-omic data integration (Lotfollahi et al.) and several models have been proposed for EHR (Landi et al.), including recurrent autoencoders that can be used to simulate records (Merkelbach et al.). Stanford HAI provides a recent review of existing method in this last domain. We will adapt and combine these approaches to integrate genetic, single cell, bulk transcriptomic, proteomic and EHR data, as well as other data type as relevant. These models will be used for patient clustering and visualization, and developing simpler classification models for clinical biomarkers. Furthermore, we will use these models to infer missing omics and medical record layers and to identify therapeutic targets.

  We will use a variety of datasets, including EHR and genetics data from private cohort data, to train pan-disease foundational models and disease specialized models. One of the important aspects of the project is to correct and adapt models across different technologies, populations and healthcare systems. This will enable us to develop models that are robust and applicable across different healthcare settings, thereby facilitating the translation of our findings into clinical practice.

  References :

  Lotfollahi, Mohammad, Anastasia Litinetskaya, and Fabian J. Theis. "Multigrate: single-cell multi-omic data integration." BioRxiv (2022): 2022-03.

  Landi, Isotta, et al. "Deep representation learning of electronic health records to unlock patient stratification at scale." NPJ digital medicine 3.1 (2020): 96.

  Merkelbach, Kilian, et al. "Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups." Scientific Reports 13.1 (2023): 4053.

  Sanofi Inc. and its U.S. affiliates are Equal Opportunity and Affirmative Action employers committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race; color; creed; religion; national origin; age; ancestry; nationality; marital, domestic partnership or civil union status; sex, gender, gender identity or expression; affectional or sexual orientation; disability; veteran or military status or liability for military status; domestic violence victim status; atypical cellular or blood trait; genetic information (including the refusal to submit to genetic testing) or any other characteristic protected by law.

  #GD-SA

  #LI-SA

  At Sanofi diversity and inclusion is foundational to how we operate and embedded in our Core Values. We recognize to truly tap into the richness diversity brings we must lead with inclusion and have a workplace where those differences can thrive and be leveraged to empower the lives of our colleagues, patients and customers. We respect and celebrate the diversity of our people, their backgrounds and experiences and provide equal opportunity for all.

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Staff Data Engineer - Business Operations
Data Engineers at Riot bring deep knowledge of specific technical areas and also value the opportunity to work in many broader domains. Our engineers are player-focused and aim to find solutions that
Data Scientist I
We are seeking a driven and analytically minded Data Scientist I with a strong data science foundation and expertise in machine learning and artificial intelligence. The ideal candidate will be a bri
Motorsports DiL Systems Engineer
Description Onsite - Position requires an employee to be onsite on a full-time basis. The Role We are seeking a highly skilled and passionate DiL (Driver in the Loop) Systems Engineer to join our dyn
Research Analyst
Competition Number: REQ 5690 TITLE: Research Analyst DIVISION: Strategic Planning and Institutional Analysis SALARY: Payband H, starting rate $35.06 per hour HOURS: 9:00 am to 5:00 pm HOURS PER WEEK:
Contract Administrator
As a Contract Administrator, you will play an integral role in the administration of the contract, with a primary focus on managing and analyzing data. If you enjoy working with large data sets, deve
Global Internal Audit & Advisory Compliance Manager
TransUnion's Job Applicant Privacy Notice Personal Information We Collect Your Privacy Choices What We'll Bring: At TransUnion, we have a welcoming and energetic environment that encourages collabora
Senior Data Engineer
Job Description Note: Contractors (C2C, C2H) that directly apply will not be considered. Individual applicants only Spokeo is a people search engine and identity platform that enlightens and empowers
AI Research Engineer, New Graduate PhD (2023-2024)
Our mission at Duolingo is to develop the best education in the world and make it universally available. But we’ve got more left to do — and that’s where you come in! Duolingo is the most popular lan
Senior Analyst, Product or Marketing (Bangalore, India) - Multiple Headcount
Join Skillz and Level Up Your Career! Are you ready to take your career to the next level? Join Skillz, the first publicly-traded mobile esports platform that hosts billions of casual mobile gaming t
Staff Machine Learning Engineer, Gen AI
  On-site work model with 5 days in office/week in Sunnyvale, CA  Targeted hire date of February 1, 2024 About the Team: Illumio's new Machine Learning (ML) team embodies a culture of thought leaders
Copyright 2023-2025 - www.zdrecruit.com All Rights Reserved