Solutions Architect, Retrieval Augmented Generative AI-May 2024
Remote
May 21, 2026
ABOUT NVIDIA
NVIDIA is a computing platform company, innovating at the intersection of graphics, HPC, and AI.
10,000+ employees
Technology
About Solutions Architect, Retrieval Augmented Generative AI

  Do you want to be part of the team that brings Artificial Intelligence (AI) emerging technology to the field? We are looking for a Solution Architect or Data Scientist to join the NVIDIA AI Specialist team focused on Generative AI and Retrieval Augmented Generation (RAG). If you are passionate about Generative AI and how it can be applied to solve real-world problems, we should talk. NVIDIA is the world leader in GPU accelerated computing and AI, and is looking for developers like you to design and build enterprise RAG solutions using our newest technology. As a member of the AI Specialist Solution Architecture team, you will work closely with customers and partners to solve hard problems across industries and build and deploy AI solutions in production at scale.

  What you’ll be doing:

  A big part of our day-to-day job is developing end-to-end RAG solutions and recipes that enable domain-specific Enterprise use cases. We work with customers to successfully adopt NVIDIA AI microservices and APIs by providing deep technical product and engineering expertise.

  Some of the hands-on development activities include:

  Develop, train, fine-tune, and deploy multimodal large language models for retrieval augmented generation.

  Apply instruction tuning, reinforcement learning from human feedback (RLHF), and parameter-efficient fine-tuning techniques such as p-tuning, adapters, and LoRA to improve LLMs for different domain-specific RAG use cases.

  Measure and benchmark model and application performance. Analyze model accuracy and bias, and recommend the next course of action and improvements.

  As we work with customers across multiple industries, we identify common trends that lead to success. With this knowledge, we help improve NVIDIA products and build creative solutions to overcome any adoption challenges.

  We contribute to the wider organization and community by sharing our expert knowledge with others. This can vary from building hands-on training to writing papers, developer blogs, and teaching.

  What we need to see:

  Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science, or a similar field, or equivalent experience.

  5+ years of experience demonstrating an established track record in Deep Learning and Machine Learning as well as experience with GPUs.

  Strong analytical and problem-solving skills.

  Excellent programming skills with strong fundamentals in optimization, software design, and debugging, including experience with Python, Bash, Linux, and cloud services.

  Experience working with DevOps and MLOps including but not limited to Docker/Containers, Kubernetes, and Data Center or Cloud AI deployments.

  Ability to multitask effectively in a dynamic environment.

  Clear written and oral communication skills with the ability to effectively collaborate with executives and engineering teams.

  Successful candidates will be able to demonstrate a strong desire to share knowledge with clients, partners, and co-workers.

  Ways to stand out from the crowd:

  Experience working with RAG technologies such as LLM frameworks (LangChain and LlamaIndex), LLM model registries (Hugging Face), LLM APIs, embedding models, and vector databases (FAISS and Milvus).

  Demonstrated expertise and hands-on experience with NVIDIA AI products. Some products of interest include Natural Language Processing and Large Language Models (NVIDIA NeMo (https://www.nvidia.com/en-us/gpu-cloud/nemo-llm-service/)), LLM inferencing (NVIDIA Triton (https://developer.nvidia.com/triton-inference-server)), Recommender systems (NVIDIA Merlin (https://developer.nvidia.com/merlin)), and Generative AI technologies (AI Foundations (https://catalog.ngc.nvidia.com/ai-foundation-models) and GenAI Examples (https://github.com/NVIDIA/GenerativeAIExamples)).

  Experience with and understanding of the latest Deep Learning architectures and training techniques, for example Transformer models and the latest customization techniques such as prompt engineering, p-tuning, and reinforcement learning from human feedback (RLHF).

  Leadership experience working with customers and managing large projects with multiple collaborators.

  Show willingness and ability to dig into unfamiliar territories to solve complex problems relying on experience from previous work.

  Above all, you will be part of the team that helps bring NVIDIA technology to life in the Enterprise! We empower you and give you the tools to achieve this with the backing of all of NVIDIA, including other Solution Architects, Product, and Engineering teams. You’ll get to be the face and trusted expert advisor that our customers rely on.

  The base salary range is 144,000 USD - 270,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

  You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/). NVIDIA accepts applications on an ongoing basis.

  NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

  NVIDIA is a Learning Machine

  NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and the metaverse is transforming the world's largest industries and profoundly impacting society.

  Learn more about NVIDIA.
