Solutions Architect, Retrieval Augmented Generative AI-August 2024
Remote
Aug 2, 2025
ABOUT NVIDIA
NVIDIA is a computing platform company, innovating at the intersection of graphics, HPC, and AI.
10,000+ employees
Technology
About Solutions Architect, Retrieval Augmented Generative AI

  Do you want to be part of the team that brings Artificial Intelligence (AI) emerging technology to the field? We are looking for a Solution Architect or Data Scientist to join the NVIDIA AI Specialist team focused on Generative AI and Retrieval Augmented Generation (RAG). If you are passionate about Generative AI and how it can be applied to solve real-world problems, we should talk. NVIDIA is the world leader in GPU accelerated computing and AI, and is looking for developers like you to design and build enterprise RAG solutions using our newest technology. As a member of the AI Specialist Solution Architecture team, you will work closely with customers and partners to solve hard problems across industries and build and deploy AI solutions in production at scale.

  What you’ll be doing:

  A big part of our day-to-day job is developing end-to-end RAG solutions and recipes that enable domain-specific Enterprise use cases. We work with customers to successfully adopt NVIDIA AI microservices and APIs by providing deep technical product and engineering expertise.

  Some of the hands-on development activities include:

  Develop, train, fine-tune, and deploy multimodal large language models for retrieval augmented generation.

  Apply instruction tuning, reinforcement learning from human feedback (RLHF), and parameter-efficient fine-tuning techniques such as p-tuning, adapters, and LoRA to improve LLMs for different domain-specific RAG use cases.

  Measure and benchmark model and application performance. Analyze model accuracy and bias, and recommend next steps and improvements.

  As we work with customers across multiple industries, we identify common trends that lead to success. With this knowledge, we help improve NVIDIA products and build creative solutions to overcome any adoption challenges.

  We contribute to the wider organization and community by sharing our expert knowledge with others. This can vary from building hands-on training to writing papers, developer blogs, and teaching.

  What we need to see:

  Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science, or a similar field, or equivalent experience.

  5+ years of experience demonstrating an established track record in Deep Learning and Machine Learning as well as experience with GPUs.

  Strong analytical and problem-solving skills.

  Excellent programming skills with strong fundamentals in optimization, software design, and debugging, including experience with Python, Bash, cloud services, and Linux.

  Experience working with DevOps and MLOps, including but not limited to Docker/containers, Kubernetes, and data center or cloud AI deployments.

  Ability to multitask effectively in a dynamic environment.

  Clear written and oral communication skills with the ability to effectively collaborate with executives and engineering teams.

  Successful candidates will be able to demonstrate a strong desire to share knowledge with clients, partners, and co-workers.

  Ways to stand out from the crowd:

  Experience working with RAG technologies such as LLM frameworks (LangChain and LlamaIndex), model registries (Hugging Face), LLM APIs, embedding models, and vector databases (FAISS and Milvus).

  Demonstrated expertise and hands-on experience with NVIDIA AI products. Products of interest include Natural Language Processing and Large Language Models (NVIDIA NeMo (https://www.nvidia.com/en-us/gpu-cloud/nemo-llm-service/)), LLM inferencing (NVIDIA Triton (https://developer.nvidia.com/triton-inference-server)), Recommender systems (NVIDIA Merlin (https://developer.nvidia.com/merlin)), and Generative AI technologies (AI Foundations (https://catalog.ngc.nvidia.com/ai-foundation-models) and GenAI Examples (https://github.com/NVIDIA/GenerativeAIExamples)).

  Experience with and understanding of the latest Deep Learning architectures and training techniques, for example Transformer models and the latest customization techniques such as prompt engineering, p-tuning, and reinforcement learning from human feedback (RLHF).

  Leadership experience working with customers and managing large projects with multiple collaborators.

  Willingness and ability to dig into unfamiliar territory to solve complex problems, drawing on experience from previous work.

  Above all, you will be part of the team that helps bring NVIDIA technology to life in the Enterprise! We empower you and give you the tools to achieve this with the backing of all of NVIDIA, including other Solution Architects, Product, and Engineering teams. You’ll get to be the face and trusted expert advisor that our customers rely on.

  The base salary range is 144,000 USD - 270,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

  You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/). NVIDIA accepts applications on an ongoing basis.

  NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

  NVIDIA is a Learning Machine

  NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and the metaverse is transforming the world's largest industries and profoundly impacting society.

