Senior Software Engineer - Conversational AI

2 Months ago • 10 Years + • Artificial Intelligence

Job Summary

Job Description

NVIDIA seeks a Senior Software Engineer to architect, implement, and optimize low-latency, full-duplex conversational AI pipelines. Responsibilities include building speech-to-speech models, designing multi-modal conversational agents, analyzing RAG and AI agent accuracy, and collaborating on new product features. The ideal candidate possesses 10+ years of experience in speech technology, LLMs, RAG, and agents, with strong Python/C++ skills and experience with microservices and scalable deployments. This role involves working on cutting-edge Digital Human solutions, leveraging NVIDIA's high-performance computing capabilities.
Must have:
  • 10+ years experience in relevant fields
  • Strong Python/C++ programming skills
  • Deep understanding of speech technologies (ASR, TTS)
  • Experience with LLMs, RAG, and conversational agents
  • Microservices and scalable deployment expertise
Good to have:
  • Experience with LangChain, LlamaIndex
  • Knowledge of ML/DL techniques
  • Familiarity with CUDA, cuDNN, TensorRT
  • Experience deploying models on various platforms

Job Details

Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. NVIDIA's technology is at the heart of the AI revolution, touching people across the planet by powering everything from self-driving cars and robotics to intelligent assistants. Come join the team and see how you can make a lasting impact on the world! We're looking to grow our company, and build our teams with the smartest people in the world. Join us at the forefront of technological advancement.

NVIDIA is looking for a highly experienced Senior Software Engineer to build next-generation Multimodal Conversational AI systems, driven by world-class, high-performance Speech and LLM models and orchestrated by Multimodal AI Agents, creating seamless experiences for our Digital Human solutions. If you're creative and passionate about solving real-world Conversational AI problems, come join us. You can check https://build.nvidia.com/nvidia/digital-humans-for-customer-service for a glimpse of what you could be working on.

What you’ll be doing:

  • Architect, implement, and optimize reliable, low-latency, full-duplex conversation pipelines and dialog systems that excel across various application areas and challenging environments.

  • Build and benchmark cascaded and unified speech-to-speech models and systems that reflect real human conversations.

  • Design, implement, and test domain-specific agents and workflows, along with a framework that supports multi-turn, multi-modal, multi-user conversations with LLM-driven agents.

  • Analyze the end-to-end accuracy and limitations of RAG and conversational AI agents, and recommend next steps and improvements.

  • Characterize performance and quality metrics across platforms for various AI and system components.

  • Collaborate with various teams on new product features and improvements to existing products. Customize and integrate the conversational AI framework with other NVIDIA products.

  • Participate in developing and reviewing code, design documents, use cases, and test plans, and help innovate, identify problems, recommend solutions, and perform triage in a collaborative team environment.

What we need to see:

  • Bachelor's degree or Master’s degree (or equivalent experience) in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math

  • 10+ years of experience, with strong hands-on exposure to building solutions that span Speech, LLM, RAG, and Agents.

  • Excellent programming skills in Python and/or C++, with ability to debug complex asynchronous systems

  • Deep understanding of various Speech technologies like VAD, ASR, TTS, Translation, End-to-End Speech Models, etc. to build conversation systems.

  • Experience working with RAG and LLM-based applications as a key part of building Dialog and Q&A systems. Additional exposure to LLM function calling, Information Retrieval, Vector Databases, Embedding and Rerank models, autonomous agents, etc. is welcome.

  • Understanding of scalable deployment of multiple microservices involving Speech components and LLM-driven RAG and Agent applications in a production environment.

  • Experience working with protocols and transports like HTTP REST, gRPC, WebSockets, WebRTC, etc.

  • Hands-on experience with building microservices and client-server applications.

  • Familiarity with Docker, Helm, Kubernetes, etc.

  • Experience working on the end-to-end software lifecycle, release packaging, and CI/CD pipelines.

  • General background in version control and code review tools like Git, Gerrit, and GitLab.

Ways to stand out from the crowd:

  • Strong fundamentals in Programming, Optimizations and Software design

  • Experience working with open-source frameworks like LangChain and LlamaIndex for building LLM-driven applications

  • Strong knowledge of ML/DL techniques, algorithms and tools with exposure to Speech and Language Models

  • Familiarity with GPU-based technologies like CUDA, cuDNN, and TensorRT

  • Background in deploying machine learning models on data center, cloud, and embedded systems

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
