Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles at NVIDIA

**Who this is for** This role is for a seasoned Deep Learning Software Engineer focused on optimizing large language and visual language models for real-time pe

Work type: onsite

Location: US, CA, Santa Clara

Type: Full-time

Summary

**Who this is for** This role is for a seasoned Deep Learning Software Engineer focused on optimizing large language and visual language models for real-time performance in autonomous vehicles. You will be crucial in bridging the gap between advanced AI research and practical, on-edge deployment. **Key highlights** You will be responsible for developing and implementing cutting-edge model optimization techniques to ensure complex AI models run efficiently and reliably on autonomous vehicle hardware. This involves deep profiling, compression, and scaling DL models across NVIDIA's edge architectures. **You might be a good fit if you...** - Have expert-level proficiency in ML frameworks like PyTorch or JAX, and experience with LLM/VLM inference stacks. - Possess a strong track record of training, deploying, or optimizing large-scale DL models in production. - Have deep familiarity with NVIDIA's DL SDKs, particularly TensorRT and CUDA, and understand GPU architecture. - Are passionate about making self-driving vehicles a reality and are ready to contribute to this impactful technology.

Job Description

NVIDIA is at the forefront of the AI revolution, specifically in the constantly evolving field of Embodied AI. We are seeking a high-caliber Deep Learning Engineer to bridge the gap between cutting-edge multimodal architectures and real-time robotic execution for autonomous vehicles. In this role, you will design and implement SOTA algorithms to make LLM/VLM fast, lean, and reliable enough to power an end-to-end driving stack. You won’t just be "running" models; you will be re-architecting them for the edge, ensuring that models capable of complex scene reasoning can operate within the strict latency and safety constraints of an AV compute platform.

What You’ll Be Doing:









What We Need to See:







Ways to Stand Out from the crowd:






At NVIDIA, we’re dedicated to making self-driving vehicles a reality and believe this technology can save millions of lives. Join a team of innovative thinkers at one of the world’s most respected technology companies. If you’re motivated, curious, and ready to make a difference, we’d love to meet you! We believe that building self-driving vehicles will be a defining contribution of our generation (e.g. traffic accidents are responsible for ~1.25 million deaths per year world-wide). We have the funding and scale, but we need your help on our team. NVIDIA is widely considered to be one of the technology world’s most desirable employers with some of the most forward-thinking people in the world working here. If you're entrepreneurial and autonomous, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and [benefits](https://www.nvidia.com/en-us/benefits/).

Applications for this job will be accepted at least until April 25, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

View this job on nocollar jobs