Senior Software Engineer - AI Inference at NVIDIA

You're a seasoned software engineer with at least 5 years of experience building production software, bringing solid systems engineering fundamentals and a trac

Work type: remote

Location: US, CA, Santa Clara | US, TX, Remote | US, NY, Remote | US, CA, Remote

Salary: $152,000 – $287,500/yr

Type: Full-time

Summary

You're a seasoned software engineer with at least 5 years of experience building production software, bringing solid systems engineering fundamentals and a track record of performance or reliability improvements, ideally with a BS/MS in Computer Science or a related field. **What makes it worth a look...** NVIDIA is offering a fully remote Senior Software Engineer - AI Inference role with an annual salary range of $152,000 to $287,500 USD. You'll be advancing open-source LLM serving, focusing on making inference engines like vLLM and SGLang run best-in-class on NVIDIA GPUs. **You might be a good fit if you...** * Have experience with LLM inference/serving stacks like vLLM or SGLang. * Are proficient in Python and C++/CUDA, with experience debugging performance-critical code. * Possess experience with profiling tools and a measurement-driven approach to optimization. * Are familiar with distributed systems concepts and concurrency.

Job Description

NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM serving by contributing directly to upstream inference engines like vLLM and SGLang-ensuring they run best‑in‑class on NVIDIA GPUs and systems-and by improving the underlying stack that enables high‑throughput, low‑latency inference at scale.

This is a hands-on role for an engineer who enjoys digging into performance bottlenecks, designing pragmatic runtime improvements, and shipping high‑quality changes that are broadly useful to the community and production deployments.

What you'll be doing:







What we need to see:








Ways to stand out from the crowd:





We are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward‑thinking and creative people in the world working for us. If you're creative and autonomous with a real passion for technology, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and [benefits](https://www.nvidia.com/en-us/benefits/).

Applications for this job will be accepted at least until April 18, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

View this job on nocollar jobs