Engineering Manager, Model Inference at Abridge

You're a seasoned engineering leader with at least a year of management experience and over five years in engineering, specifically focused on machine learning

Work type: hybrid

Location: SF Office

Salary: $220,000 – $270,000/yr

Type: Full-time

You're a seasoned engineering leader with at least a year of management experience and over five years in engineering, specifically focused on machine learning systems. You have a strong grasp of LLM architectures and inference optimizations. **What makes it worth a look...** Abridge is offering a full-time, hybrid Engineering Manager role in San Francisco with a compensation of $220,000 - $270,000 annually. They are pioneers in generative AI for healthcare, aiming to improve clinical documentation. **You might be a good fit if you...** * Have deep, hands-on experience with ML inference frameworks like PyTorch, TensorFlow, TensorRT, or vLLM. * Possess experience with inference optimizations such as batching, quantization, and kernel fusion. * Are familiar with deploying reliable, distributed, real-time systems at scale. * Have experience with parallelism strategies including tensor and pipeline parallelism.

View this job on nocollar jobs