Lead Engineer, Inference Platform at MongoDB

This role is ideal for a high-level backend or ML infrastructure expert with 8+ years of experience, specifically someone who has transitioned into technical le

Work type: onsite

Location: Palo Alto; Seattle

Salary: $137,000 – $270,000/yr

Type: Full-time

This role is ideal for a high-level backend or ML infrastructure expert with 8+ years of experience, specifically someone who has transitioned into technical leadership. You should have a deep background in productionizing embedding models and a strong grasp of systems programming (Go, C++, or Rust). This is a "builder-leader" position that sits at the intersection of infrastructure and AI research. A major highlight is the opportunity to work directly with the recently acquired Voyage.ai team to scale state-of-the-art embedding models within the MongoDB Atlas ecosystem. The compensation is highly competitive ($137k–$270k base), and the package includes premium benefits like 20 weeks of parental leave and fertility assistance. It’s a high-impact role where you’ll solve complex multi-tenancy and latency challenges for thousands of global customers. **You might be a good fit if you...** * Have at least 1 year of experience as a Technical Lead for large-scale ML inference or training platforms. * Are comfortable with low-level performance optimization and tools like vLLM, ONNX Runtime, and Kubernetes. * Have hands-on experience with vector search systems (Faiss, HNSW) and hybrid retrieval. * Prefer a hybrid work model in either Palo Alto or Seattle.

View this job on nocollar jobs