Staff Machine Learning Engineer, GenAI Platform at Reddit

You're a seasoned Staff Machine Learning Engineer with over 10 years of experience, holding a degree in ML, Engineering, Computer Science, or a related field, l

Work type: remote

Location: Remote - United States

Salary: $253,300 – $354,600/yr

Type: Full-time

You're a seasoned Staff Machine Learning Engineer with over 10 years of experience, holding a degree in ML, Engineering, Computer Science, or a related field, looking to lead the architecture and scaling of Generative AI infrastructure. **What makes it worth a look...** Reddit is offering a full-time, fully remote role in the United States with a salary range of USD 253,300 - 354,600 per year. They are building foundational infrastructure to support massive-scale LLM workloads, making GenAI a strategic priority. **You might be a good fit if you...** * Have a proven track record designing and operating large-scale ML systems, specifically with distributed training frameworks like FSDP, DeepSpeed, or Megatron-LM, and LLM serving optimization using tools like vLLM or TensorRT-LLM. * Possess hands-on experience managing fault-tolerant, petabyte-scale distributed systems and multi-node/multi-GPU training clusters. * Are proficient with CUDA environments, GPU virtualization/containerization within Kubernetes, and production-grade Python and/or Go. * Have deep knowledge of modern ML orchestration, fine-tuning pipelines, and model evaluation methodologies, with experience in tools like Ray or MLflow.

View this job on nocollar jobs