Applied Research - RL & Agents at Prime Intellect

**Who this is for** This role is for individuals passionate about the intersection of cutting-edge AI research and practical infrastructure development. If you

Work type: onsite

Location: San Francisco

Salary: $150,000 – $300,000/yr

Type: Full-time

Summary

**Who this is for** This role is for individuals passionate about the intersection of cutting-edge AI research and practical infrastructure development. If you want to build and shape the systems that empower frontier AI labs, this is your opportunity. **Key highlights** You will be instrumental in advancing AI agent capabilities and building the robust infrastructure needed for their reliable and efficient operation at massive scale. This role bridges the gap between ambitious research objectives and tangible technical requirements. **You might be a good fit if you...** - Have a strong background in machine learning engineering, with experience in post-training or Reinforcement Learning. - Possess experience with agent frameworks and tooling (e.g., DSPy, LangGraph). - Are familiar with distributed training/inference frameworks (e.g., vLLM, Ray). - Have a track record of research contributions in ML/RL.

Job Description

Be Your Own Lab
Prime Intellect builds the infrastructure that frontier AI labs build internally, and makes it available to everyone. Our platform, Lab, unifies environments, evaluations, sandboxes, and high-performance training into a single full-stack system for post-training at frontier scale, from RL and SFT to tool use, agent workflows, and deployment. We validate everything by using it ourselves, training open state-of-the-art models on the same stack we put in your hands. We're looking for people who want to build at the intersection of frontier research and real infrastructure.

We recently raised [$15mm in funding](https://www.primeintellect.ai/blog/fundraise) (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.

## Role Impact

This is a role at the intersection of cutting-edge RL/post-training methods and applied agent systems. You’ll have a direct impact on shaping how advanced models are aligned, deployed, and used in the real world by:





###

Application-Driven Research & Infrastructure





Post-training & Reinforcement Learning





Agent Development & Infrastructure





## Requirements








## Nice-to-Haves




## What We Offer






## Growth Opportunity

You’ll join a mission-driven team working at the frontier of open, superintelligence infra. In this role, you’ll have the opportunity to:




If you’re excited to move fast, build boldly, and help define how agentic AI is developed and deployed, we’d love to hear from you.

Ready to build the open superintelligence infrastructure of tomorrow?
Apply now to help us make powerful, open AGI accessible to everyone.

View this job on nocollar jobs