Research Engineer - Reinforcement Learning
Building Open Superintelligence Infrastructure
Prime Intellect is building the open superintelligence stack - from frontier agentic models to the infra that enables anyone to create, train, and deploy them. We aggregate and orchestrate global compute into a single control plane and pair it with the full rl post-training stack: environments, secure sandboxes, verifiable evals, and our async RL trainer. We enable researchers, startups and enterprises to run end-to-end reinforcement learning at frontier scale, adapting models to real tools, workflows, and deployment contexts.
As a Research Engineer in our Reasoning team, you'll play a crucial role in shaping our technological direction, focusing on our test-time compute scaling research ideas. If you love working with synthetic data and teach LLMs reasoning abilities, this role is for you.
For more details about the project you would be working on, check out our outlook on decentralized training in the inference-compute paradigm.
RESPONSIBILITIES
- Lead and participate in novel research to build a massive scale synthetic data generation pipeline and orchestration solution
- Optimize the performance, cost, and resource utilization of AI inference workloads by leveraging the most recent advances for compute & memory optimization techniques.
- Contribute to the development of our open-source libraries and frameworks for synthetic data generation and distributed RL frameworks.
- Publish research in top-tier AI conferences such as ICML & NeurIPS.
- Distill highly technical project outcomes in layman approachable technical blogs to our customers and developers.
- Stay up-to-date with the latest advancements in AI/ML infrastructure and tools, synthetic data gen research and proactively identify opportunities to enhance our platform's capabilities and user experience.
REQUIREMENTS
- Strong background in AI/ML engineering, with extensive experience in desi...