P

Research Engineer / Research Scientist

Preference Model

FULL TIME 📍 San Francisco, California, California, US 💰 $198k – $198k

About This Role

About Us Preference Model is building automated ML research engineering. Existing frontier models are brittle when applied to real-world ML tasks. The present bottleneck is the lack of high-quality RL training environments. Our first step is to build RL environments that reflect real-world complexity, with diverse tasks and robust reward functions. Our founding team has previous experience on Anthropic’s data team building data infrastructure, and datasets behind Claude. We are partnering with …
Posted on May 7, 2026 · Apply by June 6, 2026