Preference Model
Preference Model is building automated ML research engineering and the next generation of training data to power the future of AI. They focus on creating high-quality reinforcement learning (RL) environments that reflect real-world complexity, featuring diverse tasks and robust reward functions. This effort aims to solve the bottleneck of brittle frontier models when applied to real-world ML research and engineering tasks.
Preference Model Offices
OnSite Workspace
Employees work from physical offices.
Typical time on-site:
United States