Resources for hybrid preference research, where we learn how to route preference instances to human or AI feedback
Lj V. Miranda
ljvmiranda921
AI & ML interests
NLP - multilinguality, data-centric AI
Recent Activity
updated a dataset about 1 hour ago: ai2-adapt-dev/tool-use-synthetic-reasoning-gpt-4.1-p3
published a dataset about 1 hour ago: ai2-adapt-dev/tool-use-synthetic-reasoning-gpt-4.1-p3
updated a dataset about 3 hours ago: ai2-adapt-dev/tool-use-synthetic-gpt-o3-p1