Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Mila Iterative DPO
university
Activity Feed
Follow
3
AI & ML interests
None defined yet.
Recent Activity
arianhosseini
authored
a paper
19 days ago
When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning
mnoukhov
authored
a paper
6 months ago
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
sophiex
authored
a paper
10 months ago
Efficient Adversarial Training in LLMs with Continuous Attacks
View all activity
Team members
3
models
None public yet
datasets
None public yet