lv12's Collections
Agent Based Modeling
Representation Learning
Preference Optimization
Information Retrieval
Preference Optimization
Updated 18 days ago
A Roadmap to Pluralistic Alignment • Paper • 2402.05070 • Published Feb 7, 2024
Self-Rewarding Language Models • Paper • 2401.10020 • Published Jan 18, 2024 • 146
SakanaAI/DiscoPOP-zephyr-7b-gemma • Text Generation • Updated Jun 13, 2024 • 5.25k • 36