anakin87/gemma-2b-orpo
Text Generation
Transformers
Safetensors
alvarobartt/dpo-mix-7k-simplified
English
gemma
trl
orpo
Generated from Trainer
conversational
Eval Results
text-generation-inference
Inference Endpoints
arxiv: 2403.07691
License: gemma-terms-of-use
refs/pr/2
gemma-2b-orpo/README.md
Commit History
Adding Evaluation Results
dda322f
verified
leaderboard-pr-bot committed on May 6
link to GGUF version
76e5b9c
verified
anakin87 committed on Apr 6
improve readme
1a06bf8
anakin87 committed on Mar 26
fix
cd3951b
anakin87 committed on Mar 25
fixes
189413b
anakin87 committed on Mar 25
improve readme
ce4ba3c
anakin87 committed on Mar 25
material
4db7146
anakin87 committed on Mar 25
Update README.md
5cbf999
verified
anakin87 committed on Mar 25
little change
15a13e0
anakin87 committed on Mar 25
End of training
7fbb0bb
verified
anakin87 committed on Mar 24