anakin87/gemma-2b-orpo
Tags: Text Generation, Transformers, Safetensors, alvarobartt/dpo-mix-7k-simplified, English, gemma, trl, orpo, Generated from Trainer, conversational, Eval Results, text-generation-inference, Inference Endpoints
arXiv: 2403.07691
License: gemma-terms-of-use
Commit History (branch: main)
bf6bfe3  Update README.md  (anakin87, verified, committed on May 6, 2024)
1f9a318  Update README.md  (anakin87, verified, committed on May 6, 2024)
7569a46  add evaluation on Open LLM Leaderboard  (anakin87, verified, committed on May 6, 2024)
76e5b9c  link to GGUF version  (anakin87, verified, committed on Apr 6, 2024)
b946408  Upload tokenizer.model  (anakin87, verified, committed on Apr 6, 2024)
1a06bf8  improve readme  (anakin87, committed on Mar 26, 2024)
f18f009  retry nb visualization  (anakin87, committed on Mar 26, 2024)
c8b9386  improve notebook visualization  (anakin87, committed on Mar 26, 2024)
cd3951b  fix  (anakin87, committed on Mar 25, 2024)
189413b  fixes  (anakin87, committed on Mar 25, 2024)
ce4ba3c  improve readme  (anakin87, committed on Mar 25, 2024)
159c797  Upload gemma-2b-orpo.png  (anakin87, verified, committed on Mar 25, 2024)
4db7146  material  (anakin87, committed on Mar 25, 2024)
5cbf999  Update README.md  (anakin87, verified, committed on Mar 25, 2024)
15a13e0  little change  (anakin87, committed on Mar 25, 2024)
7fbb0bb  End of training  (anakin87, verified, committed on Mar 24, 2024)
b6e4162  Training in progress, epoch 2  (anakin87, verified, committed on Mar 24, 2024)
d241042  Training in progress, epoch 2  (anakin87, verified, committed on Mar 24, 2024)
3c43e7b  Training in progress, epoch 0  (anakin87, verified, committed on Mar 24, 2024)
dcc2f5c  initial commit  (anakin87, verified, committed on Mar 24, 2024)