Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
gabrielloiseau
/
TAROT-DPO
like
1
Text Generation
Transformers
Safetensors
Yelp/yelp_review_full
English
gpt2
dpo
text-generation-inference
Inference Endpoints
arxiv:
2407.21630
License:
gpl-3.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
TAROT-DPO
Commit History
Update README.md
bfdb04f
verified
gabrielloiseau
commited on
Sep 5
Update README.md
1cceef4
verified
gabrielloiseau
commited on
Aug 1
Update README.md
98452fc
verified
gabrielloiseau
commited on
Jul 31
Update README.md
1269d02
verified
gabrielloiseau
commited on
Jul 30
Update README.md
6d2906c
verified
gabrielloiseau
commited on
Jul 30
Update README.md
583e180
verified
gabrielloiseau
commited on
Jul 30
Update README.md
f9387c8
verified
gabrielloiseau
commited on
Jul 30
Update README.md
55677b0
verified
gabrielloiseau
commited on
Jul 29
Update README.md
f25748f
verified
gabrielloiseau
commited on
Jul 29
Update README.md
41b071b
verified
gabrielloiseau
commited on
Jul 29
Upload tokenizer
1ab0bc9
verified
gabrielloiseau
commited on
Jul 25
Upload model
99d2bff
verified
gabrielloiseau
commited on
Jul 25
initial commit
3cd7d0f
verified
gabrielloiseau
commited on
Jul 25