nicholasKluge/Aira-2-124M-DPO
Tags: Text Generation, Transformers, PyTorch, Safetensors, English, gpt2, alignment, instruction tuned, text generation, conversation, assistant, dpo, Carbon Emissions, text-generation-inference, Inference Endpoints
Datasets: nicholasKluge/instruct-aira-dataset, nicholasKluge/reward-aira-dataset
arXiv: 1803.05457, 2109.07958, 2203.09509
License: apache-2.0
b14f9e9
Aira-2-124M-DPO/README.md
Commit History
Update README.md (f09d00d), nicholasKluge committed on Dec 4, 2023
Update README.md (1f9e9fa), nicholasKluge committed on Dec 4, 2023
Update README.md (6ce8837), nicholasKluge committed on Dec 4, 2023
Create README.md (1a8184c), nicholasKluge committed on Dec 3, 2023