Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DebateLabKIT
/
Llama-3.1-Argunaut-1-8B-HIRPO
like
0
Follow
DebateLab at KIT
11
Text Generation
Transformers
Safetensors
DebateLabKIT/arguments-and-debates
llama
logic
argumentation
critical-thinking
argument-mapping
Generated from Trainer
trl
rlvr
hirpo
conversational
text-generation-inference
arxiv:
2302.05206
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Llama-3.1-Argunaut-1-8B-HIRPO
/
tokenizer_config.json
Commit History
Model save
d1bd114
verified
ggbetz
commited on
Jun 23