Hugging Face
AMindToThink/GEMMA-2-2B-FT-ORPO-ISAERFT_gemma-2-2b-lr6.1e-06-beta0.2-20250301-0243
Transformers
Generated from Trainer
smol-course
module_1
isaerft
lr_6.082515167208541e-06
beta_0.2
arxiv:2403.07691
Welcome to the community
The community tab is the place to discuss and collaborate with the HF community!