StarChat2 15B - a HuggingFaceH4 Collection

StarChat2 15B

updated Apr 12, 2024

Model, datasets, and demo for StarChat2 15B. For code to train the models, see: https://github.com/huggingface/alignment-handbook

HuggingFaceH4/starchat2-15b-v0.1

Text Generation • 16B • Updated Mar 13, 2024 • 3.2k • 112

HuggingFaceH4/starchat2-15b-sft-v0.1

Text Generation • 16B • Updated Mar 12, 2024 • 18 • 5

Note: The SFT model that was used for alignment with DPO

jondurbin/airoboros-3.2

Viewer • Updated Jan 2, 2024 • 58.7k • 72 • 46

Note: Part of the SFT mix

abacusai/SystemChat

Viewer • Updated Mar 4, 2024 • 7.02k • 44 • 134

Note: Part of the SFT mix

microsoft/orca-math-word-problems-200k

Viewer • Updated Mar 4, 2024 • 200k • 1.52k • 458

Note: Part of the SFT mix

m-a-p/Code-Feedback

Viewer • Updated Feb 26, 2024 • 66.4k • 287 • 210

Note: Part of the SFT mix

LDJnr/Capybara

Viewer • Updated Jun 7, 2024 • 16k • 283 • 242

Note: Part of the SFT mix

HuggingFaceH4/ultrafeedback_binarized

Viewer • Updated Oct 16, 2024 • 187k • 12.3k • 299

Note: Part of the DPO mix

Intel/orca_dpo_pairs

Viewer • Updated Nov 29, 2023 • 12.9k • 2.4k • 311

Note: Part of the DPO mix