StarChat2 15B

Updated Apr 12, 2024

Models, datasets, and demo for StarChat2 15B. For the code used to train the models, see: https://github.com/huggingface/alignment-handbook

Upvote
13

  • HuggingFaceH4/starchat2-15b-v0.1

    Text Generation • 16B • Updated Mar 13, 2024 • 3.2k • 112

  • HuggingFaceH4/starchat2-15b-sft-v0.1

    Text Generation • 16B • Updated Mar 12, 2024 • 18 • 5

    Note: The SFT model that was used as the starting point for alignment with DPO


  • jondurbin/airoboros-3.2

    Viewer • Updated Jan 2, 2024 • 58.7k • 72 • 46

    Note: Part of the SFT mix


  • abacusai/SystemChat

    Viewer • Updated Mar 4, 2024 • 7.02k • 44 • 134

    Note: Part of the SFT mix


  • microsoft/orca-math-word-problems-200k

    Viewer • Updated Mar 4, 2024 • 200k • 1.52k • 458

    Note: Part of the SFT mix


  • m-a-p/Code-Feedback

    Viewer • Updated Feb 26, 2024 • 66.4k • 287 • 210

    Note: Part of the SFT mix


  • LDJnr/Capybara

    Viewer • Updated Jun 7, 2024 • 16k • 283 • 242

    Note: Part of the SFT mix


  • HuggingFaceH4/ultrafeedback_binarized

    Viewer • Updated Oct 16, 2024 • 187k • 12.3k • 299

    Note: Part of the DPO mix


  • Intel/orca_dpo_pairs

    Viewer • Updated Nov 29, 2023 • 12.9k • 2.4k • 311

    Note: Part of the DPO mix
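
The notes above split the datasets into an SFT mix and a DPO mix. As a minimal sketch, that grouping can be written down in Python so the repo IDs for one stage can be looked up together. The names `COLLECTION_DATASETS`, `datasets_for_stage`, and `load_stage` are hypothetical and not part of the collection or the alignment-handbook. The collection does not state mixing proportions, splits, or preprocessing, so none are assumed here.

```python
from typing import Dict, List

# Dataset repo IDs copied from the collection notes, grouped by training stage.
# The grouping mirrors the "Part of the SFT mix" / "Part of the DPO mix" notes only;
# proportions and filtering are not given in the collection and are not modeled.
COLLECTION_DATASETS: Dict[str, List[str]] = {
    "sft": [
        "jondurbin/airoboros-3.2",
        "abacusai/SystemChat",
        "microsoft/orca-math-word-problems-200k",
        "m-a-p/Code-Feedback",
        "LDJnr/Capybara",
    ],
    "dpo": [
        "HuggingFaceH4/ultrafeedback_binarized",
        "Intel/orca_dpo_pairs",
    ],
}


def datasets_for_stage(stage: str) -> List[str]:
    """Return the repo IDs the collection tags for a stage ("sft" or "dpo")."""
    key = stage.lower()
    if key not in COLLECTION_DATASETS:
        raise ValueError(f"unknown stage {stage!r}; expected 'sft' or 'dpo'")
    return list(COLLECTION_DATASETS[key])


def load_stage(stage: str) -> dict:
    """Hypothetical helper: download every dataset tagged for a stage.

    Requires the `datasets` package and network access. Split and config
    names differ per dataset, so each entry is whatever `load_dataset`
    returns by default for that repo.
    """
    from datasets import load_dataset  # imported lazily so the lookup works offline

    return {repo_id: load_dataset(repo_id) for repo_id in datasets_for_stage(stage)}
```

Calling `datasets_for_stage("dpo")` returns the two preference datasets without touching the network, while `load_stage("dpo")` would attempt to download them from the Hub.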
