Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
xz's picture
7 1

xz

mxz
·

AI & ML interests

NLP ML RL

Organizations

None yet

models 7

mxz/qwen-R1-3B

Updated Mar 4 • 5

mxz/qwen-R1-1.5B

Updated Mar 4 • 7

mxz/qwen-R1-0.5b

Updated Mar 3 • 5

mxz/llama3-8b-dpo

Text Generation • Updated Jul 28, 2024 • 29

mxz/llama3-8b-ppo

Text Generation • Updated Jul 28, 2024 • 11

mxz/llama3-8b-sft

Text Generation • Updated Jul 28, 2024 • 13

mxz/ppo-LunarLander-v2

Reinforcement Learning • Updated Jul 17, 2024 • 2

datasets 4

mxz/awesome-dpo

Viewer • Updated Jul 28, 2024 • 302k • 24

mxz/CValues

Viewer • Updated Jul 26, 2024 • 146k • 25

mxz/CValues_DPO

Viewer • Updated Jul 26, 2024 • 146k • 26

mxz/alpaca_en_zh_ruozhiba_gpt4-data

Viewer • Updated Jul 26, 2024 • 190k • 23
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs