Privileged On-Policy Exploration

Team member: Yuxiao Qu

models 48

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4096_40_with_reasoning_mbz_1021

8B • Updated 12 days ago • 110

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4096_40_without_reasoning_mbz_1021

8B • Updated 12 days ago • 31

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-1024_160_with_reasoning_mbz_1021

8B • Updated 13 days ago • 178

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-1024_160_without_reasoning_mbz_1021

8B • Updated 13 days ago • 154

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-256_640_with_reasoning_mbz_1021

8B • Updated 13 days ago • 69

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-256_640_without_reasoning_mbz_1021

8B • Updated 13 days ago • 290

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-64_2560_with_reasoning_mbz_1021

8B • Updated 14 days ago • 207

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-64_2560_without_reasoning_mbz_1021

8B • Updated 14 days ago • 161

CMU-POPE/Instruct-POPE-hard-no_guide

4B • Updated 24 days ago • 94

CMU-POPE/Instruct-HARD-ALL-gemini_first_no-guide

4B • Updated 29 days ago • 100

datasets 0

None public yet