m-a-p's Collections
TreePO
CriticLean
Hybrid Linear Attention Research
MARBLE
COIG-P-Models
COIG-P-Datasets
YuE
FineFineWeb
MERT
MuPT
COIG
OpenCodeInterpreter
ChatMusician
M-A-P Full Paper List
Amber-Reproduce-Intermediate-CKPTs (The Fine Line)
OpenLLaMA-Reproduce-Intermediate-CKPTs (The Fine Line)
Chinese Tiny LLM
MusiLingo
Neo-Models
Neo-Datasets

TreePO

updated about 8 hours ago

  • TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

    Paper • 2508.17445 • Published 3 days ago • 55

  • m-a-p/TreePO-Qwen2.5-7B

    8B • Updated 1 day ago • 3 • 1

  • m-a-p/TreePO_data

    Viewer • Updated 1 day ago • 49.3k • 7

  • m-a-p/TreePO-Qwen2.5-7B_fixed-div

    8B • Updated 1 day ago • 10