Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yihua Zhang's picture
1 2 3

Yihua Zhang

NormalUhr
vinhnx90's profile picture zhaoyixing's profile picture Rabai's profile picture
·
https://www.yihua-zhang.com
  • zyh2022
  • normaluhr
  • zhangyihua

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago
Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment
published an article 3 months ago
DualPipe Explained: A Comprehensive Guide to DualPipe That Anyone Can Understand—Even Without a Distributed Training Background
published an article 4 months ago
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
View all activity

Organizations

OPTML Group @ MSU's profile picture

NormalUhr's activity

upvoted a paper 7 days ago

Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment

Paper • 2505.11821 • Published 20 days ago • 13
upvoted an article 9 months ago
view article
Article

Optimizing your LLM in production

By patrickvonplaten •
Sep 15, 2023
• 18
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs