Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Pengyu Cheng's picture
3 8 9

Pengyu Cheng

Linear95
cuisijia's profile picture
ยท

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
GD^2PO: Mitigating Multi-Reward Conflicts via Group-Dynamic reward-Decoupled Policy Optimization
upvoted a paper 9 days ago
MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination
upvoted a paper 9 days ago
CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR
View all activity

Organizations

Quark's profile picture

Linear95 's datasets

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs