Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Bowen's picture
5 2

Bowen

PeterJinGo
Xian2025's profile picture Lriver's profile picture munchong915's profile picture
·

AI & ML interests

None yet

Recent Activity

updated a dataset 17 days ago
Archive-models/musique
published a dataset 17 days ago
Archive-models/musique
updated a dataset 17 days ago
Archive-models/nq_hotpotqa_train_search_sample10
View all activity

Organizations

ptllama's profile picture rubricrm's profile picture Cell-O1's profile picture archive's profile picture longRAG's profile picture

PeterJinGo's activity

upvoted a collection 19 days ago

Search-R1-v0.3

Collection
RL with outcome reward + format reward. https://arxiv.org/abs/2505.15117 • 11 items • Updated 17 days ago • 2
upvoted a paper about 1 month ago

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 76
upvoted 2 collections 2 months ago

Search-R1-v0.2

Collection
Exploration with a more stable RL pipeline with outcome-only reward and scaled-up LLMs. https://arxiv.org/abs/2503.09516 • 25 items • Updated 17 days ago • 4

Search-R1

Collection
Preliminary checkpoints with outcome-only RL. • 14 items • Updated Apr 7 • 9
upvoted a paper 3 months ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 31
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs