TsinghuaC3I

university

http://c3i.ee.tsinghua.edu.cn/en/

AI & ML interests

Large Language Models

Recent Activity

iseesaw

authored a paper 6 days ago

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published 11 days ago • 88

yuchenFan

updated a collection 7 days ago

SSRL

Collection

6 items • Updated 7 days ago • 2

iseesaw

authored a paper 7 days ago

ReviewRL: Towards Automated Scientific Review with RL

Paper • 2508.10308 • Published 11 days ago

yuchenFan

updated a collection 14 days ago

SSRL

Collection

6 items • Updated 7 days ago • 2

iseesaw

in TsinghuaC3I/MedXpertQA about 2 months ago

Improve dataset card by adding table-question-answering task category and relevant tags

#2 opened 2 months ago by nielsr

yuchenFan

authored a paper 3 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 129

xuekai

authored a paper 3 months ago

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published May 19 • 27

lindsay-qu

authored a paper 4 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

xuekai

authored a paper 4 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

iseesaw

authored a paper 4 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

iseesaw

authored 2 papers 5 months ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published Apr 1 • 14

Video-T1: Test-Time Scaling for Video Generation

Paper • 2503.18942 • Published Mar 24 • 91

lindsay-qu

authored a paper 5 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 29

yuchenFan

authored a paper 5 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 29

xuekai

authored a paper 5 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 29

iseesaw

authored a paper 5 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 29

Team members 4
