3 12 32

Ning Ding

stingning

https://www.stingning.cn

ningding97

AI & ML interests

NLP

Recent Activity

authored a paper 13 days ago

TTRL: Test-Time Reinforcement Learning

upvoted a paper 13 days ago

TTRL: Test-Time Reinforcement Learning

upvoted a paper about 2 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

View all activity

Organizations

stingning's activity

authored a paper 13 days ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 13 days ago • 102

upvoted a paper 13 days ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 13 days ago • 102

upvoted a paper about 2 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 27

liked a model about 2 months ago

PRIME-RL/Eurus-2-7B-PRIME-Zero

Text Generation • Updated Mar 14 • 100 • 1

updated 3 datasets 3 months ago

updated 4 models 3 months ago

PRIME-RL/EurusPRM-Stage1

Updated Feb 19 • 331 • 4

PRIME-RL/Eurus-2-7B-SFT

Updated Feb 19 • 3.88k • 2

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated Feb 19 • 653 • 60

PRIME-RL/EurusPRM-Stage2

Updated Feb 19 • 270 • 6

updated a Space 3 months ago

README

🏃

liked a dataset 3 months ago

TsinghuaC3I/MedXpertQA

Viewer • Updated Feb 9 • 4.46k • 1.22k • 15

upvoted 2 papers 3 months ago

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published Feb 6 • 23

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 61

authored 5 papers 3 months ago

Tool Learning with Foundation Models

Paper • 2304.08354 • Published Apr 17, 2023 • 3

UltraFeedback: Boosting Language Models with High-quality Feedback

Paper • 2310.01377 • Published Oct 2, 2023 • 5

Unlock Predictable Scaling from Emergent Abilities

Paper • 2310.03262 • Published Oct 5, 2023 • 3

CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model

Paper • 2310.15477 • Published Oct 24, 2023

Sparse Low-rank Adaptation of Pre-trained Language Models

Paper • 2311.11696 • Published Nov 20, 2023 • 2