1 5 1

WANG Rui

Ray121381

AI & ML interests

None yet

Recent Activity

liked a Space 3 days ago

BeyondHsueh/ReliableMath-Leaderboard

upvoted a paper 3 days ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

upvoted a paper 2 months ago

ToolRL: Reward is All Tool Learning Needs

View all activity

Organizations

None yet

liked a Space 3 days ago

ReliableMath Leaderboard

🚀

This is ReliableMath Leaderboard!

upvoted a paper 3 days ago

GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning

Paper • 2506.16141 • Published 9 days ago • 25

upvoted a paper 2 months ago

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published Apr 16 • 44

authored a paper 3 months ago

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Paper • 2503.24377 • Published Mar 31 • 17

upvoted a paper 3 months ago

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Paper • 2503.24377 • Published Mar 31 • 17

commented a paper 3 months ago

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Paper • 2503.24377 • Published Mar 31 • 17 •

upvoted a paper 3 months ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31 • 38

updated 2 models 4 months ago

Ray121381/audio

Updated Mar 3

Ray121381/lora_ckp

Updated Mar 3

published 2 models 4 months ago

Ray121381/audio

Updated Mar 3

Ray121381/lora_ckp

Updated Mar 3

upvoted a paper 7 months ago

Moto: Latent Motion Token as the Bridging Language for Robot Manipulation

Paper • 2412.04445 • Published Dec 5, 2024 • 23

WANG Rui

AI & ML interests

Recent Activity

Organizations

Ray121381's activity

ReliableMath Leaderboard