Tingchen Fu

TingchenFu

https://tingchenfu.github.io/

upvoted 4 papers 2 months ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published May 28 • 67

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23 • 89

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Paper • 2505.16410 • Published May 22 • 57

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published May 20 • 63

upvoted a paper 3 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 86

upvoted an article 4 months ago

Open R1: Update #3

Article • By [author name missing] and 9 others • Mar 11 • 295

upvoted 2 papers 6 months ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 45

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22 • 62

upvoted a collection over 1 year ago

Model Merging

Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it!

Collection • 30 items • Updated Jun 12, 2024 • 244