Lutalica

https://github.com/RewindL

RewindL

AI & ML interests

Computer vision, Image Processing

Recent Activity

upvoted a paper about 1 month ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

commented on a paper 4 months ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

upvoted a paper 4 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

commented on 2 papers 4 months ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 97

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 134

New activity in monology/pile-uncopyrighted 11 months ago

Format issue when loading dataset

#1 opened over 1 year ago by