Joakim Lee's picture

19

Joakim Lee

Reinforcement4All

AI & ML interests

None yet

Recent Activity

upvoted a paper about 20 hours ago

Learning to Reason without External Rewards

upvoted a paper about 20 hours ago

Agents of Change: Self-Evolving LLM Agents for Strategic Planning

upvoted a paper about 21 hours ago

Show-o2: Improved Native Unified Multimodal Models

View all activity

Organizations

None yet

Reinforcement4All's activity

upvoted 2 papers about 20 hours ago

Learning to Reason without External Rewards

Paper • 2505.19590 • Published 27 days ago • 29

Agents of Change: Self-Evolving LLM Agents for Strategic Planning

Paper • 2506.04651 • Published 17 days ago • 7

upvoted a paper about 21 hours ago

Show-o2: Improved Native Unified Multimodal Models

Paper • 2506.15564 • Published 3 days ago • 11

upvoted 2 papers 1 day ago

SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence

Paper • 2506.15672 • Published 3 days ago • 9

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published 4 days ago • 31

upvoted a paper 3 days ago

Sekai: A Video Dataset towards World Exploration

Paper • 2506.15675 • Published 3 days ago • 54

upvoted 3 papers 4 days ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published 6 days ago • 54

Reasoning with Exploration: An Entropy Perspective

Paper • 2506.14758 • Published 4 days ago • 24

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Paper • 2506.14429 • Published 5 days ago • 40

upvoted 3 papers 5 days ago

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Paper • 2506.10521 • Published 10 days ago • 64

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published 5 days ago • 219

pLSTM: parallelizable Linear Source Transition Mark networks

Paper • 2506.11997 • Published 8 days ago • 8

upvoted 2 papers 6 days ago

Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

Paper • 2505.22954 • Published 24 days ago • 11

The Diffusion Duality

Paper • 2506.10892 • Published 9 days ago • 35

upvoted 2 papers 7 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published 22 days ago • 127

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published 22 days ago • 197

upvoted 3 papers 8 days ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published 12 days ago • 218

Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

Paper • 2506.09250 • Published 11 days ago • 27

Magistral

Paper • 2506.10910 • Published 9 days ago • 58