11 47 13

Wei Liu

PeterV09

https://vpeterv.github.io

AI & ML interests

Machine Learning, Natural Language Processing

Recent Activity

upvoted a paper 4 days ago

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations

liked a dataset 4 days ago

GPUMODE/KernelBook

liked a model 4 days ago

facebook/KernelLLM

View all activity

Organizations

PeterV09's activity

upvoted a paper 4 days ago

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations

Paper • 2506.04633 • Published 5 days ago • 18

liked a dataset 4 days ago

GPUMODE/KernelBook

Viewer • Updated 14 days ago • 18.2k • 708 • 17

liked a model 4 days ago

facebook/KernelLLM

Text Generation • Updated 14 days ago • 12.3k • 147

upvoted a paper 5 days ago

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Paper • 2506.04207 • Published 5 days ago • 44

upvoted a paper 6 days ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published 10 days ago • 61

upvoted 2 papers 7 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 7 days ago • 139

ARIA: Training Language Agents with Intention-Driven Reward Aggregation

Paper • 2506.00539 • Published 10 days ago • 28

upvoted 2 papers 13 days ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published 15 days ago • 101

SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

Paper • 2505.19641 • Published 15 days ago • 64

upvoted a paper 14 days ago

Scaling Image and Video Generation via Test-Time Evolutionary Search

Paper • 2505.17618 • Published 18 days ago • 39

upvoted a paper 15 days ago

FullFront: Benchmarking MLLMs Across the Full Front-End Engineering Workflow

Paper • 2505.17399 • Published 18 days ago • 14

upvoted a paper 18 days ago

Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models

Paper • 2505.14810 • Published 20 days ago • 61

authored 4 papers 19 days ago

Diving into Self-Evolving Training for Multimodal Reasoning

Paper • 2412.17451 • Published Dec 23, 2024 • 44

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published Mar 24 • 30

Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging

Paper • 2505.05464 • Published May 8 • 10

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Paper • 2505.15612 • Published 19 days ago • 32

updated a collection 19 days ago

Laser

Collection

The collection for the Paper "Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping" • 13 items • Updated 19 days ago

upvoted a paper 19 days ago

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Paper • 2505.15612 • Published 19 days ago • 32

commented a paper 19 days ago

Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

Paper • 2505.15612 • Published 19 days ago • 32 •

updated a collection 19 days ago

Laser

Collection

The collection for the Paper "Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping" • 13 items • Updated 19 days ago