madehua's picture

1 14 9

madehua PRO

mdh98

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

First Return, Entropy-Eliciting Explore

upvoted a paper 9 days ago

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

upvoted a paper 16 days ago

Kwai Keye-VL Technical Report

View all activity

Organizations

upvoted 2 papers 9 days ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published 9 days ago • 23

OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion

Paper • 2507.06165 • Published 10 days ago • 54

upvoted a paper 16 days ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published 16 days ago • 121

upvoted a paper about 1 month ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15 • 61

upvoted a paper about 2 months ago

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Paper • 2505.15966 • Published May 21 • 53

upvoted a paper 2 months ago

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8 • 179

upvoted an article 2 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 876

upvoted 2 papers 3 months ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Paper • 2504.15415 • Published Apr 21 • 22

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

upvoted 2 papers 4 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 80

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 68

upvoted 3 papers 5 months ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 28

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Paper • 2502.16614 • Published Feb 23 • 27

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 105