-
Robix: A Unified Model for Robot Interaction, Reasoning and Planning
Paper • 2509.01106 • Published • 39 -
Open Data Synthesis For Deep Research
Paper • 2509.00375 • Published • 50 -
Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation
Paper • 2509.00428 • Published • 11 -
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
Paper • 2509.03405 • Published • 17
Shiqiang Wu
ShiqiangWoo
AI & ML interests
None yet
Recent Activity
updated
a collection
2 days ago
20250904
updated
a collection
2 days ago
20250904
updated
a collection
2 days ago
20250904
Organizations
None yet
20250902
-
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning
Paper • 2508.21104 • Published • 28 -
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables
Paper • 2508.19813 • Published • 20 -
No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes
Paper • 2508.19060 • Published • 8 -
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
Paper • 2508.17198 • Published • 6
AI-generaed code
20250903
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 145 -
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper • 2509.02479 • Published • 76 -
POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion
Paper • 2509.01215 • Published • 42 -
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
Paper • 2509.00676 • Published • 74
20250901
-
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
Paper • 2508.21365 • Published • 22 -
TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training
Paper • 2508.17677 • Published • 14 -
UItron: Foundational GUI Agent with Advanced Perception and Planning
Paper • 2508.21767 • Published • 12 -
AHELM: A Holistic Evaluation of Audio-Language Models
Paper • 2508.21376 • Published • 9
EO
20250904
-
Robix: A Unified Model for Robot Interaction, Reasoning and Planning
Paper • 2509.01106 • Published • 39 -
Open Data Synthesis For Deep Research
Paper • 2509.00375 • Published • 50 -
Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation
Paper • 2509.00428 • Published • 11 -
LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
Paper • 2509.03405 • Published • 17
20250903
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 145 -
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
Paper • 2509.02479 • Published • 76 -
POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion
Paper • 2509.01215 • Published • 42 -
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
Paper • 2509.00676 • Published • 74
20250902
-
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning
Paper • 2508.21104 • Published • 28 -
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables
Paper • 2508.19813 • Published • 20 -
No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes
Paper • 2508.19060 • Published • 8 -
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
Paper • 2508.17198 • Published • 6
20250901
-
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models
Paper • 2508.21365 • Published • 22 -
TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training
Paper • 2508.17677 • Published • 14 -
UItron: Foundational GUI Agent with Advanced Perception and Planning
Paper • 2508.21767 • Published • 12 -
AHELM: A Holistic Evaluation of Audio-Language Models
Paper • 2508.21376 • Published • 9
AI-generaed code
EO