Multimodal DeepResearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework Paper • 2506.02454 • Published 3 days ago • 3
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs Paper • 2505.24120 • Published 7 days ago • 46
ImgEdit: A Unified Image Editing Dataset and Benchmark Paper • 2505.20275 • Published 10 days ago • 17
🌸 April 2025 - Open releases from the Chinese community Collection 42 items • Updated 21 days ago • 13
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning Paper • 2505.07263 • Published 25 days ago • 29
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation Paper • 2503.21979 • Published Mar 27 • 3
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning Paper • 2504.16656 • Published Apr 23 • 57
Skywork-R1V2 Collection Multimodal Hybrid Reinforcement Learning for Reasoning • 5 items • Updated 24 days ago • 10
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published Apr 8 • 83