CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs Paper • 2505.24120 • Published 7 days ago • 46 • 4
Multimodal DeepResearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework Paper • 2506.02454 • Published 3 days ago • 3
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs Paper • 2505.24120 • Published 7 days ago • 46 • 4
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs Paper • 2505.24120 • Published 7 days ago • 46
ImgEdit: A Unified Image Editing Dataset and Benchmark Paper • 2505.20275 • Published 10 days ago • 17
🌸 April 2025 - Open releases from the Chinese community Collection 42 items • Updated 21 days ago • 13
view post Post 2513 Matrix Game 🎮 an interactive foundation model for controllable game world generation, released by Skywork AI. Skywork/Matrix-Game✨ 17B with MIT licensed✨ Diffusion-based image-to-world video generation via keyboard & mouse input✨ GameWorld Score benchmark for Minecraft world models✨ Massive Matrix Game Dataset with fine-grained action labels See translation 🚀 6 6 👀 2 2 + Reply
view post Post 2677 Skywork-VL Reward🔥A multimodal reward model for both understanding & reasoning tasks, released by Skywork 昆仑万物-天工 Paper: Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning (2505.07263)Model: Skywork/Skywork-VL-Reward-7B✨ 7B ✨ Trained on large scale, high-quality preference data✨ SOTA on VL-RewardBench + boosts reasoning via MPO See translation 🔥 7 7 + Reply
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning Paper • 2505.07263 • Published 25 days ago • 29 • 3
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning Paper • 2505.07263 • Published 25 days ago • 29
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning Paper • 2505.07263 • Published 25 days ago • 29