Submitted by taesiri 146 InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency · 61 authors 8.89k 5
Submitted by taesiri 38 Visual-CoG: Stage-Aware Reinforcement Learning with Chain of Guidance for Text-to-Image Generation · 9 authors 2
Submitted by Ironieser 26 MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs · 6 authors 3 3
Submitted by Kaiyue 23 T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation · 5 authors 23 2
Submitted by BAOLONGZHANSHEN 19 Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning · 13 authors 2
Submitted by mbur 19 Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling · 12 authors 47 9
Submitted by Wyattz23 13 PosterGen: Aesthetic-Aware Paper-to-Poster Generation via Multi-Agent LLMs · 5 authors 48 3
Submitted by omidgh 8 MEENA (PersianMMMU): Multimodal-Multilingual Educational Exams for N-level Assessment · 11 authors 3
Submitted by taesiri 6 ST-Raptor: LLM-Powered Semi-Structured Table Question Answering · 9 authors 11 2
Submitted by RuijieZhu 4 MeshSplat: Generalizable Sparse-View Surface Reconstruction via Gaussian Splatting · 8 authors 17 2
Submitted by taesiri 3 Neither Valid nor Reliable? Investigating the Use of LLMs as Judges · 4 authors 2
Submitted by ControlNet 3 Explain Before You Answer: A Survey on Compositional Visual Reasoning · 13 authors 11 2
Submitted by Hecheng0625 2 TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling · 6 authors 105 2
Submitted by stefan-it 1 German4All - A Dataset and Model for Readability-Controlled Paraphrasing in German · 6 authors 2 5
Submitted by tristan-deep 1 Semantic Diffusion Posterior Sampling for Cardiac Ultrasound Dehazing · 3 authors 1 2
Submitted by stefanos50 - REGEN: Real-Time Photorealism Enhancement in Games via a Dual-Stage Generative Network Framework · 2 authors 4 2
Submitted by dipta007 - If We May De-Presuppose: Robustly Verifying Claims through Presupposition-Free Question Decomposition · 2 authors 0 2