OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning Paper ⢠2505.08617 ⢠Published 7 days ago ⢠36
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond Paper ⢠2503.21614 ⢠Published Mar 27 ⢠39
From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration Paper ⢠2503.12821 ⢠Published Mar 17 ⢠9
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts Paper ⢠2503.05447 ⢠Published Mar 7 ⢠8
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Paper ⢠2501.12895 ⢠Published Jan 22 ⢠61
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training Paper ⢠2411.15708 ⢠Published Nov 24, 2024
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Paper ⢠2501.12895 ⢠Published Jan 22 ⢠61
NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models Paper ⢠2410.11805 ⢠Published Oct 15, 2024 ⢠14
ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM Paper ⢠2408.12076 ⢠Published Aug 22, 2024 ⢠12
Timo: Towards Better Temporal Reasoning for Language Models Paper ⢠2406.14192 ⢠Published Jun 20, 2024 ⢠1
Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark Paper ⢠2405.08355 ⢠Published May 14, 2024
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling Paper ⢠2409.19291 ⢠Published Sep 28, 2024 ⢠20
Mirror: A Universal Framework for Various Information Extraction Tasks Paper ⢠2311.05419 ⢠Published Nov 9, 2023
Enhancing Low-Resource Relation Representations through Multi-View Decoupling Paper ⢠2312.17267 ⢠Published Dec 26, 2023 ⢠1
LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training Paper ⢠2406.16554 ⢠Published Jun 24, 2024 ⢠1
Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging Paper ⢠2406.15479 ⢠Published Jun 17, 2024 ⢠2
On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion Paper ⢠2406.15480 ⢠Published Jun 17, 2024 ⢠2
ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM Paper ⢠2408.12076 ⢠Published Aug 22, 2024 ⢠12