VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation Paper • 2506.03930 • Published 1 day ago • 9
Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem Paper • 2506.03295 • Published 2 days ago • 11
Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution Paper • 2505.20286 • Published 10 days ago • 6
AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting Paper • 2505.18822 • Published 12 days ago • 14
StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs Paper • 2505.20139 • Published 10 days ago • 18
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models Paper • 2406.05862 • Published Jun 9, 2024 • 4
Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework Paper • 2505.17019 • Published 14 days ago • 4
General-Reasoner: Advancing LLM Reasoning Across All Domains Paper • 2505.14652 • Published 16 days ago • 22
A Comprehensive Survey on Long Context Language Modeling Paper • 2503.17407 • Published Mar 20 • 49
AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection Paper • 2505.07293 • Published 24 days ago • 26
Flow-GRPO: Training Flow Matching Models via Online RL Paper • 2505.05470 • Published 28 days ago • 77
Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts Paper • 2504.21117 • Published Apr 29 • 25
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models Paper • 2505.02735 • Published about 1 month ago • 31