- Let LLMs Break Free from Overthinking via Self-Braking Tuning
  Paper • 2505.14604 • Published • 23
- AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
  Paper • 2505.16944 • Published • 8
- Training Step-Level Reasoning Verifiers with Formal Verification Tools
  Paper • 2505.15960 • Published • 7
- The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning
  Paper • 2505.15134 • Published • 6
Felix Tuma
floom
AI & ML interests
NLP
Recent Activity
upvoted a paper 2 days ago
EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction
updated a collection 3 days ago
ShowAndTell
updated a collection 3 days ago
PotentialApplication
Organizations
None yet