-
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
Paper • 2503.05179 • Published • 46 -
SafeArena: Evaluating the Safety of Autonomous Web Agents
Paper • 2503.04957 • Published • 21 -
Learning from Failures in Multi-Attempt Reinforcement Learning
Paper • 2503.04808 • Published • 18 -
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 114
Muntasir Adnan
adnaan525
AI & ML interests
None yet
Recent Activity
authored
a paper
1 day ago
The Debugging Decay Index: Rethinking Debugging Strategies for Code LLMs
upvoted
a
paper
2 days ago
The Debugging Decay Index: Rethinking Debugging Strategies for Code LLMs
commented on
a paper
2 days ago
The Debugging Decay Index: Rethinking Debugging Strategies for Code LLMs
Organizations
None yet