-
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
Paper • 2503.05179 • Published • 42 -
SafeArena: Evaluating the Safety of Autonomous Web Agents
Paper • 2503.04957 • Published • 18 -
Learning from Failures in Multi-Attempt Reinforcement Learning
Paper • 2503.04808 • Published • 15 -
START: Self-taught Reasoner with Tools
Paper • 2503.04625 • Published • 83
Collections
Discover the best community collections!
Collections including paper arxiv:2503.04957
-
RuCCoD: Towards Automated ICD Coding in Russian
Paper • 2502.21263 • Published • 122 -
Unified Reward Model for Multimodal Understanding and Generation
Paper • 2503.05236 • Published • 103 -
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
Paper • 2503.05179 • Published • 42 -
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
Paper • 2503.05592 • Published • 24
-
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
Paper • 2502.05163 • Published • 22 -
CRANE: Reasoning with constrained LLM generation
Paper • 2502.09061 • Published • 18 -
Investigating the Impact of Quantization Methods on the Safety and Reliability of Large Language Models
Paper • 2502.15799 • Published • 6 -
AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement
Paper • 2502.16776 • Published • 5
-
Learning Language Games through Interaction
Paper • 1606.02447 • Published -
Naturalizing a Programming Language via Interactive Learning
Paper • 1704.06956 • Published -
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration
Paper • 1802.08802 • Published -
Mapping Natural Language Commands to Web Elements
Paper • 1808.09132 • Published