Collections

Collections including paper arxiv:2503.04957

Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published 6 days ago • 42
SafeArena: Evaluating the Safety of Autonomous Web Agents

Paper • 2503.04957 • Published 7 days ago • 18
Learning from Failures in Multi-Attempt Reinforcement Learning

Paper • 2503.04808 • Published 9 days ago • 15
START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 7 days ago • 83

about 5 hours ago

RuCCoD: Towards Automated ICD Coding in Russian

Paper • 2502.21263 • Published 13 days ago • 122
Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 6 days ago • 103
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published 6 days ago • 42
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published 6 days ago • 24

McGill-NLP/safearena

Updated 3 days ago • 7 • 1
SafeArena: Evaluating the Safety of Autonomous Web Agents

Paper • 2503.04957 • Published 7 days ago • 18
SafeArena Leaderboard

Running • 2

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Paper • 2502.05163 • Published Feb 7 • 22
CRANE: Reasoning with constrained LLM generation

Paper • 2502.09061 • Published 28 days ago • 18
Investigating the Impact of Quantization Methods on the Safety and Reliability of Large Language Models

Paper • 2502.15799 • Published 23 days ago • 6
AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

Paper • 2502.16776 • Published 17 days ago • 5

A collection of algorithmic agents for user interfaces and interactions, program synthesis, and robots

about 4 hours ago

Learning Language Games through Interaction

Paper • 1606.02447 • Published Jun 8, 2016
Naturalizing a Programming Language via Interactive Learning

Paper • 1704.06956 • Published Apr 23, 2017
Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Paper • 1802.08802 • Published Feb 24, 2018
Mapping Natural Language Commands to Web Elements

Paper • 1808.09132 • Published Aug 28, 2018
