Web-Shepherd: Advancing PRMs for Reinforcing Web Agents Paper • 2505.15277 • Published 20 days ago • 99
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation Paper • 2410.13232 • Published Oct 17, 2024 • 45
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code Paper • 2409.19715 • Published Sep 29, 2024 • 11
THEANINE: Revisiting Memory Management in Long-term Conversations with Timeline-augmented Response Generation Paper • 2406.10996 • Published Jun 16, 2024 • 36
THEANINE: Revisiting Memory Management in Long-term Conversations with Timeline-augmented Response Generation Paper • 2406.10996 • Published Jun 16, 2024 • 36
THEANINE: Revisiting Memory Management in Long-term Conversations with Timeline-augmented Response Generation Paper • 2406.10996 • Published Jun 16, 2024 • 36 • 1
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models Paper • 2404.02575 • Published Apr 3, 2024 • 51
Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement Paper • 2401.14215 • Published Jan 25, 2024 • 2
Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback Paper • 2311.07215 • Published Nov 13, 2023 • 3
Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents Paper • 2310.09343 • Published Oct 13, 2023 • 2
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models Paper • 2404.02575 • Published Apr 3, 2024 • 51