Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents Paper • 2310.09343 • Published Oct 13, 2023 • 2
Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback Paper • 2311.07215 • Published Nov 13, 2023 • 3
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models Paper • 2404.02575 • Published Apr 3, 2024 • 51
Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation Paper • 2402.13211 • Published Feb 20, 2024
Evaluating Robustness of Reward Models for Mathematical Reasoning Paper • 2410.01729 • Published Oct 2, 2024
Web-Shepherd: Advancing PRMs for Reinforcing Web Agents Paper • 2505.15277 • Published 22 days ago • 99
Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance Paper • 2505.16348 • Published 21 days ago • 46
Self-Training Elicits Concise Reasoning in Large Language Models Paper • 2502.20122 • Published Feb 27, 2025 • 2
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation Paper • 2410.13232 • Published Oct 17, 2024 • 45
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code Paper • 2409.19715 • Published Sep 29, 2024 • 11
Cactus: Towards Psychological Counseling Conversations using Cognitive Behavioral Theory Paper • 2407.03103 • Published Jul 3, 2024 • 1
THEANINE: Revisiting Memory Management in Long-term Conversations with Timeline-augmented Response Generation Paper • 2406.10996 • Published Jun 16, 2024 • 36
Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation Dataset Paper • 2403.04460 • Published Mar 7, 2024
Evidence-empowered Transfer Learning for Alzheimer's Disease Paper • 2303.01105 • Published Mar 2, 2023