SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild Paper • 2503.18892 • Published 5 days ago • 27
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild Paper • 2503.18892 • Published 5 days ago • 27
Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling Task Paper • 2310.06504 • Published Oct 10, 2023 • 1
Large Language Models Meet Open-World Intent Discovery and Recognition: An Evaluation of ChatGPT Paper • 2310.10176 • Published Oct 16, 2023 • 1
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning Paper • 2312.15685 • Published Dec 25, 2023 • 16
FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue Paper • 2306.10315 • Published Jun 17, 2023 • 1
DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning Paper • 2402.09136 • Published Feb 14, 2024 • 1
PreAct: Predicting Future in ReAct Enhances Agent's Planning Ability Paper • 2402.11534 • Published Feb 18, 2024 • 1
Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems Paper • 2210.08873 • Published Oct 17, 2022 • 1
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data Paper • 2409.03810 • Published Sep 5, 2024 • 35
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data Paper • 2409.03810 • Published Sep 5, 2024 • 35