Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Paper • 2411.04282 • Published Nov 6, 2024 • 36
Learning from Sparse Offline Datasets via Conservative Density Estimation Paper • 2401.08819 • Published Jan 16, 2024
MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases Paper • 2406.10290 • Published Jun 12, 2024
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents Paper • 2408.07060 • Published Aug 13, 2024 • 43
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets Paper • 2406.18518 • Published Jun 26, 2024 • 25
Learning Shared Safety Constraints from Multi-task Demonstrations Paper • 2309.00711 • Published Sep 1, 2023
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models Paper • 2310.05905 • Published Oct 9, 2023 • 2
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System Paper • 2402.15538 • Published Feb 23, 2024 • 6
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning Paper • 2402.15506 • Published Feb 23, 2024 • 17
Constrained Decision Transformer for Offline Safe Reinforcement Learning Paper • 2302.07351 • Published Feb 14, 2023