-
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
Paper • 2311.03285 • Published • 28 -
Tailoring Self-Rationalizers with Multi-Reward Distillation
Paper • 2311.02805 • Published • 3 -
Ultra-Long Sequence Distributed Transformer
Paper • 2311.02382 • Published • 2 -
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Paper • 2309.11235 • Published • 16
Collections
Discover the best community collections!
Collections including paper arxiv:2401.07950
-
A Zero-Shot Language Agent for Computer Control with Structured Reflection
Paper • 2310.08740 • Published • 14 -
ExpeL: LLM Agents Are Experiential Learners
Paper • 2308.10144 • Published • 2 -
Demystifying GPT Self-Repair for Code Generation
Paper • 2306.09896 • Published • 19 -
Large Language Models are Better Reasoners with Self-Verification
Paper • 2212.09561 • Published • 1
-
Automated Annotation with Generative AI Requires Validation
Paper • 2306.00176 • Published • 1 -
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs
Paper • 2309.09582 • Published • 4 -
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Paper • 2310.14192 • Published • 1 -
ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer
Paper • 2309.08583 • Published • 1
-
Sci-CoT: Leveraging Large Language Models for Enhanced Knowledge Distillation in Small Models for Scientific QA
Paper • 2308.04679 • Published • 1 -
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
Paper • 2310.10134 • Published • 1 -
AstroLLaMA: Towards Specialized Foundation Models in Astronomy
Paper • 2309.06126 • Published • 16 -
Large Language Model for Science: A Study on P vs. NP
Paper • 2309.05689 • Published • 20
-
Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
Paper • 2310.13961 • Published • 4 -
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs
Paper • 2309.09582 • Published • 4 -
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Paper • 2310.13127 • Published • 11 -
Evaluating the Robustness to Instructions of Large Language Models
Paper • 2308.14306 • Published • 1