Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2401.07950

Alignment: FineTuning-Preference

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Paper • 2311.03285 • Published Nov 6, 2023 • 28
Tailoring Self-Rationalizers with Multi-Reward Distillation

Paper • 2311.02805 • Published Nov 6, 2023 • 3
Ultra-Long Sequence Distributed Transformer

Paper • 2311.02382 • Published Nov 4, 2023 • 2
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

Paper • 2309.11235 • Published Sep 20, 2023 • 16

A Zero-Shot Language Agent for Computer Control with Structured Reflection

Paper • 2310.08740 • Published Oct 12, 2023 • 14
ExpeL: LLM Agents Are Experiential Learners

Paper • 2308.10144 • Published Aug 20, 2023 • 2
Demystifying GPT Self-Repair for Code Generation

Paper • 2306.09896 • Published Jun 16, 2023 • 19
Large Language Models are Better Reasoners with Self-Verification

Paper • 2212.09561 • Published Dec 19, 2022 • 1

Automated Annotation with Generative AI Requires Validation

Paper • 2306.00176 • Published May 31, 2023 • 1
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs

Paper • 2309.09582 • Published Sep 18, 2023 • 4
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation

Paper • 2310.14192 • Published Oct 22, 2023 • 1
ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer

Paper • 2309.08583 • Published Sep 15, 2023 • 1

Sci-CoT: Leveraging Large Language Models for Enhanced Knowledge Distillation in Small Models for Scientific QA

Paper • 2308.04679 • Published Aug 9, 2023 • 1
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization

Paper • 2310.10134 • Published Oct 16, 2023 • 1
AstroLLaMA: Towards Specialized Foundation Models in Astronomy

Paper • 2309.06126 • Published Sep 12, 2023 • 16
Large Language Model for Science: A Study on P vs. NP

Paper • 2309.05689 • Published Sep 11, 2023 • 20

Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs

Paper • 2310.13961 • Published Oct 21, 2023 • 4
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs

Paper • 2309.09582 • Published Sep 18, 2023 • 4
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models

Paper • 2310.13127 • Published Oct 19, 2023 • 11
Evaluating the Robustness to Instructions of Large Language Models

Paper • 2308.14306 • Published Aug 28, 2023 • 1

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs