LLaMA Beyond English: An Empirical Study on Language Capability Transfer Paper โข 2401.01055 โข Published Jan 2, 2024 โข 54
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper โข 2307.09288 โข Published Jul 18, 2023 โข 244
BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model Paper โข 2204.03905 โข Published Apr 8, 2022 โข 4
DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales Paper โข 2308.01320 โข Published Aug 2, 2023 โข 45