Article: Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models. By loubnabnl and 2 others • Mar 20, 2024
instruction-pretrain/ft-instruction-synthesizer-collection • Updated Mar 1