CLAP: Contrastive Language-Audio Pretraining Collection CLAP is to audio what CLIP is to image. • 5 items • Updated Oct 31, 2023 • 8
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20 • 66
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 195
📀 Dataset comparison models Collection 1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12 • 30
My Best Models Collection These all mark personal achievements in my journey • 7 items • Updated Mar 31 • 4
Personal Favorites Collection Recommended models I use often or like for any reason. I recommend reading their cards for more details. • 9 items • Updated Aug 13 • 56
story writing favourites Collection Models I personally liked for generating stories in the past. Not a recommendation, many of these are outdated. • 17 items • Updated 2 days ago • 19
Quantized Models (GGUF, IQ, Imatrix) Collection Various quantizations of models in the GGUF format. Models with a "checkmark" are personal favorites. An "orange arrow" means it's being uploaded. • 89 items • Updated 18 days ago • 48
Utilities Collection No crazy stuff, but useful ones for in-between steps • 15 items • Updated 7 days ago • 4
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31 • 505