view article Article Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub By drbh and 6 others • Jun 12 • 125
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 17 days ago • 469
view article Article A Dive into Pretraining Strategies for Vision-Language Models By adirik and 1 other • Feb 3, 2023 • 74
view article Article Personal Copilot: Train Your Own Coding Assistant By smangrul and 1 other • Oct 27, 2023 • 67
view article Article State of open video generation models in Diffusers By sayakpaul and 2 others • Jan 27 • 59
view article Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • Jul 9 • 651
Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation Paper • 2506.19852 • Published Jun 24 • 41
view article Article (LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware By derekl35 and 4 others • Jun 19 • 83
view article Article How to train a new language model from scratch using Transformers and Tokenizers By julien-c • Feb 14, 2020 • 44
view article Article Exploring Quantization Backends in Diffusers By derekl35 and 2 others • May 21 • 40
view article Article CinePile 2.0 - making stronger datasets with adversarial refinement By mfarre and 3 others • Oct 23, 2024 • 18
view article Article Welcome Llama 4 Maverick & Scout on Hugging Face! By burtenshaw and 6 others • Apr 5 • 146
SANA-1.5 Collection SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer • 6 items • Updated Apr 17 • 6
view article Article Don't repeat yourself - 🤗 Transformers Design Philosophy By patrickvonplaten • Apr 5, 2022 • 39
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 453