Sayak Paul's picture

Sayak Paul

sayakpaul

·

https://sayak.dev

AI & ML interests

Diffusion models, representation learning

Recent Activity

updated a model 1 minute ago

zerogpu-aoti/fa3-wheel-zerogpu

published a model 1 minute ago

zerogpu-aoti/fa3-wheel-zerogpu

updated a dataset about 4 hours ago

huggingface/diffusers-metadata

View all activity

Organizations

upvoted an article 8 days ago

Article

Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub

By

and 6 others •

Jun 12

• 125

upvoted an article 16 days ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

By

and 11 others •

17 days ago

• 469

upvoted 2 articles 20 days ago

Article

A Dive into Pretraining Strategies for Vision-Language Models

By

and 1 other •

Feb 3, 2023

• 74

Article

Personal Copilot: Train Your Own Coding Assistant

By

and 1 other •

Oct 27, 2023

• 67

upvoted an article 29 days ago

Article

State of open video generation models in Diffusers

By

and 2 others •

Jan 27

• 59

upvoted 2 articles about 1 month ago

Article

Building the Hugging Face MCP Server

By

and 3 others •

Jul 10

• 60

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

By

and 1 other •

Jul 9

• 651

upvoted a paper about 2 months ago

Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published Jun 24 • 41

upvoted 2 articles 2 months ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

By

and 4 others •

Jun 19

• 83

Article

Mixture of Experts Explained

By

and 5 others •

Dec 11, 2023

• 842

upvoted 2 articles 3 months ago

Article

How to train a new language model from scratch using Transformers and Tokenizers

By

•

Feb 14, 2020

• 44

Article

Exploring Quantization Backends in Diffusers

By

and 2 others •

May 21

• 40

upvoted 2 articles 4 months ago

Article

Welcoming Llama Guard 4 on Hugging Face Hub

By

and 3 others •

Apr 29

• 40

Article

CinePile 2.0 - making stronger datasets with adversarial refinement

By

and 3 others •

Oct 23, 2024

• 18

upvoted an article 5 months ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

By

and 6 others •

Apr 5

• 146

upvoted 2 papers 5 months ago

A Refined Analysis of Massive Activations in LLMs

Paper • 2503.22329 • Published Mar 28 • 14

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published Apr 3 • 90

upvoted a collection 5 months ago

SANA-1.5

SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer • 6 items • Updated Apr 17 • 6

upvoted 2 articles 5 months ago

Article

Don't repeat yourself - 🤗 Transformers Design Philosophy

By

•

Apr 5, 2022

• 39

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

By

and 3 others •

Mar 12

• 453