Sayak Paul

sayakpaul

AI & ML interests

Diffusion models, representation learning

Recent Activity

updated a model about 7 hours ago
sayakpaul/trained-lumina2-lora-yarn
published a model about 7 hours ago
sayakpaul/trained-lumina2-lora-yarn
updated a model 1 day ago
sayakpaul/dummy-lora-state-dicts

Organizations

Hugging Face, Deprem Private, 🧨Diffusers, TensorFlow TPU, Hugging Face Internal Testing Organization, ControlNet 1.1 Preview, Keras, All Things ViTs, Carted, Amazon ML, Instruction-tuned Diffusion Models, Probing ViTs, Evaluation on the Hub, JAX ♥️ Diffusers 🧨, (De)fusing, Huggingface Projects, Keras Dreambooth Event, Hugging Face OSS Metrics, Deploy HF TF ViTs, Parti Prompts Diffusers, Open Generative AI, UniDiffuser Testing, Kandinsky Community, Personal Coding Assistant, Diffusers Demo at ICCV 2023, diffusers-adapter, xin-ic sayak-hf, PixArt, huggingPartyParis, Latent Consistency, Spatial-T2I, ZeroGPU Explorers, SPRIGHT, PEFT, NYU VisionX, Data Is Better Together, Social Post Explorers, MaPO, diffusers-internal-dev, AuraFlow, Anonymous T2I Model, Optimum Internal Testing, ZP, Diffusion Guidance, syn-t2i, Vchitect-XL, Data Is Better Together Contributor, DDUF, HunyuanVideo Community, Video Intrinsics, Finetrainers, Adaptive Summarization

Posts 21

Inference-time scaling meets Flux.1-Dev (and others) 🔥

Presenting a simple re-implementation of "Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps" by Ma et al.

I implemented the simplest strategy, random search, but results can potentially be improved with better-guided search methods.

Supports Gemini 2 Flash & Qwen2.5 as verifiers for "LLMGrading" 🤗

The steps are simple:

For each round:

1> Start by sampling 2 starting noises with different seeds.
2> Score the generations w.r.t. a metric.
3> Obtain the best generation from the current round.

If you have more compute budget, go to the next search round: scale the noise pool to 2 ** search_round and repeat steps 1-3.

This constitutes the random search method as done in the paper by Google DeepMind.
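
The round structure described above can be sketched in a few lines of Python. This is a minimal illustration and not the code from the repository: `random_search`, `generate`, and `score` are hypothetical names, the toy `generate` and `score` functions stand in for a real diffusion pipeline and verifier, and numbering rounds from 1 (so that the first pool holds 2 noises) is an assumption.

```python
import random
from typing import Any, Callable, Dict, List


def random_search(
    generate: Callable[[int], Any],   # hypothetical: seed -> generation
    score: Callable[[Any], float],    # hypothetical: generation -> score (higher is better)
    search_rounds: int = 3,
    base_seed: int = 0,
) -> List[Dict[str, Any]]:
    """Run the random search loop and return the best candidate of each round."""
    rng = random.Random(base_seed)
    best_per_round = []
    # Assumption: rounds are numbered from 1, so the first pool has 2 ** 1 = 2 noises.
    for search_round in range(1, search_rounds + 1):
        pool_size = 2 ** search_round
        # Step 1: sample distinct seeds, one per starting noise.
        seeds = rng.sample(range(2 ** 32), pool_size)
        # Step 2: generate from each seed and score the result.
        scored = [(seed, score(generate(seed))) for seed in seeds]
        # Step 3: keep the best generation of the current round.
        best_seed, best_score = max(scored, key=lambda pair: pair[1])
        best_per_round.append(
            {
                "round": search_round,
                "pool_size": pool_size,
                "best_seed": best_seed,
                "best_score": best_score,
            }
        )
    return best_per_round


# Toy stand-ins so the sketch runs without a model or a verifier.
def toy_generate(seed: int) -> float:
    return random.Random(seed).gauss(0.0, 1.0)


def toy_score(sample: float) -> float:
    return -abs(sample)  # samples closer to zero score higher


results = random_search(toy_generate, toy_score, search_rounds=3)
for entry in results:
    print(entry)
```

In a real setup, `generate` would call the image pipeline with a seeded noise and `score` would call the chosen verifier. The number of generations per round doubles, so the compute budget effectively decides how many rounds are affordable.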

Code, more results, and a bunch of other stuff are in the repository. Check it out here: https://github.com/sayakpaul/tt-scale-flux/ 🤗

Articles 27


Build awesome datasets for video generation