SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer
Paper • 2501.18427 • Published • 18
2 ** search_round) and repeat steps 1-3.
diffusers 🧨
bitsandbytes as the official backend, but using others like torchao is already very simple.
enable_model_cpu_offload()
torch.compile()
them.
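The `2 ** search_round` expression above suggests a search whose candidate pool doubles each round. The surviving text does not spell out steps 1-3, so the following is only a minimal sketch under assumptions: each round samples a pool of candidates, scores them, and keeps the best one seen so far. The names `pool_size`, `random_search`, `toy_sample`, and `toy_score` are hypothetical stand-ins, not part of any library; in a real pipeline the sampler would draw starting noises and the scorer would be a verifier model.

```python
import random


def pool_size(search_round: int) -> int:
    """Number of candidates tried in a given round: 2 ** search_round."""
    return 2 ** search_round


def random_search(score, sample, num_rounds: int, seed: int = 0):
    """Assumed loop: sample a pool, score it, keep the best, grow the pool, repeat."""
    rng = random.Random(seed)
    best, best_score = None, float("-inf")
    for search_round in range(1, num_rounds + 1):
        # Assumed step 1: draw a fresh pool whose size doubles every round.
        candidates = [sample(rng) for _ in range(pool_size(search_round))]
        for candidate in candidates:
            # Assumed step 2: score each candidate (a verifier in practice).
            candidate_score = score(candidate)
            # Assumed step 3: keep the best candidate seen so far.
            if candidate_score > best_score:
                best, best_score = candidate, candidate_score
    return best, best_score


def toy_sample(rng: random.Random) -> float:
    """Hypothetical stand-in for sampling a starting noise."""
    return rng.uniform(-1.0, 1.0)


def toy_score(x: float) -> float:
    """Hypothetical stand-in for a verifier: higher is better, peak at 0.5."""
    return -abs(x - 0.5)


if __name__ == "__main__":
    best, best_score = random_search(toy_score, toy_sample, num_rounds=4)
    print(f"pool sizes: {[pool_size(r) for r in range(1, 5)]}")
    print(f"best candidate: {best:.4f}, score: {best_score:.4f}")
```

Because the pool doubles, the total number of candidates evaluated after `n` rounds is `2 ** (n + 1) - 2`, so each extra round roughly doubles the inference-time compute spent.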