Seperate files instead of checkpoints.

#11

by natalie5 - opened 3 days ago

3 days ago

•

The current model files include vocoder, VAE, Audio VAE alongisde the transformer blocks itself. This is an extremely unoptimized way of loading the model when being used with low VRAM. Having seperate VAE, vocoder files will make implementations/quantizations, VRAM use much better. This also leads to duplicated files where each checkpoint has the same VAEs, vocoder, it will save 4GB of RAM when sampling, and ~14GB in SSD space.

RuneXX

2 days ago

Looks like that might have been added ;-)
https://huggingface.co/Lightricks/LTX-2/tree/main

natalie5

2 days ago

Looks like that might have been added ;-)
https://huggingface.co/Lightricks/LTX-2/tree/main

Also this one https://huggingface.co/GitMylo/LTX-2-comfy_gemma_fp8_e4m3fn

Which works in comfyui

RuneXX

1 day ago

Looks like Kijai is also adding some versions ;-)
https://huggingface.co/Kijai/LTXV2_comfy

mingyi456

1 day ago

@RuneXX those newly added files are most likely for diffusers, not comfyui. The commit message that adds them says that they are weights for diffusers, as well.

natalie5

1 day ago

•

edited 1 day ago

@RuneXX those newly added files are most likely for diffusers, not comfyui. The commit message that adds them says that they are weights for diffusers, as well.

https://huggingface.co/GitMylo/LTX-2-comfy_gemma_fp8_e4m3fn these work in native comfy, we just need the diffusion model itself (without all the extra fluff) to make it fully seperate, plus the audio vae needs some code changes to make it work with the normal vae loader

Phr00t

about 16 hours ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment