ZeroGPU AoTI
community
AI & ML interests
AoT compilation, ZeroGPU inference optimization
Lists resources for serializing compiled graph modules and loading them from the Hub to skip compilation time 🔥
Creative applications and accelerated demos with QwenImageEdit
Optimized demo for Flux Kontext [dev], using FP8 quantization and AoT compilation
Optimized demos for Wan 2.2 14B models, using FP8 quantization, AoT compilation, and community LoRAs for fast, high-quality inference on ZeroGPU 💨
Compare AoTI vs. base version