amd
/

Text-to-Video

Advanced text-to-video Diffusion Models

⚡️ This repository provides training recipes for the AMD efficient text-to-video models, which are designed for high performance and efficiency. The training process includes two key steps:

  • Distillation and Pruning: We distill and prune the popular text-to-video model VideoCrafter2, reducing the parameters to a compact 945M while maintaining competitive performance.

  • Optimization with T2V-Turbo: We apply the T2V-Turbo method on the distilled model to reduce inference steps and further enhance model quality.

This implementation is released to promote further research and innovation in the field of efficient text-to-video generation, optimized for AMD Instinct accelerators.

Vbench performance

8-Steps Results

A cute happy Corgi playing in park, sunset, pixel. A cute happy Corgi playing in park, sunset, animated style.gif A cute raccoon playing guitar in the beach. A cute raccoon playing guitar in the forest.
A quiet beach at dawn and the waves gently lapping. A cute teddy bear, dressed in a red silk outfit, stands in a vibrant street, Chinese New Year. A sandcastle being eroded by the incoming tide. An astronaut flying in space, in cyberpunk style.
A cat DJ at a party. A 3D model of a 1800s victorian house. A drone flying over a snowy forest. A ghost ship navigating through a sea under a moon.

License

Copyright (c) 2024 Advanced Micro Devices, Inc. All Rights Reserved.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for amd/AMD-Hummingbird-T2V

Finetuned
(1)
this model

Datasets used to train amd/AMD-Hummingbird-T2V