Official Repository of the paper: *[Scaling Down Text Encoders of Text-to-Image Diffusion Models](https://github.com/LifuWang-66/ScalingDownTextEncoder/tree/main)*.

Project Page: https://lifuwang-66.github.io/ScalingDownTE/
## Model Description:

T5-Base distilled from [T5-XXL](https://huggingface.co/google/flan-t5-xxl) using [Flux](https://huggingface.co/runwayml/stable-diffusion-v1-5). It is about 50 times smaller than T5-XXL and retains most of its capability.
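A minimal loading sketch (not taken from the original repository): it assumes the distilled encoder is published as a standard `transformers` `T5EncoderModel` whose hidden states are already projected to the width Flux expects, and the repo id `LifuWang/T5Base-for-Flux` below is only a placeholder.

```python
import torch
from transformers import T5EncoderModel, T5TokenizerFast
from diffusers import FluxPipeline

# Placeholder repo id; substitute the actual model path of the distilled encoder.
text_encoder = T5EncoderModel.from_pretrained(
    "LifuWang/T5Base-for-Flux", torch_dtype=torch.bfloat16
)
tokenizer = T5TokenizerFast.from_pretrained("LifuWang/T5Base-for-Flux")

# Override Flux's default T5-XXL (text_encoder_2) with the distilled T5-Base.
pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    text_encoder_2=text_encoder,
    tokenizer_2=tokenizer,
    torch_dtype=torch.bfloat16,
).to("cuda")

image = pipe(
    "a photo of an astronaut riding a horse", num_inference_steps=28
).images[0]
image.save("astronaut.png")
```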
## Generation Results:

<p align="center">
<img src="teaser.png">
</p>
By distilling classifier-free guidance into the model's input, LCM can generate high-quality images with very short inference times. We compare inference time at 768 x 768 resolution, CFG scale w=8, and batch size 4 on an A800 GPU.
<p align="center">
|
25 |
+
<img src="speed_fid.png">
|
26 |
+
</p>
|
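A rough timing sketch for reproducing a measurement at the quoted settings (768 x 768, batch size 4); this is not the authors' benchmark script, and it assumes `pipe` is the pipeline built in the snippet above.

```python
import torch

prompts = ["a photo of an astronaut riding a horse"] * 4  # batch size 4

# Warm-up run so model loading and kernel compilation are excluded from the timing.
pipe(prompts, height=768, width=768, guidance_scale=8.0, num_inference_steps=28)

start = torch.cuda.Event(enable_timing=True)
end = torch.cuda.Event(enable_timing=True)

start.record()
pipe(prompts, height=768, width=768, guidance_scale=8.0, num_inference_steps=28)
end.record()
torch.cuda.synchronize()

print(f"batch of 4 images took {start.elapsed_time(end) / 1000:.2f} s")
```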