tk93 committed
Commit 7c59827 · verified · 1 Parent(s): cf3279c

update README

Files changed (1)
  1. README.md +11 -0
README.md CHANGED
@@ -2,6 +2,7 @@
 tags:
 - text-to-image
 - stable-diffusion
+- audio-to-video
 license: apache-2.0
 language:
 - en
@@ -24,4 +25,14 @@ library_name: diffusers
 
 ### Audio Encoder
 
+- [model_ckpts/wav2vec2-base-960h](https://huggingface.co/tk93/V-Express/tree/main/model_ckpts/wav2vec2-base-960h). (It is also available from the original model card [facebook/wav2vec2-base-960h](https://huggingface.co/facebook/wav2vec2-base-960h))
+
+### Face Analysis
+
+- [model_ckpts/insightface_models/models/buffalo_l](https://huggingface.co/tk93/V-Express/tree/main/model_ckpts/insightface_models/models/buffalo_l). (It is also available from the original repository [insightface/buffalo_l](https://github.com/deepinsight/insightface/releases/download/v0.7/buffalo_l.zip))
+
 ### V-Express
+
+- [model_ckpts/sd-vae-ft-mse](https://huggingface.co/tk93/V-Express/tree/main/model_ckpts/sd-vae-ft-mse) VAE encoder. (original model card [stabilityai/sd-vae-ft-mse](https://huggingface.co/stabilityai/sd-vae-ft-mse))
+- [model_ckpts/stable-diffusion-v1-5](https://huggingface.co/tk93/V-Express/tree/main/model_ckpts/stable-diffusion-v1-5) Only the model configuration file for unet is needed here. (original model card [runwayml/stable-diffusion-v1-5](https://huggingface.co/runwayml/stable-diffusion-v1-5))
+- [model_ckpts/v-express](https://huggingface.co/tk93/V-Express/tree/main/model_ckpts/v-express) The video generation model conditional on audio and V-kps we call V-Express.
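
The checkpoint folders added to the README in this commit could be fetched in one call. The following is a minimal sketch, not something the repository documents: it assumes the `huggingface_hub` package is installed, that the folder paths match the links in the diff, and that downloading into the current directory is acceptable. The helper names (`build_allow_patterns`, `download_checkpoints`, `missing_checkpoints`) are hypothetical and chosen only for illustration.

```python
from pathlib import Path

# Repository id taken from the links in the README diff.
REPO_ID = "tk93/V-Express"

# Checkpoint folders listed under Audio Encoder, Face Analysis and V-Express.
CHECKPOINT_DIRS = [
    "model_ckpts/wav2vec2-base-960h",
    "model_ckpts/insightface_models/models/buffalo_l",
    "model_ckpts/sd-vae-ft-mse",
    "model_ckpts/stable-diffusion-v1-5",
    "model_ckpts/v-express",
]


def build_allow_patterns(dirs=CHECKPOINT_DIRS):
    """Turn each folder into a glob pattern matching the files beneath it."""
    return [f"{d}/*" for d in dirs]


def download_checkpoints(local_dir="."):
    """Download only the listed folders (assumes huggingface_hub is installed)."""
    # Imported lazily so the helpers above work without the package.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=build_allow_patterns(),
        local_dir=local_dir,
    )


def missing_checkpoints(root="."):
    """Return the listed folders that are not present under `root`."""
    return [d for d in CHECKPOINT_DIRS if not (Path(root) / d).is_dir()]
```

Calling `download_checkpoints()` and then `missing_checkpoints()` gives a quick check that every folder named in the README arrived. Restricting the download with `allow_patterns` is a design choice to avoid pulling unrelated files from the repository.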