MuteApo
/

CamI2V

Model card Files Files and versions Community

MuteApo commited on 5 days ago

Commit

40ca72a

verified ·

1 Parent(s): 4d69a59

Update README.md

Browse files

Files changed (1) hide show

README.md +14 -19

README.md CHANGED Viewed

@@ -3,7 +3,6 @@ license: mit
 ---
 # CamI2V: Camera-Controlled Image-to-Video Diffusion Model
 <div align="center">
     <a href="https://arxiv.org/abs/2410.15957">
         <img src="https://img.shields.io/static/v1?label=arXiv&message=2410.15957&color=b21d1a" style="display: inline-block; vertical-align: middle;">
@@ -16,22 +15,6 @@ license: mit
     </a>
 </div>
-## 🌟 News and Todo List
-- 🔥 25/03/17: Upload test metadata used in our paper to make easier evaluation.
-- 🔥 25/02/15: Release demo of [RealCam-I2V](https://zgctroy.github.io/RealCam-I2V/) for real-world applications, code will be available at [repo](https://github.com/ZGCTroy/RealCam-I2V).
-- 🔥 25/01/12: Release checkpoint of [CamI2V (512x320, 100k)](https://huggingface.co/MuteApo/CamI2V/blob/main/512_cami2v_100k.pt). We plan to release a more advanced model with longer training soon.
-- 🔥 25/01/02: Release checkpoint of [CamI2V (512x320, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/512_cami2v_50k.pt), which is suitable for research propose and comparison.
-- 🔥 24/12/24: Integrate [Qwen2-VL](https://github.com/QwenLM/Qwen2-VL) in gradio demo, you can now caption your own input image by this powerful VLM.
-- 🔥 24/12/23: Release checkpoint of [CamI2V (256x256, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/256_cami2v.pt).
-- 🔥 24/12/16: Release reproduced non-official checkpoints of [MotionCtrl (256x256, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/256_motionctrl.pt) and [CameraCtrl (256x256, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/256_cameractrl.pt) on [DynamiCrafter](https://github.com/Doubiiu/DynamiCrafter).
-- 🔥 24/12/09: Release training configs and scripts.
-- 🔥 24/12/06: Release [dataset pre-process code](datasets) for RealEstate10K.
-- 🔥 24/12/02: Release [evaluation code](evaluation) for RotErr, TransErr, CamMC and FVD.
-- 🌱 24/11/16: Release model code of CamI2V for training and inference, including implementation for MotionCtrl and CameraCtrl.
 ## 🎥 Gallery
 <table>
@@ -69,6 +52,20 @@ license: mit
     </tr>
 </table>
 ## 📈 Performance
 Measured under 256x256 resolution, 50k training steps, 25 DDIM steps, text-image CFG 7.5, camera CFG 1.0 (no camera CFG).
@@ -141,7 +138,6 @@ python cami2v_gradio_app.py --use_qwenvl_captioner
 Gradio may struggle to establish network connection, please re-try with `--use_host_ip`.
 ## 🤗 Related Repo
 [RealCam-I2V: https://github.com/ZGCTroy/RealCam-I2V](https://github.com/ZGCTroy/RealCam-I2V)
@@ -152,7 +148,6 @@ Gradio may struggle to establish network connection, please re-try with `--use_h
 [DynamiCrafter: https://github.com/Doubiiu/DynamiCrafter](https://github.com/Doubiiu/DynamiCrafter)
 ## 🗒️ Citation
 ```

 ---
 # CamI2V: Camera-Controlled Image-to-Video Diffusion Model
 <div align="center">
     <a href="https://arxiv.org/abs/2410.15957">
         <img src="https://img.shields.io/static/v1?label=arXiv&message=2410.15957&color=b21d1a" style="display: inline-block; vertical-align: middle;">
     </a>
 </div>
 ## 🎥 Gallery
 <table>
     </tr>
 </table>
+## 🌟 News and Todo List
+- 🔥 25/03/17: Upload test metadata used in our paper to make easier evaluation.
+- 🔥 25/02/15: Release demo of [RealCam-I2V](https://zgctroy.github.io/RealCam-I2V/) for real-world applications, code will be available at [repo](https://github.com/ZGCTroy/RealCam-I2V).
+- 🔥 25/01/12: Release checkpoint of [CamI2V (512x320, 100k)](https://huggingface.co/MuteApo/CamI2V/blob/main/512_cami2v_100k.pt). We plan to release a more advanced model with longer training soon.
+- 🔥 25/01/02: Release checkpoint of [CamI2V (512x320, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/512_cami2v_50k.pt), which is suitable for research propose and comparison.
+- 🔥 24/12/24: Integrate [Qwen2-VL](https://github.com/QwenLM/Qwen2-VL) in gradio demo, you can now caption your own input image by this powerful VLM.
+- 🔥 24/12/23: Release checkpoint of [CamI2V (256x256, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/256_cami2v.pt).
+- 🔥 24/12/16: Release reproduced non-official checkpoints of [MotionCtrl (256x256, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/256_motionctrl.pt) and [CameraCtrl (256x256, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/256_cameractrl.pt) on [DynamiCrafter](https://github.com/Doubiiu/DynamiCrafter).
+- 🔥 24/12/09: Release training configs and scripts.
+- 🔥 24/12/06: Release [dataset pre-process code](datasets) for RealEstate10K.
+- 🔥 24/12/02: Release [evaluation code](evaluation) for RotErr, TransErr, CamMC and FVD.
+- 🌱 24/11/16: Release model code of CamI2V for training and inference, including implementation for MotionCtrl and CameraCtrl.
 ## 📈 Performance
 Measured under 256x256 resolution, 50k training steps, 25 DDIM steps, text-image CFG 7.5, camera CFG 1.0 (no camera CFG).
 Gradio may struggle to establish network connection, please re-try with `--use_host_ip`.
 ## 🤗 Related Repo
 [RealCam-I2V: https://github.com/ZGCTroy/RealCam-I2V](https://github.com/ZGCTroy/RealCam-I2V)
 [DynamiCrafter: https://github.com/Doubiiu/DynamiCrafter](https://github.com/Doubiiu/DynamiCrafter)
 ## 🗒️ Citation
 ```