---
license: mit
---

# CamI2V: Camera-Controlled Image-to-Video Diffusion Model

## 🌟 News and Todo List

- 🔥 25/03/17: Upload the test metadata used in our paper to make evaluation easier.
- 🔥 25/02/15: Release demo of [RealCam-I2V](https://zgctroy.github.io/RealCam-I2V/) for real-world applications; code will be available at [repo](https://github.com/ZGCTroy/RealCam-I2V).
- 🔥 25/01/12: Release checkpoint of [CamI2V (512x320, 100k)](https://huggingface.co/MuteApo/CamI2V/blob/main/512_cami2v_100k.pt). We plan to release a more advanced model with longer training soon.
- 🔥 25/01/02: Release checkpoint of [CamI2V (512x320, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/512_cami2v_50k.pt), which is suitable for research purposes and comparison.
- 🔥 24/12/24: Integrate [Qwen2-VL](https://github.com/QwenLM/Qwen2-VL) into the Gradio demo; you can now caption your own input image with this powerful VLM.
- 🔥 24/12/23: Release checkpoint of [CamI2V (256x256, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/256_cami2v.pt).
- 🔥 24/12/16: Release reproduced non-official checkpoints of [MotionCtrl (256x256, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/256_motionctrl.pt) and [CameraCtrl (256x256, 50k)](https://huggingface.co/MuteApo/CamI2V/blob/main/256_cameractrl.pt) on [DynamiCrafter](https://github.com/Doubiiu/DynamiCrafter).
- 🔥 24/12/09: Release training configs and scripts.
- 🔥 24/12/06: Release [dataset pre-processing code](datasets) for RealEstate10K.
- 🔥 24/12/02: Release [evaluation code](evaluation) for RotErr, TransErr, CamMC and FVD.
- 🌱 24/11/16: Release model code of CamI2V for training and inference, including implementations of MotionCtrl and CameraCtrl.

## 🎥 Gallery

- Rightward rotation and zoom in (CFG=4, FS=6, step=50, ratio=0.6, scale=0.1)
- Leftward rotation and zoom in (CFG=4, FS=6, step=50, ratio=0.6, scale=0.1)
- Zoom in and upward movement (CFG=4, FS=6, step=50, ratio=0.8, scale=0.2)
- Downward movement and zoom out (CFG=4, FS=6, step=50, ratio=0.8, scale=0.2)