vrgamedevgirl84 commited on
Commit
6e7ed8c
·
verified ·
1 Parent(s): ac36e47

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +99 -47
README.md CHANGED
@@ -11,91 +11,143 @@ widget:
11
  *(🔸 Before vs After — Left: Wan2.1 | Right: Merged model Wan14BT2V_MasterModel)*
12
  output:
13
  url: videos/AnimateDiff_00001.mp4
14
-
15
-
16
  base_model:
17
  - Wan-AI/Wan2.1-T2V-14B
18
  license: apache-2.0
19
  ---
20
 
21
- # 🎥 Wan2.1_14B_T2V-FusionX - Formerly named MasterModel, which served as a placeholder. Its the exact same model, just a new name. This inlucdes the fp8 and fp16 version.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
22
 
23
- A powerful merged **text-to-video model** based on the original [WAN 2.1 T2V](https://huggingface.co/Wan-AI/Wan2.1-T2V-14B) model, enhanced using multiple open-source components and LoRAs to boost motion realism, temporal consistency, and expressive detail.
24
- multiple open-source models and LoRAs to boost temporal quality, expressiveness, and motion realism.
 
 
 
25
 
26
  ---
27
 
28
- ## 🧠 Model Blend Includes:
29
- - 🔗 [AccVideo](https://github.com/aejion/AccVideo)
30
- - 🔗 [MoviiGen1.1](https://huggingface.co/ZuluVision/MoviiGen1.1)
31
- - 🔗 [CausVid](https://github.com/tianweiy/CausVid)
32
- - 🔗 [MPS Rewards LoRA](https://huggingface.co/alibaba-pai/Wan2.1-Fun-Reward-LoRAs)
33
- - Custom detail-enhancer LoRAs I created specifically for this merge
34
- - Only 8-10 steps are needed to get great results!
35
- -
36
- All are under **Apache 2.0** or **MIT** licenses and fully permitted for merge and reuse.
 
 
37
 
38
  ---
39
 
40
- ## 🖼️ Example Prompt
 
 
41
 
42
- > **Prompt:**
43
- > Tight close-up of her smiling lips and sparkling eyes, catching golden hour sunlight. She wears a white sundress with floral prints and a wide-brimmed straw hat. Camera pulls back in a dolly motion, revealing her twirling under a cherry blossom tree. Petals flutter in the air, casting playful shadows. Soft lens flares enhance the euphoric, dreamlike vibe.
 
 
 
 
 
 
 
44
 
 
45
 
46
- > **Negative Prompt (CN):**
47
- > 色调艳丽,过曝,静态,细节模糊不清,字幕,风格,作品,画作,画面,静止,整体发灰,最差质量,低质量,JPEG压缩残留,丑陋的,残缺的,多余的手指,画得不好的手部,画得不好的脸部,畸形的,毁容的,形态畸形的肢体,手指融合,静止不动的画面,杂乱的背景,三条腿,背景人很多,倒着走
 
 
 
 
 
48
 
49
  ---
50
 
51
- ## 🧰 How to Use (ComfyUI)
52
 
53
- 1. Download the `.safetensors` file from the [Files tab](https://huggingface.co/vrgamedevgirl84/Wan14BT2V_MasterModel/tree/main).
54
- 2. Place it into your `ComfyUI/models/diffusion_models/` folder.
55
- 3. Restart ComfyUI.
56
- 4. Use the **Checkpoint Loader** node and connect it to your text-to-video workflow.
57
- 5. Since CausVid is merged into this model, you only need no more than 10 steps to get great results.
 
 
 
 
58
 
59
  ---
60
 
61
- ## 📦 Downloads
62
 
63
- Weights available in `.safetensors` format
64
- 👉 [Download here](https://huggingface.co/vrgamedevgirl84/Wan14BT2V_MasterModel/tree/main)
 
 
 
 
 
 
65
 
66
  ---
67
 
68
- ## ⚠️ License Notice
69
 
70
- This merged model combines components licensed under **Apache 2.0** and **MIT**both of which are permissive open-source licenses.
71
 
72
- ### ✅ You are allowed to:
73
- - Use, modify, and redistribute the model (including commercial use)
74
- - Integrate it into your own projects or tools
75
 
76
- ### 📌 You must:
77
- - Include original license notices if you redistribute the model
78
- - Avoid implying endorsement or affiliation with the original authors (as required by Apache 2.0)
79
 
80
- ### 🔄 Output Content:
81
- - Generated videos **are not licensed** by the model’s open-source license
82
- - If any merged model relied on datasets with restrictions, those rules may still apply to the outputs
83
 
84
- This model is intended for **research and creative exploration**, not guaranteed for production use without further validation.
 
85
 
 
86
 
87
  ---
88
 
89
- ## 🙌 Thanks To
90
 
91
- - Alibaba-PAI
92
- - aejion
93
- - ZuluVision
94
- - Tianwei Yin (CausVid)
95
- - Kaji
 
96
 
97
- Big thanks to all original devs this merge wouldn't be possible without your amazing work.
98
 
99
  ---
100
 
 
 
 
 
 
 
 
 
 
 
101
 
 
 
11
  *(🔸 Before vs After — Left: Wan2.1 | Right: Merged model Wan14BT2V_MasterModel)*
12
  output:
13
  url: videos/AnimateDiff_00001.mp4
 
 
14
  base_model:
15
  - Wan-AI/Wan2.1-T2V-14B
16
  license: apache-2.0
17
  ---
18
 
19
+ # Wan2.1_14B_FusionX
20
+
21
+ Merged models for faster, richer motion & detail — high performance even at just 8 steps.
22
+
23
+ > 📌 Important: Please read the full description. Small setting changes can drastically affect results. I've tested and documented better settings below — don't skip it!
24
+
25
+ ---
26
+
27
+ ## 📂 Workflows
28
+
29
+ Workflows can be found **[HERE](#)** (WIP — more coming soon)
30
+
31
+ ---
32
+
33
+ ## 🚀 Overview
34
+
35
+ A powerful text-to-video model built on top of **WAN 2.1 14B**, merged with several research-grade models to boost:
36
 
37
+ - Motion quality
38
+ - Scene consistency
39
+ - Visual detail
40
+
41
+ Comparable with closed-source solutions, but open and optimized for **ComfyUI** workflows.
42
 
43
  ---
44
 
45
+ ## 💡 Inside the Fusion
46
+
47
+ This model includes the following merged components:
48
+
49
+ - **CausVid** Causal motion modeling for better flow and dynamics
50
+ - **AccVideo** Better temporal alignment and speed boost
51
+ - **MoviiGen1.1** Cinematic smoothness and lighting
52
+ - **MPS Reward LoRA** – Tuned for motion and detail
53
+ - **Custom LoRAs** For texture, clarity, and facial enhancements
54
+
55
+ All merged models use permissive open licenses (Apache 2.0 / MIT).
56
 
57
  ---
58
 
59
+ ## 🔧 Usage Details
60
+
61
+ ### Text-to-Video
62
 
63
+ - **CGF**: Must be set to `1`
64
+ - **Shift**:
65
+ - `1024x576`: Start at `1`
66
+ - `1080x720`: Start at `2`
67
+ - For realism → lower values
68
+ - For stylized → test `3–9`
69
+ - **Scheduler**:
70
+ - Recommended: `uni_pc`
71
+ - Alternative: `flowmatch_causvid` (better for some details)
72
 
73
+ ### Image-to-Video
74
 
75
+ - **CGF**: `1`
76
+ - **Shift**: `2` works best in most cases
77
+ - **Scheduler**:
78
+ - Recommended: `dmp++_sde/beta`
79
+ - To boost motion and reduce slow-mo effect:
80
+ - Frame count: `121`
81
+ - FPS: `24`
82
 
83
  ---
84
 
85
+ ## 🛠 Technical Notes
86
 
87
+ - Works in as few as **6 steps**
88
+ - Best quality at **8–10 steps**
89
+ - Drop-in replacement for `Wan2.1-T2V-14B`
90
+ - Up to **50% faster rendering**, especially with **SageAttn**
91
+ - Works natively and with **Kaji Wan Wrapper**
92
+ [Wrapper GitHub](https://github.com/kijai/ComfyUI-WanVideoWrapper)
93
+ - Do **not** re-add merged LoRAs (CausVid, AccVideo, MPS)
94
+ - Feel free to add **other LoRAs** for style/variation
95
+ - Native WAN workflows also supported (slightly slower)
96
 
97
  ---
98
 
99
+ ## 🧪 Performance Tips
100
 
101
+ - RTX 5090 ~138 sec/video at 1024x576 / 81 frames
102
+ - If VRAM is limited:
103
+ - Enable block swapping
104
+ - Start with `5` blocks and adjust as needed
105
+ - Use **SageAttn** for ~30% speedup (wrapper only)
106
+ - Do **not** use `teacache`
107
+ - "Enhance a video" (tested): Adds vibrance (try values 2–4)
108
+ - "SLG" not tested — feel free to explore
109
 
110
  ---
111
 
112
+ ## 🧠 Prompt Help
113
 
114
+ Want better cinematic prompts? Try the **WAN Cinematic Video Prompt Generator GPT** — it adds visual richness and makes a big difference in quality. [Download Here](https://chatgpt.com/g/g-67c3a6d6d19c81919b3247d2bfd01d0b-wan-cinematic-video-prompt-generator)
115
 
116
+ ---
 
 
117
 
118
+ ## 📣 Join The Community
 
 
119
 
120
+ We’re building a friendly space to chat, share outputs, and get help.
 
 
121
 
122
+ - Motion LoRAs coming soon
123
+ - Tips, updates, and support from other users
124
 
125
+ 👉 [Join the Discord](https://discord.com/invite/hxPmmXmRW3)
126
 
127
  ---
128
 
129
+ ## ⚖️ License
130
 
131
+ Merged under permissive licenses:
132
+
133
+ - Apache 2.0 / MIT
134
+ - You **can** use, modify, and redistribute
135
+ - You **must** retain original license info
136
+ - Outputs are not necessarily licensed — do your due diligence
137
 
138
+ This model is for **research, education, and personal use** only. Commercial use is your own responsibility. Please consult a legal advisor before monetizing outputs.
139
 
140
  ---
141
 
142
+ ## 🙏 Credits
143
+
144
+ - WAN Team (base model)
145
+ - aejion (AccVideo)
146
+ - Tianwei Yin (CausVid)
147
+ - ZuluVision (MoviiGen)
148
+ - Alibaba PAI (MPS LoRA)
149
+ - Kijai (ComfyUI Wrapper)
150
+
151
+ And thanks to the open-source community!
152
 
153
+ ---