Update README.md
Browse files
README.md
CHANGED
@@ -11,91 +11,143 @@ widget:
|
|
11 |
*(🔸 Before vs After — Left: Wan2.1 | Right: Merged model Wan14BT2V_MasterModel)*
|
12 |
output:
|
13 |
url: videos/AnimateDiff_00001.mp4
|
14 |
-
|
15 |
-
|
16 |
base_model:
|
17 |
- Wan-AI/Wan2.1-T2V-14B
|
18 |
license: apache-2.0
|
19 |
---
|
20 |
|
21 |
-
#
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
22 |
|
23 |
-
|
24 |
-
|
|
|
|
|
|
|
25 |
|
26 |
---
|
27 |
|
28 |
-
##
|
29 |
-
|
30 |
-
|
31 |
-
|
32 |
-
-
|
33 |
-
-
|
34 |
-
-
|
35 |
-
-
|
36 |
-
|
|
|
|
|
37 |
|
38 |
---
|
39 |
|
40 |
-
##
|
|
|
|
|
41 |
|
42 |
-
|
43 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
44 |
|
|
|
45 |
|
46 |
-
|
47 |
-
|
|
|
|
|
|
|
|
|
|
|
48 |
|
49 |
---
|
50 |
|
51 |
-
##
|
52 |
|
53 |
-
|
54 |
-
|
55 |
-
|
56 |
-
|
57 |
-
|
|
|
|
|
|
|
|
|
58 |
|
59 |
---
|
60 |
|
61 |
-
##
|
62 |
|
63 |
-
|
64 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
65 |
|
66 |
---
|
67 |
|
68 |
-
##
|
69 |
|
70 |
-
|
71 |
|
72 |
-
|
73 |
-
- Use, modify, and redistribute the model (including commercial use)
|
74 |
-
- Integrate it into your own projects or tools
|
75 |
|
76 |
-
|
77 |
-
- Include original license notices if you redistribute the model
|
78 |
-
- Avoid implying endorsement or affiliation with the original authors (as required by Apache 2.0)
|
79 |
|
80 |
-
|
81 |
-
- Generated videos **are not licensed** by the model’s open-source license
|
82 |
-
- If any merged model relied on datasets with restrictions, those rules may still apply to the outputs
|
83 |
|
84 |
-
|
|
|
85 |
|
|
|
86 |
|
87 |
---
|
88 |
|
89 |
-
##
|
90 |
|
91 |
-
|
92 |
-
|
93 |
-
-
|
94 |
-
-
|
95 |
-
-
|
|
|
96 |
|
97 |
-
|
98 |
|
99 |
---
|
100 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
101 |
|
|
|
|
11 |
*(🔸 Before vs After — Left: Wan2.1 | Right: Merged model Wan14BT2V_MasterModel)*
|
12 |
output:
|
13 |
url: videos/AnimateDiff_00001.mp4
|
|
|
|
|
14 |
base_model:
|
15 |
- Wan-AI/Wan2.1-T2V-14B
|
16 |
license: apache-2.0
|
17 |
---
|
18 |
|
19 |
+
# Wan2.1_14B_FusionX
|
20 |
+
|
21 |
+
Merged models for faster, richer motion & detail — high performance even at just 8 steps.
|
22 |
+
|
23 |
+
> 📌 Important: Please read the full description. Small setting changes can drastically affect results. I've tested and documented better settings below — don't skip it!
|
24 |
+
|
25 |
+
---
|
26 |
+
|
27 |
+
## 📂 Workflows
|
28 |
+
|
29 |
+
Workflows can be found **[HERE](#)** (WIP — more coming soon)
|
30 |
+
|
31 |
+
---
|
32 |
+
|
33 |
+
## 🚀 Overview
|
34 |
+
|
35 |
+
A powerful text-to-video model built on top of **WAN 2.1 14B**, merged with several research-grade models to boost:
|
36 |
|
37 |
+
- Motion quality
|
38 |
+
- Scene consistency
|
39 |
+
- Visual detail
|
40 |
+
|
41 |
+
Comparable with closed-source solutions, but open and optimized for **ComfyUI** workflows.
|
42 |
|
43 |
---
|
44 |
|
45 |
+
## 💡 Inside the Fusion
|
46 |
+
|
47 |
+
This model includes the following merged components:
|
48 |
+
|
49 |
+
- **CausVid** – Causal motion modeling for better flow and dynamics
|
50 |
+
- **AccVideo** – Better temporal alignment and speed boost
|
51 |
+
- **MoviiGen1.1** – Cinematic smoothness and lighting
|
52 |
+
- **MPS Reward LoRA** – Tuned for motion and detail
|
53 |
+
- **Custom LoRAs** – For texture, clarity, and facial enhancements
|
54 |
+
|
55 |
+
All merged models use permissive open licenses (Apache 2.0 / MIT).
|
56 |
|
57 |
---
|
58 |
|
59 |
+
## 🔧 Usage Details
|
60 |
+
|
61 |
+
### Text-to-Video
|
62 |
|
63 |
+
- **CGF**: Must be set to `1`
|
64 |
+
- **Shift**:
|
65 |
+
- `1024x576`: Start at `1`
|
66 |
+
- `1080x720`: Start at `2`
|
67 |
+
- For realism → lower values
|
68 |
+
- For stylized → test `3–9`
|
69 |
+
- **Scheduler**:
|
70 |
+
- Recommended: `uni_pc`
|
71 |
+
- Alternative: `flowmatch_causvid` (better for some details)
|
72 |
|
73 |
+
### Image-to-Video
|
74 |
|
75 |
+
- **CGF**: `1`
|
76 |
+
- **Shift**: `2` works best in most cases
|
77 |
+
- **Scheduler**:
|
78 |
+
- Recommended: `dmp++_sde/beta`
|
79 |
+
- To boost motion and reduce slow-mo effect:
|
80 |
+
- Frame count: `121`
|
81 |
+
- FPS: `24`
|
82 |
|
83 |
---
|
84 |
|
85 |
+
## 🛠 Technical Notes
|
86 |
|
87 |
+
- Works in as few as **6 steps**
|
88 |
+
- Best quality at **8–10 steps**
|
89 |
+
- Drop-in replacement for `Wan2.1-T2V-14B`
|
90 |
+
- Up to **50% faster rendering**, especially with **SageAttn**
|
91 |
+
- Works natively and with **Kaji Wan Wrapper**
|
92 |
+
[Wrapper GitHub](https://github.com/kijai/ComfyUI-WanVideoWrapper)
|
93 |
+
- Do **not** re-add merged LoRAs (CausVid, AccVideo, MPS)
|
94 |
+
- Feel free to add **other LoRAs** for style/variation
|
95 |
+
- Native WAN workflows also supported (slightly slower)
|
96 |
|
97 |
---
|
98 |
|
99 |
+
## 🧪 Performance Tips
|
100 |
|
101 |
+
- RTX 5090 → ~138 sec/video at 1024x576 / 81 frames
|
102 |
+
- If VRAM is limited:
|
103 |
+
- Enable block swapping
|
104 |
+
- Start with `5` blocks and adjust as needed
|
105 |
+
- Use **SageAttn** for ~30% speedup (wrapper only)
|
106 |
+
- Do **not** use `teacache`
|
107 |
+
- "Enhance a video" (tested): Adds vibrance (try values 2–4)
|
108 |
+
- "SLG" not tested — feel free to explore
|
109 |
|
110 |
---
|
111 |
|
112 |
+
## 🧠 Prompt Help
|
113 |
|
114 |
+
Want better cinematic prompts? Try the **WAN Cinematic Video Prompt Generator GPT** — it adds visual richness and makes a big difference in quality. [Download Here](https://chatgpt.com/g/g-67c3a6d6d19c81919b3247d2bfd01d0b-wan-cinematic-video-prompt-generator)
|
115 |
|
116 |
+
---
|
|
|
|
|
117 |
|
118 |
+
## 📣 Join The Community
|
|
|
|
|
119 |
|
120 |
+
We’re building a friendly space to chat, share outputs, and get help.
|
|
|
|
|
121 |
|
122 |
+
- Motion LoRAs coming soon
|
123 |
+
- Tips, updates, and support from other users
|
124 |
|
125 |
+
👉 [Join the Discord](https://discord.com/invite/hxPmmXmRW3)
|
126 |
|
127 |
---
|
128 |
|
129 |
+
## ⚖️ License
|
130 |
|
131 |
+
Merged under permissive licenses:
|
132 |
+
|
133 |
+
- Apache 2.0 / MIT
|
134 |
+
- You **can** use, modify, and redistribute
|
135 |
+
- You **must** retain original license info
|
136 |
+
- Outputs are not necessarily licensed — do your due diligence
|
137 |
|
138 |
+
This model is for **research, education, and personal use** only. Commercial use is your own responsibility. Please consult a legal advisor before monetizing outputs.
|
139 |
|
140 |
---
|
141 |
|
142 |
+
## 🙏 Credits
|
143 |
+
|
144 |
+
- WAN Team (base model)
|
145 |
+
- aejion (AccVideo)
|
146 |
+
- Tianwei Yin (CausVid)
|
147 |
+
- ZuluVision (MoviiGen)
|
148 |
+
- Alibaba PAI (MPS LoRA)
|
149 |
+
- Kijai (ComfyUI Wrapper)
|
150 |
+
|
151 |
+
And thanks to the open-source community!
|
152 |
|
153 |
+
---
|