File size: 11,984 Bytes
99a7933 a104a11 99a7933 c2628ab 99a7933 e92bd49 99a7933 e6f63a9 a104a11 e92bd49 a104a11 e92bd49 a104a11 e92bd49 a104a11 c2628ab a104a11 99a7933 ceb5625 21a1e4a 8928eaf 6e7ed8c a4fb9f8 c9277c3 180e0e9 c9277c3 e675670 8cfe96a e675670 8cfe96a 0562d1f e675670 8cfe96a 180e0e9 6e7ed8c 180e0e9 7286a62 4c55a1c 7286a62 6e7ed8c 4762ea6 6e7ed8c 4762ea6 6c2893a d892a8d 6e7ed8c 99a7933 6e7ed8c ceb5625 21a1e4a 6e7ed8c 21a1e4a 6e7ed8c fd6eff7 6e7ed8c 21a1e4a 6e7ed8c 21a1e4a 6e7ed8c 21a1e4a 6e7ed8c 99a7933 6e7ed8c 99a7933 6e7ed8c 99a7933 21a1e4a 6e7ed8c 21a1e4a 6e7ed8c bc9c771 6e7ed8c bc9c771 6e7ed8c bc9c771 6e7ed8c bc9c771 6e7ed8c 21a1e4a 6e7ed8c 21a1e4a 6e7ed8c 21a1e4a c9277c3 6e7ed8c c9277c3 21a1e4a 6e7ed8c 4451df0 6e7ed8c |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 |
---
tags:
- text-to-video
- diffusion
- merged-model
- video-generation
- wan2.1
widget:
- text: >-
Prompt: A gritty close-up of an elven princess kneeling in a rocky ravine, calming a wounded, desert dragon. Its scales are cracked, dry, She wears a crimson sash over bone-colored armor, her auburn hair half-tied back. The camera dollies in rapidly as she reaches for its eye ridge. Lighting comes from golden sunlight reflecting off surrounding rock, casting a warm, earthy hue with no artificial glow.
output:
url: videos/Video_00063.mp4
- text: >-
Prompt: Tight close-up of her smiling lips and sparkling eyes, catching golden hour sunlight. She wears a white sundress with floral prints and a wide-brimmed straw hat. Camera pulls back in a dolly motion, revealing her twirling under a cherry blossom tree. Petals flutter in the air, casting playful shadows. Soft lens flares enhance the euphoric, dreamlike vibe. (Before vs After — Left: Wan2.1 | Right: Merged model Wan14BT2V_MasterModel)
output:
url: videos/AnimateDiff_00001.mp4
- text: >-
Prompt: A gritty close-up of a dwarven beastmaster’s face, his grey beard braided tightly, brows furrowed as he looks just off-camera. The camera dollies out over his shoulder, revealing a perched gryphon watching him from a boulder, its feathers rustling slightly in the breeze. The moment holds stillness and mutual trust. Lighting is early daylight, clean and sharp with strong environmental clarity.
output:
url: videos/FusionX_00012.mp4
- text: >-
Prompt: A gritty close-up of a jungle tracker crouching low, face flushed with focus as she watches a perched macaw a few feet ahead. Her cheek twitches as she shifts forward, beads of sweat visible on her brow. The camera slowly dollies in from below her line of sight, capturing the moment her eyes widen in fascination. Lighting is rich and directional from above, creating a warm glow over her face with minimal shadows.
output:
url: videos/FusionX_00005.mp4
- text: >-
Prompt: A gritty close-up of a battle-worn ranger kneeling in a scorched clearing, calming a wounded gryphon whose wing is torn and bloodied. Its feathers are dusky bronze with streaks of ash-gray. She wears soot-covered hunter green armor, her blonde hair pulled into a loose braid. The camera dollies in as her hand brushes the creature's sharp beak. Lighting comes from late afternoon sun filtering through smoke, casting a burnt-orange haze across the frame.
output:
url: videos/Video_00069.mp4
base_model:
- Wan-AI/Wan2.1-T2V-14B
license: apache-2.0
---
# 🌀 Wan2.1_14B_FusionX
**High-Performance Merged Text-to-Video Model**
Built on WAN 2.1 and fused with research-grade components for cinematic motion, detail, and speed — optimized for ComfyUI and rapid iteration in as few as 6 steps.
Merged models for faster, richer motion & detail — high performance even at just 8 steps.
> 📌 Important: To match the quality shown here, use the linked workflows or make sure to follow the recommended settings outlined below.
---
## 🚀 Overview
A powerful text-to-video model built on top of **WAN 2.1 14B**, merged with several research-grade models to boost:
- Motion quality
- Scene consistency
- Visual detail
Comparable with closed-source solutions, but open and optimized for **ComfyUI** workflows.
---
## 💡 Inside the Fusion
This model is made up of the following which is on TOP of Wan 2.1 14B 720p(FusionX would not be what it is without these Models):
- **CausVid** – [Causal motion modeling for better flow and dynamics](https://github.com/tianweiy/CausVid)
- **AccVideo** – [Better temporal alignment and speed boost](https://github.com/aejion/AccVideo)
- **MoviiGen1.1** – [Cinematic smoothness and lighting](https://huggingface.co/ZuluVision/MoviiGen1.1)
- **MPS Reward LoRA** – [Tuned for motion and detail](https://huggingface.co/alibaba-pai/Wan2.1-Fun-Reward-LoRAs)
- **Custom LoRAs** – For texture, clarity, and small detail enhancements (Set at a very low level)
All merged models are provided for research and non-commercial use only.
Some components are subject to licenses such as CC BY-NC-SA 4.0, and do not fall under permissive licenses like Apache 2.0 or MIT.
Please refer to each model’s original license for full usage terms.
---
## 🚨✨**Hey guys! Just a quick update!**
We finally cooked up **FusionX LoRAs**!! 🧠💥
This is huge – now you can plug FusionX into your favorite workflows as a LoRA on top of the Wan base models and SkyReels models!🔌💫
You can still stick with the base FusionX Model if you already use it, but if you would rather have more control over the "FusionX" strength and a speed boost, then this might be for you.
Oh, and there’s a **nice speed boost** too! ⚡
**Example:** *(RTX 5090)*
- FusionX as a full base model: **8 steps = 160s** ⏱️
- FusionX as a **LoRA on Wan 2.1 14B fp8 T2V**: **8 steps = 120s** 🚀
**Bonus:** You can bump up the FusionX LoRA strength and lower your steps for a **huge speed boost** while testing/drafting.
Example: strength `2.00` with `3 steps` takes `72 seconds`.
Or lower the strength to experiment with a **less “FusionX” look**. ⚡🔍
We’ve got:
- **T2V (Text to Video)** 🎬 – works perfectly with **VACE** ⚙️
- **I2V (Image to Video)** 🖼️➡️📽️
- A dedicated **Phantom LoRA** 👻
The new LoRA's are [HERE](https://huggingface.co/vrgamedevgirl84/Wan14BT2VFusioniX/tree/main/FusionX_LoRa)
Note: The LoRa's are not meant to be put on top of the FusionX main models and instead you would use them with the Wan base models.
**New workflows** are [HERE](https://civitai.com/models/1681541) 🛠️🚀
---
After lots of testing 🧪, the video quality with the LoRA is **just as good** (and sometimes **even better**! 💯)
That’s thanks to it being trained on the **fp16 version** of FusionX 🧬💎
---
### 🌀 Preview Gallery
*These are compressed GIF previews for quick viewing — final video outputs are higher quality.*












---
## 📂 Workflows & Model Downloads
- 💡 **ComfyUI workflows** can be found here:
👉 [Workflow Collection (WIP)](https://civitai.com/models/1663553)
- 📦 **Model files (T2V, I2V, Phantom, VACE)**:
👉 [Main Hugging Face Repo](https://huggingface.co/vrgamedevgirl84/Wan14BT2VFusioniX/tree/main)
### 🧠 GGUF Variants:
- 🖼️ [FusionX Image-to-Video (GGUF)](https://huggingface.co/QuantStack/Wan2.1_I2V_14B_FusionX-GGUF/tree/main)
- 🎥 [FusionX Text-to-Video (GGUF)](https://huggingface.co/QuantStack/Wan2.1_T2V_14B_FusionX-GGUF/tree/main)
- 🎞️ [FusionX T2V VACE (for native)](https://huggingface.co/QuantStack/Wan2.1_T2V_14B_FusionX_VACE-GGUF/tree/main)
- 👻 [FusionX Phantom](https://huggingface.co/QuantStack/Phantom_Wan_14B_FusionX-GGUF/tree/main)
---
## 🎬 Example Videos
Want to see what FusionX can do? Check out these real outputs generated using the latest workflows and settings:
- **Text-to-Video**
👉 [Watch Examples](https://civitai.com/posts/17874424)
- **Image-to-Video**
👉 [Watch Examples](https://civitai.com/posts/18029174)
- **Phantom Mode**
👉 [Watch Examples](https://civitai.com/posts/17986906)
- **VACE Integration**
👉 [Watch Examples](https://civitai.com/posts/18080876)
---
## 🚀 Overview
A powerful text-to-video model built on top of **WAN 2.1 14B**, merged with several research-grade models to boost:
- Motion quality
- Scene consistency
- Visual detail
Comparable with closed-source solutions, but open and optimized for **ComfyUI** workflows.
---
## 🔧 Usage Details
### Text-to-Video
- **CGF**: Must be set to `1`
- **Shift**:
- `1024x576`: Start at `1`
- `1080x720`: Start at `2`
- For realism → lower values
- For stylized → test `3–9`
- **Scheduler**:
- Recommended: `uni_pc`
- Alternative: `flowmatch_causvid` (better for some details)
### Image-to-Video
- **CGF**: `1`
- **Shift**: `2` works best in most cases
- **Scheduler**:
- Recommended: `dmp++_sde/beta`
- To boost motion and reduce slow-mo effect:
- Frame count: `121`
- FPS: `24`
---
## 🛠 Technical Notes
- Works in as few as **6 steps**
- Best quality at **8–10 steps**
- Drop-in replacement for `Wan2.1-T2V-14B`
- Up to **50% faster rendering**, especially with **SageAttn**
- Works natively and with **Kaji Wan Wrapper**
[Wrapper GitHub](https://github.com/kijai/ComfyUI-WanVideoWrapper)
- Do **not** re-add merged LoRAs (CausVid, AccVideo, MPS)
- Feel free to add **other LoRAs** for style/variation
- Native WAN workflows also supported (slightly slower)
---
## 🧪 Performance Tips
- RTX 5090 → ~138 sec/video at 1024x576 / 81 frames
- If VRAM is limited:
- Enable block swapping
- Start with `5` blocks and adjust as needed
- Use **SageAttn** for ~30% speedup (wrapper only)
- Do **not** use `teacache`
- "Enhance a video" (tested): Adds vibrance (try values 2–4)
- "SLG" not tested — feel free to explore
---
## 🧠 Prompt Help
Want better cinematic prompts? Try the **WAN Cinematic Video Prompt Generator GPT** — it adds visual richness and makes a big difference in quality. [Download Here](https://chatgpt.com/g/g-67c3a6d6d19c81919b3247d2bfd01d0b-wan-cinematic-video-prompt-generator)
---
## 📣 Join The Community
We’re building a friendly space to chat, share outputs, and get help.
- Motion LoRAs coming soon
- Tips, updates, and support from other users
👉 [Join the Discord](https://discord.com/invite/hxPmmXmRW3)
---
## ⚖️ License
Some merged components use permissive licenses (Apache 2.0 / MIT),
**but others** — such as those from research models like *CausVid* — may be released under **non-commercial licenses** (e.g., [CC BY-NC-SA 4.0](https://creativecommons.org/licenses/by-nc-sa/4.0/)).
- ✅ You **can** use, modify, and redistribute **under original license terms**
- ❗ You **must** retain and respect the license of each component
- ⚠️ **Commercial use is not permitted** for models or components under non-commercial licenses
- 📌 Outputs are **not automatically licensed** — do your own due diligence
This model is intended for **research, education, and personal use only**.
For commercial use or monetization, please consult a legal advisor and verify all component licenses.
---
## 🙏 Credits
- WAN Team (base model)
- aejion (AccVideo)
- Tianwei Yin (CausVid)
- ZuluVision (MoviiGen)
- Alibaba PAI (MPS LoRA)
- Kijai (ComfyUI Wrapper)
And thanks to the open-source community!
---
|