Text-to-Image
Diffusers
lora
template:diffusion-lora
trojblue committed on
Commit
5c774fa
verified
0 Parent(s):

initial commit

Files changed (2)
  1. .gitattributes +55 -0
  2. README.md +124 -0
.gitattributes ADDED
@@ -0,0 +1,55 @@
+ *.7z filter=lfs diff=lfs merge=lfs -text
+ *.arrow filter=lfs diff=lfs merge=lfs -text
+ *.bin filter=lfs diff=lfs merge=lfs -text
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
+ *.ftz filter=lfs diff=lfs merge=lfs -text
+ *.gz filter=lfs diff=lfs merge=lfs -text
+ *.h5 filter=lfs diff=lfs merge=lfs -text
+ *.joblib filter=lfs diff=lfs merge=lfs -text
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
+ *.lz4 filter=lfs diff=lfs merge=lfs -text
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
+ *.model filter=lfs diff=lfs merge=lfs -text
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
+ *.npy filter=lfs diff=lfs merge=lfs -text
+ *.npz filter=lfs diff=lfs merge=lfs -text
+ *.onnx filter=lfs diff=lfs merge=lfs -text
+ *.ot filter=lfs diff=lfs merge=lfs -text
+ *.parquet filter=lfs diff=lfs merge=lfs -text
+ *.pb filter=lfs diff=lfs merge=lfs -text
+ *.pickle filter=lfs diff=lfs merge=lfs -text
+ *.pkl filter=lfs diff=lfs merge=lfs -text
+ *.pt filter=lfs diff=lfs merge=lfs -text
+ *.pth filter=lfs diff=lfs merge=lfs -text
+ *.rar filter=lfs diff=lfs merge=lfs -text
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
+ *.tar filter=lfs diff=lfs merge=lfs -text
+ *.tflite filter=lfs diff=lfs merge=lfs -text
+ *.tgz filter=lfs diff=lfs merge=lfs -text
+ *.wasm filter=lfs diff=lfs merge=lfs -text
+ *.xz filter=lfs diff=lfs merge=lfs -text
+ *.zip filter=lfs diff=lfs merge=lfs -text
+ *.zst filter=lfs diff=lfs merge=lfs -text
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
+ # Audio files - uncompressed
+ *.pcm filter=lfs diff=lfs merge=lfs -text
+ *.sam filter=lfs diff=lfs merge=lfs -text
+ *.raw filter=lfs diff=lfs merge=lfs -text
+ # Audio files - compressed
+ *.aac filter=lfs diff=lfs merge=lfs -text
+ *.flac filter=lfs diff=lfs merge=lfs -text
+ *.mp3 filter=lfs diff=lfs merge=lfs -text
+ *.ogg filter=lfs diff=lfs merge=lfs -text
+ *.wav filter=lfs diff=lfs merge=lfs -text
+ # Image files - uncompressed
+ *.bmp filter=lfs diff=lfs merge=lfs -text
+ *.gif filter=lfs diff=lfs merge=lfs -text
+ *.png filter=lfs diff=lfs merge=lfs -text
+ *.tiff filter=lfs diff=lfs merge=lfs -text
+ # Image files - compressed
+ *.jpg filter=lfs diff=lfs merge=lfs -text
+ *.jpeg filter=lfs diff=lfs merge=lfs -text
+ *.webp filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,124 @@
+ ---
+ tags:
+ - text-to-image
+ - lora
+ - diffusers
+ - template:diffusion-lora
+ widget:
+ - text: >-
+     pixel art, a rainy day scene of a cat-eared anime girl with a transparent
+     umbrella, standing at a train crossing, her reflection shimmering in the
+     puddles as a train rushes by.
+   output:
+     url: images/ComfyUI_00021_.webp
+ - text: '-'
+   output:
+     url: images/ComfyUI_00021_.webp
+ base_model: tencent/HunyuanVideo
+ instance_prompt: pixel art
+ license: mit
+ ---
+ # Hunyuan Video LoRA - PixelArt
+
+ <Gallery />
+
+ ## Model description
+
+ # Hunyuan Video LoRA - PixelArt
+
+
+
+ **(Model card WIP; subject to updates today or tomorrow)**
+
+
+
+ [v1.0]:
+
+ This LoRA brings an **anime-inspired pixel art style** to life, trained on a mix of pixel animations and still images. It's designed to generate vibrant, colorful anime-style pixel art, excelling at **character motions** and **pixelated scenery**. Think bright, dynamic visuals with that classic retro charm.
+
+
+
+ ## Usage
+
+ A sample workflow is available in the Hugging Face repo:
+
+ - [v0.1/ComfyUI_00024_.webp · trojblue/HunyuanVideo-lora-AnimeShot](https://huggingface.co/trojblue/HunyuanVideo-lora-AnimeShot/blob/main/v0.1/ComfyUI_00024_.webp)
+
+ - For default node compatibility, update ComfyUI to the latest commit. More details here: [HunyuanVideo Native Support in ComfyUI - by Jo Zhang](https://blog.comfy.org/p/hunyuanvideo-native-support-in-comfyui).
+
+
+
+ ## Configs
+
+ Trained on a dynamic aspect ratio at **~768 resolution**, this LoRA has been tested at the following resolutions:
+
+ - 768x768
+
+ - 656x880
+
+ - 880x656
+
+ It's flexible with step counts, working effectively at:
+
+ - 1 step
+
+ - 33 steps
+
+ - 65 steps
+
+ - 97 steps
+
+ Feel free to tweak these settings to find your sweet spot!
+
+
+
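The three tested resolutions all have close to the same pixel count as 768x768, which is what one would expect from aspect-ratio bucketing around a ~768 target. The sketch below checks a candidate size against that budget. The 5% tolerance and the multiple-of-16 check are assumptions made for illustration (all three listed sizes happen to be multiples of 16); neither value comes from the training config.

```python
# Sizes listed in the README as tested.
TESTED = [(768, 768), (656, 880), (880, 656)]
TARGET_SIDE = 768  # the "~768 resolution" training target


def area_ratio(width, height, target_side=TARGET_SIDE):
    """Pixel count of width x height relative to target_side x target_side."""
    return (width * height) / (target_side * target_side)


def looks_compatible(width, height, tolerance=0.05, multiple=16):
    """Heuristic check (assumed, not from the author): both sides sit on a
    `multiple`-pixel grid and the area is within `tolerance` of the target."""
    on_grid = width % multiple == 0 and height % multiple == 0
    return on_grid and abs(area_ratio(width, height) - 1.0) <= tolerance


for w, h in TESTED:
    print(f"{w}x{h}: area ratio {area_ratio(w, h):.3f}, "
          f"compatible={looks_compatible(w, h)}")
```

Sizes that fail this check may still work; the check only flags resolutions that stray far from the ones reported as tested.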
+ ## Prompting
+
+ Start prompts with `pixel art, <description>`. The training data used **natural language captions** (1-3 sentences), so descriptive, sentence-style prompts tend to yield the best results. Here are the prompts behind the sample images:
+
+ ```
+ pixel art, a pixelated scene showing a fox-eared anime boy with fiery red hair and golden eyes, sitting on a torii gate at sunset, gently playing a bamboo flute as cherry blossoms float in the air.
+
+ pixel art, a pixelated image of a silver-haired shrine maiden with glowing violet eyes, sweeping the temple courtyard under the full moon, with soft lantern light and petals drifting by.
+
+ pixel art, a pixel scene of a pink-haired demon loli with tiny horns, roasting marshmallows over a campfire in a haunted forest, surrounded by glowing ghost friends.
+
+ pixel art, a rainy day scene of a cat-eared anime girl with a transparent umbrella, standing at a train crossing, her reflection shimmering in the puddles as a train rushes by.
+
+ pixel art, a rainy day scene of a cat-eared anime girl with a transparent umbrella, standing at a train crossing, her reflection shimmering in the puddles as a train rushes by.
+
+ pixel art, a pixelated mountaintop at dawn with a lone boy in a scarf standing at the edge, arms outstretched as the wind carries cherry petals past him, a red kite fluttering behind.
+
+ pixel art, a pixelated scene of a frog in a wizard hat stirring a bubbling cauldron in the middle of a mushroom forest, fireflies glowing around him like sparks of magic.
+
+ pixel art, a pixelated alleyway in a quiet neon-lit city, a boy with silver hair feeding a stray black cat from a bento box, both bathed in soft vending machine light.
+
+ pixel art, a peaceful farm landscape with a scarecrow slightly askew in the middle of a golden wheat field, crows perched on its arms as the wind moves the grain in waves.
+
+ pixel art, a quiet underwater scene where a catfish wearing a crown floats lazily above a coral throne, tiny sea creatures circling like royal attendants.
+ ```
+
+
+
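When prompts are generated in bulk, the `pixel art, <description>` convention can be applied with a small helper. This is a minimal sketch: `build_prompt` is a hypothetical name, not part of any workflow shipped with the model, and the whitespace normalization is an added convenience.

```python
TRIGGER = "pixel art"  # trigger phrase named in the README


def build_prompt(description, trigger=TRIGGER):
    """Return `description` prefixed with the trigger phrase.

    Whitespace is collapsed to single spaces, and the prefix is not
    added twice if the description already starts with "<trigger>,".
    """
    text = " ".join(description.split())
    if text.lower().startswith(trigger.lower() + ","):
        return text
    return f"{trigger}, {text}"


print(build_prompt(
    "a pixelated scene of a frog in a wizard hat stirring a bubbling cauldron."
))
```

Because the captions were natural-language sentences, the helper leaves the description itself untouched rather than converting it to tags.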
+ ## Limitations
+
+ The dataset leans heavily on **anime characters and scenery**, so prompts outside this scope might produce less pixelated or lower-quality results. I may experiment with WAN to improve this later, but for now, this is the Hunyuan version as it stands.
+
+ Training wrapped up around January; I just didn't get around to posting it sooner (my bad!). As a result, the inference setup hasn't been updated to the latest Hunyuan Video best practices. Checking recent guidelines could help optimize your results.
+
+
+
+ ## Updates
+
+ Feel free to follow me on Twitter for model updates and stuff: [yada (@yada_cc) / X](https://x.com/yada_cc)
+
+ ## Trigger words
+
+ You should use `pixel art` to trigger the image generation.
+
+
+ ## Download model
+
+ Weights for this model are available in Safetensors format.
+
+ [Download](/trojblue/HunyuanVideo-lora-PixelArt/tree/main) them in the Files & versions tab.
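As an alternative to the Files & versions tab, the weights can likely be fetched from a script with `huggingface_hub`. This is a sketch under assumptions: the repo id is read off the download link above, the weight file name is not stated in the README, so the code matches any `*.safetensors` file instead of guessing one, and `download_lora_weights` is a hypothetical helper name.

```python
# Repo id taken from the download link in the README.
REPO_ID = "trojblue/HunyuanVideo-lora-PixelArt"
# The exact weight file name is not given, so match by extension.
WEIGHT_PATTERNS = ["*.safetensors"]


def download_lora_weights(repo_id=REPO_ID, patterns=WEIGHT_PATTERNS):
    """Download only the Safetensors files from the repo and return the
    local snapshot directory. Requires `pip install huggingface_hub`."""
    # Imported lazily so this module loads even without the package.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id, allow_patterns=patterns)
```

Calling `download_lora_weights()` returns the path of the local snapshot. For ComfyUI, the downloaded `.safetensors` file would then typically be copied into the `models/loras` folder of the ComfyUI install.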