multimodalart HF Staff commited on
Commit
b71c275
·
1 Parent(s): f443caf

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ image-7.png filter=lfs diff=lfs merge=lfs -text
37
+ image-8.png filter=lfs diff=lfs merge=lfs -text
38
+ image-9.png filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,104 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - stable-diffusion-xl
4
+ - stable-diffusion-xl-diffusers
5
+ - text-to-image
6
+ - diffusers
7
+ - lora
8
+ - template:sd-lora
9
+ widget:
10
+ - text: A <s0><s1> mouse cartoon piloting a boat
11
+ output:
12
+ url: image-0.png
13
+ - text: A <s0><s1> mouse cartoon piloting a boat
14
+ output:
15
+ url: image-1.png
16
+ - text: A <s0><s1> mouse cartoon pulling the boat's horn
17
+ output:
18
+ url: image-2.png
19
+ - text: A <s0><s1> mouse cartoon zoom out piloting a boat
20
+ output:
21
+ url: image-3.png
22
+ - text: A <s0><s1> mouse cartoon mad at a bucket
23
+ output:
24
+ url: image-4.png
25
+ - text: A <s0><s1> mouse cartoon opening a dog's mouth
26
+ output:
27
+ url: image-5.png
28
+ - text: A <s0><s1> mouse cartoon dancing in the kitchen
29
+ output:
30
+ url: image-6.png
31
+ - text: A <s0><s1> mouse cartoon looking at a cow
32
+ output:
33
+ url: image-7.png
34
+ - text: A <s0><s1> mouse cartoon plahying musical instruments
35
+ output:
36
+ url: image-8.png
37
+ - text: A <s0><s1> mouse cartoon sitting down
38
+ output:
39
+ url: image-9.png
40
+ base_model: stabilityai/stable-diffusion-xl-base-1.0
41
+ instance_prompt: A <s0><s1> mouse cartoon
42
+ license: openrail++
43
+ ---
44
+
45
+ # SDXL LoRA DreamBooth - multimodalart/mouse-public-domain-rank16
46
+
47
+ <Gallery />
48
+
49
+ ## Model description
50
+
51
+ ### These are multimodalart/mouse-public-domain-rank16 LoRA adaption weights for stabilityai/stable-diffusion-xl-base-1.0.
52
+
53
+ ## Download model
54
+
55
+ ### Use it with UIs such as AUTOMATIC1111, Comfy UI, SD.Next, Invoke
56
+
57
+ - **LoRA**: download **[`mouse-public-domain-rank16.safetensors` here 💾](/multimodalart/mouse-public-domain-rank16/blob/main/mouse-public-domain-rank16.safetensors)**.
58
+ - Place it on your `models/Lora` folder.
59
+ - On AUTOMATIC1111, load the LoRA by adding `<lora:mouse-public-domain-rank16:1>` to your prompt. On ComfyUI just [load it as a regular LoRA](https://comfyanonymous.github.io/ComfyUI_examples/lora/).
60
+ - *Embeddings*: download **[`mouse-public-domain-rank16_emb.safetensors` here 💾](/multimodalart/mouse-public-domain-rank16/blob/main/mouse-public-domain-rank16_emb.safetensors)**.
61
+ - Place it on it on your `embeddings` folder
62
+ - Use it by adding `mouse-public-domain-rank16_emb` to your prompt. For example, `A mouse-public-domain-rank16_emb mouse cartoon `
63
+ (you need both the LoRA and the embeddings as they were trained together for this LoRA)
64
+
65
+
66
+ ## Use it with the [🧨 diffusers library](https://github.com/huggingface/diffusers)
67
+
68
+ ```py
69
+ from diffusers import AutoPipelineForText2Image
70
+ import torch
71
+ from huggingface_hub import hf_hub_download
72
+ from safetensors.torch import load_file
73
+
74
+ pipeline = AutoPipelineForText2Image.from_pretrained('stabilityai/stable-diffusion-xl-base-1.0', torch_dtype=torch.float16).to('cuda')
75
+ pipeline.load_lora_weights('multimodalart/mouse-public-domain-rank16', weight_name='pytorch_lora_weights.safetensors')
76
+ embedding_path = hf_hub_download(repo_id='multimodalart/mouse-public-domain-rank16', filename='mouse-public-domain-rank16_emb.safetensors' repo_type="model")
77
+ state_dict = load_file(embedding_path)
78
+ pipeline.load_textual_inversion(state_dict["clip_l"], token=["<s0>", "<s1>"], text_encoder=pipeline.text_encoder, tokenizer=pipeline.tokenizer)
79
+ pipeline.load_textual_inversion(state_dict["clip_g"], token=["<s0>", "<s1>"], text_encoder=pipeline.text_encoder_2, tokenizer=pipeline.tokenizer_2)
80
+
81
+ image = pipeline('A <s0><s1> mouse cartoon ').images[0]
82
+ ```
83
+
84
+ For more details, including weighting, merging and fusing LoRAs, check the [documentation on loading LoRAs in diffusers](https://huggingface.co/docs/diffusers/main/en/using-diffusers/loading_adapters)
85
+
86
+ ## Trigger words
87
+
88
+ To trigger image generation of trained concept(or concepts) replace each concept identifier in you prompt with the new inserted tokens:
89
+
90
+ to trigger concept `TOK` → use `<s0><s1>` in your prompt
91
+
92
+
93
+
94
+ ## Details
95
+ All [Files & versions](/multimodalart/mouse-public-domain-rank16/tree/main).
96
+
97
+ The weights were trained using [🧨 diffusers Advanced Dreambooth Training Script](https://github.com/huggingface/diffusers/blob/main/examples/advanced_diffusion_training/train_dreambooth_lora_sdxl_advanced.py).
98
+
99
+ LoRA for the text encoder was enabled. False.
100
+
101
+ Pivotal tuning was enabled: True.
102
+
103
+ Special VAE used for training: madebyollin/sdxl-vae-fp16-fix.
104
+
image-0.png ADDED
image-1.png ADDED
image-2.png ADDED
image-3.png ADDED
image-4.png ADDED
image-5.png ADDED
image-6.png ADDED
image-7.png ADDED

Git LFS Details

  • SHA256: 399df42b5a3205af6976b8749125eae11198a775abd77771876ea8fda3b8a485
  • Pointer size: 132 Bytes
  • Size of remote file: 1.01 MB
image-8.png ADDED

Git LFS Details

  • SHA256: 0ab2c81e2cb44ea1f6c168095e574a72e895499067fa7bbcf4612b550c2e3df8
  • Pointer size: 132 Bytes
  • Size of remote file: 1.04 MB
image-9.png ADDED

Git LFS Details

  • SHA256: a1360d1f17c9e7bac60af18358f1c7fcf6b8064429a2806a6e7d7c901739d43f
  • Pointer size: 132 Bytes
  • Size of remote file: 1.09 MB
logs/dreambooth-lora-sd-xl/1704066476.0948572/events.out.tfevents.1704066476.r-multimodalart-autotrain-mouse-public-domain-rank16--bf9f7g7qd.211.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fe2333bc9125600dda8490e7b44a8503935fc7830f281319f4be6a5af3d55e3b
3
+ size 3545
logs/dreambooth-lora-sd-xl/1704066476.0969837/hparams.yml ADDED
@@ -0,0 +1,74 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ adam_beta1: 0.9
2
+ adam_beta2: 0.999
3
+ adam_epsilon: 1.0e-08
4
+ adam_weight_decay: 0.0001
5
+ adam_weight_decay_text_encoder: null
6
+ allow_tf32: false
7
+ cache_dir: null
8
+ cache_latents: true
9
+ caption_column: prompt
10
+ center_crop: false
11
+ checkpointing_steps: 5000
12
+ checkpoints_total_limit: null
13
+ class_data_dir: null
14
+ class_prompt: null
15
+ crops_coords_top_left_h: 0
16
+ crops_coords_top_left_w: 0
17
+ dataloader_num_workers: 0
18
+ dataset_config_name: null
19
+ dataset_name: ./08807279-5f15-43ac-8dd0-0ee6da5ae55d
20
+ enable_xformers_memory_efficient_attention: false
21
+ gradient_accumulation_steps: 1
22
+ gradient_checkpointing: true
23
+ hub_model_id: null
24
+ hub_token: null
25
+ image_column: image
26
+ instance_data_dir: null
27
+ instance_prompt: 'A <s0><s1> mouse cartoon '
28
+ learning_rate: 1.0
29
+ local_rank: -1
30
+ logging_dir: logs
31
+ lr_num_cycles: 1
32
+ lr_power: 1.0
33
+ lr_scheduler: constant
34
+ lr_warmup_steps: 0
35
+ max_grad_norm: 1.0
36
+ max_train_steps: 800
37
+ mixed_precision: bf16
38
+ num_class_images: 100
39
+ num_new_tokens_per_abstraction: 2
40
+ num_train_epochs: 54
41
+ num_validation_images: 4
42
+ optimizer: prodigy
43
+ output_dir: mouse-public-domain-rank16
44
+ pretrained_model_name_or_path: stabilityai/stable-diffusion-xl-base-1.0
45
+ pretrained_vae_model_name_or_path: madebyollin/sdxl-vae-fp16-fix
46
+ prior_generation_precision: null
47
+ prior_loss_weight: 1.0
48
+ prodigy_beta3: null
49
+ prodigy_decouple: true
50
+ prodigy_safeguard_warmup: true
51
+ prodigy_use_bias_correction: true
52
+ push_to_hub: false
53
+ rank: 16
54
+ repeats: 3
55
+ report_to: tensorboard
56
+ resolution: 1024
57
+ resume_from_checkpoint: null
58
+ revision: null
59
+ sample_batch_size: 4
60
+ scale_lr: false
61
+ seed: 42
62
+ snr_gamma: null
63
+ text_encoder_lr: 1.0
64
+ token_abstraction: TOK
65
+ train_batch_size: 2
66
+ train_text_encoder: false
67
+ train_text_encoder_frac: 1.0
68
+ train_text_encoder_ti: true
69
+ train_text_encoder_ti_frac: 0.5
70
+ use_8bit_adam: false
71
+ validation_epochs: 50
72
+ validation_prompt: null
73
+ variant: null
74
+ with_prior_preservation: false
logs/dreambooth-lora-sd-xl/events.out.tfevents.1704066476.r-multimodalart-autotrain-mouse-public-domain-rank16--bf9f7g7qd.211.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:002be0225581977025858ab0229aaa29e206318bdb47b67d92eaa297c64a30ad
3
+ size 67034
mouse-public-domain-rank16.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cd48049e85c062998422ef76b905d5824515559c01208cb1df0a9db5ead27da2
3
+ size 93148104
mouse-public-domain-rank16_emb.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4b6903c7e01ef2894e322ac85bc4c4e371e2cedaacfa1f9d84dc9c3c644bd15
3
+ size 8344
pytorch_lora_weights.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2b729e4f283bee31f842e87357cff7a9622fc896b0f4aa8f05388ef761cd5456
3
+ size 93065304