Image - a netzkontrast Collection

netzkontrast 's Collections

LLMs

Speech

Lora

Video

Image

Image

updated 3 days ago

Customizing Text-to-Image Models with a Single Image Pair

Paper • 2405.01536 • Published May 2, 2024 • 20
Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models

Paper • 2404.03913 • Published Apr 5, 2024
LCM-Lookahead for Encoder-based Text-to-Image Personalization

Paper • 2404.03620 • Published Apr 4, 2024 • 1
Customizing Text-to-Image Diffusion with Camera Viewpoint Control

Paper • 2404.12333 • Published Apr 18, 2024 • 1
fka/awesome-chatgpt-prompts

Viewer • Updated 12 days ago • 203 • 6.07k • 6.94k
MohamedRashad/midjourney-detailed-prompts

Viewer • Updated Apr 24, 2024 • 3.05k • 63 • 51
jtatman/stable-diffusion-prompts-uncensored

Viewer • Updated Jan 4, 2024 • 852k • 42 • 15
Gustavosta/Stable-Diffusion-Prompts

Viewer • Updated Sep 18, 2022 • 81.9k • 3.44k • 459
succinctly/midjourney-prompts

Viewer • Updated Jul 22, 2022 • 246k • 122 • 93
succinctly/text2image-prompt-generator

Text Generation • Updated Aug 20, 2022 • 37.7k • 295
alespalla/chatbot_instruction_prompts

Viewer • Updated Oct 16, 2024 • 323k • 444 • 47
MohamedRashad/easy_imageinwords

Viewer • Updated May 13, 2024 • 2.4k • 40 • 3
vivym/midjourney-prompts

Viewer • Updated Nov 15, 2023 • 7.13M • 75 • 41
jtatman/stable-diffusion-prompts-stats-full-uncensored

Viewer • Updated Nov 8, 2024 • 897k • 154 • 60
Gustavosta/MagicPrompt-Dalle

Text Generation • Updated Mar 17, 2023 • 1.24k • 48
Running on Zero

716

😻

Omost
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28, 2024 • 85
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs

Paper • 2408.11813 • Published Aug 21, 2024 • 11
TokenPacker: Efficient Visual Projector for Multimodal LLM

Paper • 2407.02392 • Published Jul 2, 2024 • 21
PALP: Prompt Aligned Personalization of Text-to-Image Models

Paper • 2401.06105 • Published Jan 11, 2024 • 48
Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23, 2024 • 70
Training-Free Consistent Text-to-Image Generation

Paper • 2402.03286 • Published Feb 5, 2024 • 66
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications

Paper • 2408.03703 • Published Aug 7, 2024
AutoPresent: Designing Structured Visuals from Scratch

Paper • 2501.00912 • Published 17 days ago • 8
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published 11 days ago • 48
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

Paper • 2501.05452 • Published 9 days ago • 14