netzkontrast
's Collections
Customizing Text-to-Image Models with a Single Image Pair
Paper
•
2405.01536
•
Published
•
20
Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models
Paper
•
2404.03913
•
Published
LCM-Lookahead for Encoder-based Text-to-Image Personalization
Paper
•
2404.03620
•
Published
•
1
Customizing Text-to-Image Diffusion with Camera Viewpoint Control
Paper
•
2404.12333
•
Published
•
1
fka/awesome-chatgpt-prompts
Viewer
•
Updated
•
203
•
6.07k
•
6.94k
MohamedRashad/midjourney-detailed-prompts
Viewer
•
Updated
•
3.05k
•
63
•
51
jtatman/stable-diffusion-prompts-uncensored
Viewer
•
Updated
•
852k
•
42
•
15
Gustavosta/Stable-Diffusion-Prompts
Viewer
•
Updated
•
81.9k
•
3.44k
•
459
succinctly/midjourney-prompts
Viewer
•
Updated
•
246k
•
122
•
93
succinctly/text2image-prompt-generator
Text Generation
•
Updated
•
37.7k
•
295
alespalla/chatbot_instruction_prompts
Viewer
•
Updated
•
323k
•
444
•
47
MohamedRashad/easy_imageinwords
Viewer
•
Updated
•
2.4k
•
40
•
3
Viewer
•
Updated
•
7.13M
•
75
•
41
jtatman/stable-diffusion-prompts-stats-full-uncensored
Viewer
•
Updated
•
897k
•
154
•
60
Gustavosta/MagicPrompt-Dalle
Text Generation
•
Updated
•
1.24k
•
48
😻
Omost
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of
Encoders
Paper
•
2408.15998
•
Published
•
85
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual
Integration in MLLMs
Paper
•
2408.11813
•
Published
•
11
TokenPacker: Efficient Visual Projector for Multimodal LLM
Paper
•
2407.02392
•
Published
•
21
PALP: Prompt Aligned Personalization of Text-to-Image Models
Paper
•
2401.06105
•
Published
•
48
Genie: Generative Interactive Environments
Paper
•
2402.15391
•
Published
•
70
Training-Free Consistent Text-to-Image Generation
Paper
•
2402.03286
•
Published
•
66
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for
Efficient Mobile Applications
Paper
•
2408.03703
•
Published
AutoPresent: Designing Structured Visuals from Scratch
Paper
•
2501.00912
•
Published
•
8
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One
Vision Token
Paper
•
2501.03895
•
Published
•
48
ReFocus: Visual Editing as a Chain of Thought for Structured Image
Understanding
Paper
•
2501.05452
•
Published
•
14