Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding Paper • 2505.16990 • Published May 22 • 21
LightLab: Controlling Light Sources in Images with Diffusion Models Paper • 2505.09608 • Published May 14 • 33
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation Paper • 2505.10292 • Published May 15 • 3
A Brief Review for Compression and Transfer Learning Techniques in DeepFake Detection Paper • 2504.21066 • Published Apr 29 • 1
AWARE-NET: Adaptive Weighted Averaging for Robust Ensemble Network in Deepfake Detection Paper • 2505.00312 • Published May 1 • 2
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation Paper • 2504.02160 • Published Apr 2 • 37
Community Forensics: Using Thousands of Generators to Train Fake Image Detectors Paper • 2411.04125 • Published Nov 6, 2024 • 1
LEGION: Learning to Ground and Explain for Synthetic Image Detection Paper • 2503.15264 • Published Mar 19 • 21
Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator Paper • 2411.15466 • Published Nov 23, 2024 • 39
Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla • Jan 20 • 69