Modifying Large Language Model Post-Training for Diverse Creative Writing Paper • 2503.17126 • Published 4 days ago • 23
Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models Paper • 2503.16257 • Published 5 days ago • 22
BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing Paper • 2503.13434 • Published 8 days ago • 24
DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models Paper • 2503.12885 • Published 8 days ago • 41
abliteration loras Collection Extracted adapters for removing censorship in models • 3 items • Updated Jan 21 • 2
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation Paper • 2503.10618 • Published 12 days ago • 17
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper • 2503.09516 • Published 13 days ago • 24
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 14 days ago • 343
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Paper • 2502.20172 • Published 26 days ago • 28
EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer Paper • 2503.07027 • Published 15 days ago • 26
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning Paper • 2503.07365 • Published 15 days ago • 54
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 213
DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs Paper • 2503.07067 • Published 15 days ago • 29
Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published 19 days ago • 84
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Paper • 2503.04724 • Published 19 days ago • 66