FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models Paper • 2501.01986 • Published Dec 30, 2024 • 1
Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better Paper • 2404.02241 • Published Apr 2, 2024 • 2
FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation Paper • 2506.18899 • Published Jun 23 • 5
MBQ: Modality-Balanced Quantization for Large Vision-Language Models Paper • 2412.19509 • Published Dec 27, 2024
Latent Zoning Network: A Unified Principle for Generative Modeling, Representation Learning, and Classification Paper • 2509.15591 • Published Sep 19 • 45
GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration Paper • 2412.04440 • Published Dec 5, 2024 • 22
Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching Paper • 2412.17153 • Published Dec 22, 2024 • 39
A Survey on Efficient Inference for Large Language Models Paper • 2404.14294 • Published Apr 22, 2024 • 3
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding Paper • 2410.01699 • Published Oct 2, 2024 • 18
MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression Paper • 2406.14909 • Published Jun 21, 2024 • 16
DiTFastAttn: Attention Compression for Diffusion Transformer Models Paper • 2406.08552 • Published Jun 12, 2024 • 25
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation Paper • 2406.02540 • Published Jun 4, 2024 • 3
MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization Paper • 2405.17873 • Published May 28, 2024 • 3
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis Paper • 2405.14224 • Published May 23, 2024 • 16
A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models Paper • 2312.07243 • Published Dec 12, 2023
LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K Paper • 2402.05136 • Published Feb 6, 2024 • 1
FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs Paper • 2401.03868 • Published Jan 8, 2024 • 1