Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers Paper • 2506.03065 • Published 3 days ago • 27
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • 4 days ago • 37
view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data By danaaubakirova and 8 others • 4 days ago • 94
One RL to See Them All: Visual Triple Unified Reinforcement Learning Paper • 2505.18129 • Published 14 days ago • 59
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding Paper • 2505.22618 • Published 9 days ago • 39
view article Article 🌙 Introducing **Moon**: Storytelling Generator Model By kulia-moon and 1 other • 8 days ago • 6
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO Paper • 2505.22453 • Published 9 days ago • 45
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs By wenhuach and 8 others • Apr 29 • 32
s3: You Don't Need That Much Data to Train a Search Agent via RL Paper • 2505.14146 • Published 18 days ago • 17
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning Paper • 2505.17667 • Published 15 days ago • 85
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published 15 days ago • 77
Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval Paper • 2505.16967 • Published 15 days ago • 22
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • 17 days ago • 140
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective Paper • 2505.15045 • Published 17 days ago • 53
view article Article Falcon-Arabic: A Breakthrough in Arabic Language Models By tiiuae and 7 others • 17 days ago • 30
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models Paper • 2505.10554 • Published 22 days ago • 118