Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention Paper • 2510.04212 • Published 7 days ago • 20
Paper2Video: Automatic Video Generation from Scientific Papers Paper • 2510.05096 • Published 6 days ago • 88
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models Paper • 2510.05034 • Published 6 days ago • 42
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published 6 days ago • 62
Reactive Transformer (RxT) -- Stateful Real-Time Processing for Event-Driven Reactive Language Models Paper • 2510.03561 • Published 8 days ago • 22
Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training Paper • 2510.04996 • Published 6 days ago • 14
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information Paper • 2510.03632 • Published 8 days ago • 38
Self-Improvement in Multimodal Large Language Models: A Survey Paper • 2510.02665 • Published 9 days ago • 17
Free Lunch Alignment of Text-to-Image Diffusion Models without Preference Image Pairs Paper • 2509.25771 • Published 12 days ago • 10
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation Paper • 2510.00515 • Published 11 days ago • 38
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights Paper • 2509.22944 • Published 15 days ago • 73
CLUE: Non-parametric Verification from Experience via Hidden-State Clustering Paper • 2510.01591 • Published 10 days ago • 22
Interactive Training: Feedback-Driven Neural Network Optimization Paper • 2510.02297 • Published 10 days ago • 38
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder Paper • 2509.25182 • Published 13 days ago • 35
OceanGym: A Benchmark Environment for Underwater Embodied Agents Paper • 2509.26536 • Published 12 days ago • 33
FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing Paper • 2509.22244 • Published 16 days ago • 5