Daily Papers

by AK and the research community

Jul 11

Submitted by

Yukang

Scaling RL to Long Videos

·
14 authors

Submitted by

ai-alanov

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

·
4 authors

Submitted by

HaochenWang

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

·
12 authors

Submitted by

ChaimZhu

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

·
7 authors

Submitted by

js-hyun

Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs

·
9 authors

Submitted by

stzhao

PyVision: Agentic Vision with Dynamic Tooling

·
7 authors

Submitted by

Diankun

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

·
7 authors

Submitted by

EthanTaylor

LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS

·
7 authors

Submitted by

Franck-Dernoncourt

A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality

·
29 authors

Submitted by

zhoutianyi

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

·
3 authors

Submitted by

bhheo

Token Bottleneck: One Token to Remember Dynamics

·
5 authors

Submitted by

SSamDav

Dynamic Chunking for End-to-End Hierarchical Sequence Modeling

·
3 authors

Submitted by

Xuandong

Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models

·
6 authors

Submitted by

envomp

Beyond the Linear Separability Ceiling

·
3 authors

Submitted by

dbralios

Re-Bottleneck: Latent Re-Structuring for Neural Audio Autoencoders

·
3 authors

Submitted by

Bochkov

Growing Transformers: Modular Composition and Layer-wise Expansion on a Frozen Substrate

·
1 author

Submitted by

xianbao

SciMaster: Towards General-Purpose Scientific AI Agents, Part I. X-Master as Foundation: Can We Lead on Humanity's Last Exam?

·
11 authors

Submitted by

Bochkov

Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations

·
1 author