Nguyen Van Thanh's picture

5058

Nguyen Van Thanh

NguyenVanThanhHust

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 21 hours ago

G-CUT3R: Guided 3D Reconstruction with Camera and Depth Prior Integration

upvoted a paper about 21 hours ago

Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models

upvoted a paper about 21 hours ago

Representing Speech Through Autoregressive Prediction of Cochlear Tokens

View all activity

Organizations

None yet

upvoted 20 papers about 21 hours ago

G-CUT3R: Guided 3D Reconstruction with Camera and Depth Prior Integration

Paper • 2508.11379 • Published 9 days ago • 12

Lumen: Consistent Video Relighting and Harmonious Background Replacement with Video Generative Models

Paper • 2508.12945 • Published 5 days ago • 12

Representing Speech Through Autoregressive Prediction of Cochlear Tokens

Paper • 2508.11598 • Published 8 days ago • 16

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Paper • 2508.13009 • Published 5 days ago • 21

HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds

Paper • 2508.12782 • Published 6 days ago • 24

Has GPT-5 Achieved Spatial Intelligence? An Empirical Study

Paper • 2508.13142 • Published 5 days ago • 31

When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs

Paper • 2508.11383 • Published 9 days ago • 38

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Paper • 2508.09834 • Published 10 days ago • 46

Next Visual Granularity Generation

Paper • 2508.12811 • Published 6 days ago • 45

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Paper • 2508.13154 • Published 5 days ago • 57

Ovis2.5 Technical Report

Paper • 2508.11737 • Published 8 days ago • 99

Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds

Paper • 2508.14892 • Published 3 days ago • 4

"Does the cafe entrance look accessible? Where is the door?" Towards Geospatial AI Agents for Visual Inquiries

Paper • 2508.15752 • Published 2 days ago • 5

aiXiv: A Next-Generation Open Access Ecosystem for Scientific Discovery Generated by AI Scientists

Paper • 2508.15126 • Published 3 days ago • 14

ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling

Paper • 2508.15767 • Published 2 days ago • 8

Visual Autoregressive Modeling for Instruction-Guided Image Editing

Paper • 2508.15772 • Published 2 days ago • 7

A Survey on Large Language Model Benchmarks

Paper • 2508.15361 • Published 3 days ago • 12

SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass

Paper • 2508.15769 • Published 2 days ago • 13

Waver: Wave Your Way to Lifelike Video Generation

Paper • 2508.15761 • Published 2 days ago • 22

Mobile-Agent-v3: Foundamental Agents for GUI Automation

Paper • 2508.15144 • Published 3 days ago • 44