Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published 18 days ago • 83
AblationBench Collection This is a collection of datasets used to evaluate language models in the task of ablation planning in empirical AI research. • 4 items • Updated May 16 • 5
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Paper • 2506.04225 • Published Jun 4 • 27
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation Paper • 2504.17502 • Published Apr 24 • 56
Single Image Iterative Subject-driven Generation and Editing Paper • 2503.16025 • Published Mar 20 • 14
Bringing Objects to Life: 4D generation from 3D objects Paper • 2412.20422 • Published Dec 29, 2024 • 42
GLEE: A Unified Framework and Benchmark for Language-based Economic Environments Paper • 2410.05254 • Published Oct 7, 2024 • 85
Make It Count: Text-to-Image Generation with an Accurate Number of Objects Paper • 2406.10210 • Published Jun 14, 2024 • 79
Point-Cloud Completion with Pretrained Text-to-image Diffusion Models Paper • 2306.10533 • Published Jun 18, 2023 • 9