MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds Paper • 2508.14879 • Published 4 days ago • 46
Advances in Speech Separation: Techniques, Challenges, and Future Trends Paper • 2508.10830 • Published 10 days ago • 12
Mind the Generation Process: Fine-Grained Confidence Estimation During LLM Generation Paper • 2508.12040 • Published 8 days ago • 13
Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer Paper • 2508.09131 • Published 12 days ago • 14
MultiRef: Controllable Image Generation with Multiple Visual References Paper • 2508.06905 • Published 15 days ago • 19
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model Paper • 2508.13009 • Published 6 days ago • 22
When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs Paper • 2508.11383 • Published 9 days ago • 38
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing Paper • 2508.10881 • Published 10 days ago • 50
VertexRegen: Mesh Generation with Continuous Level of Detail Paper • 2508.09062 • Published 12 days ago • 33
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation Paper • 2508.05399 • Published 17 days ago • 16
Train Long, Think Short: Curriculum Learning for Efficient Reasoning Paper • 2508.08940 • Published 12 days ago • 22
Story2Board: A Training-Free Approach for Expressive Storyboard Generation Paper • 2508.09983 • Published 11 days ago • 63
CharacterShot: Controllable and Consistent 4D Character Animation Paper • 2508.07409 • Published 14 days ago • 37
GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing Paper • 2508.02831 • Published 20 days ago • 11
VLM4D: Towards Spatiotemporal Awareness in Vision Language Models Paper • 2508.02095 • Published 20 days ago • 6
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 22 days ago • 221