MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds
Abstract
MeshCoder reconstructs complex 3D objects from point clouds into editable Blender Python scripts, enhancing shape-to-code reconstruction and 3D shape understanding through a multimodal large language model.
Reconstructing 3D objects into editable programs is pivotal for applications like reverse engineering and shape editing. However, existing methods often rely on limited domain-specific languages (DSLs) and small-scale datasets, restricting their ability to model complex geometries and structures. To address these challenges, we introduce MeshCoder, a novel framework that reconstructs complex 3D objects from point clouds into editable Blender Python scripts. We develop a comprehensive set of expressive Blender Python APIs capable of synthesizing intricate geometries. Leveraging these APIs, we construct a large-scale paired object-code dataset, where the code for each object is decomposed into distinct semantic parts. Subsequently, we train a multimodal large language model (LLM) that translates 3D point clouds into executable Blender Python scripts. Our approach not only achieves superior performance in shape-to-code reconstruction tasks but also facilitates intuitive geometric and topological editing through convenient code modifications. Furthermore, our code-based representation enhances the reasoning capabilities of LLMs in 3D shape understanding tasks. Together, these contributions establish MeshCoder as a powerful and flexible solution for programmatic 3D shape reconstruction and understanding.
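To make the idea of a part-decomposed, editable shape program concrete, here is a minimal sketch in plain Python. It is not MeshCoder's API and does not call Blender: the `Box` class and the `make_table` and `overall_height` helpers are hypothetical stand-ins, assumed only for illustration. The point is that each semantic part is produced by a named, parameterized call, so a geometric edit amounts to changing an argument.

```python
from dataclasses import dataclass
from typing import List, Tuple

Vec3 = Tuple[float, float, float]


@dataclass(frozen=True)
class Box:
    """Hypothetical primitive: one axis-aligned box per semantic part."""
    name: str
    center: Vec3
    size: Vec3

    def bounds(self) -> Tuple[Vec3, Vec3]:
        # Lower and upper corners of the box.
        lo = tuple(c - s / 2 for c, s in zip(self.center, self.size))
        hi = tuple(c + s / 2 for c, s in zip(self.center, self.size))
        return lo, hi


def make_table(top_width: float = 1.2, top_depth: float = 0.8,
               leg_height: float = 0.7, leg_thickness: float = 0.06) -> List[Box]:
    """Illustrative shape program: a table as one tabletop plus four legs."""
    top_thickness = 0.04
    parts = [Box("tabletop",
                 (0.0, 0.0, leg_height + top_thickness / 2),
                 (top_width, top_depth, top_thickness))]
    # Place each leg flush with a corner of the tabletop.
    dx = top_width / 2 - leg_thickness / 2
    dy = top_depth / 2 - leg_thickness / 2
    for i, (sx, sy) in enumerate([(1, 1), (1, -1), (-1, 1), (-1, -1)]):
        parts.append(Box(f"leg_{i}",
                         (sx * dx, sy * dy, leg_height / 2),
                         (leg_thickness, leg_thickness, leg_height)))
    return parts


def overall_height(parts: List[Box]) -> float:
    """Highest z coordinate reached by any part."""
    return max(p.bounds()[1][2] for p in parts)


table = make_table()
taller_table = make_table(leg_height=1.0)  # a geometric edit is a one-argument change
```

In an actual Blender script, each part would presumably be built through `bpy` calls rather than a dataclass, but the editing workflow sketched here (locate the part by name, change its parameters, re-run the script) would be the same in spirit.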
Community
Paper: https://arxiv.org/abs/2508.14879
GitHub: https://github.com/ZhaoyangLyu/MeshCoder
Project Page: https://daibingquan.github.io/MeshCoder
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- OmniPart: Part-Aware 3D Generation with Semantic Decoupling and Structural Cohesion (2025)
- MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh (2025)
- Puppeteer: Rig and Animate Your 3D Models (2025)
- LL3M: Large Language 3D Modelers (2025)
- BANG: Dividing 3D Assets via Generative Exploded Dynamics (2025)
- From One to More: Contextual Part Latents for 3D Generation (2025)
- HPR3D: Hierarchical Proxy Representation for High-Fidelity 3D Reconstruction and Controllable Editing (2025)