Update README.md
- **[2025-08-02]**: We release the **Being-H0** codebase and pretrained models! Check our [Hugging Face Model Hub](https://huggingface.co/BeingBeyond/Being-H0) for more details. 🔥🔥🔥
- **[2025-07-21]**: We publish **Being-H0**! Check our paper [here](https://arxiv.org/abs/2507.15597). 🎉🎉🎉

## Model Checkpoints

Download pre-trained models from Hugging Face:

| Model Type | Model Name | Parameters | Description |
|------------|------------|------------|-------------|
| **Motion Model** | [Being-H0-GRVQ-8K](https://huggingface.co/BeingBeyond/Being-H0-GRVQ-8K) | - | Motion tokenizer |
| **VLA Pre-trained** | [Being-H0-1B-2508](https://huggingface.co/BeingBeyond/Being-H0-1B-2508) | 1B | Base vision-language-action model |
| **VLA Pre-trained** | [Being-H0-8B-2508](https://huggingface.co/BeingBeyond/Being-H0-8B-2508) | 8B | Base vision-language-action model |
| **VLA Pre-trained** | [Being-H0-14B-2508](https://huggingface.co/BeingBeyond/Being-H0-14B-2508) | 14B | Base vision-language-action model |
| **VLA Post-trained** | [Being-H0-8B-Align-2508](https://huggingface.co/BeingBeyond/Being-H0-8B-Align-2508) | 8B | Fine-tuned for robot alignment |
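
To fetch a checkpoint locally, one option is the standard Hugging Face CLI; the target directory below is only an illustration:

```bash
# Download the 1B pretrained model into a local checkpoints directory
# (any repository ID from the table above can be substituted).
huggingface-cli download BeingBeyond/Being-H0-1B-2508 --local-dir ./checkpoints/Being-H0-1B-2508
```
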
## Setup

### Clone repository

- Download Models and Code (the downloaded file should have the format `mano_v*_*.zip`). Note that all code and data from this download fall under the [MANO license](http://mano.is.tue.mpg.de/license).
- Unzip the archive and copy the contents of the `mano_v*_*/` folder into the `beingvla/models/motion/mano/` folder, as sketched below.
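
For example, assuming the downloaded archive is named `mano_v1_2.zip` (a hypothetical version number; use your actual file name), the steps might look like:

```bash
# Hypothetical archive name; replace mano_v1_2 with the version you downloaded.
unzip mano_v1_2.zip
cp -r mano_v1_2/* beingvla/models/motion/mano/
```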

## Inference

### Motion Generation

```bash
python -m beingvla.models.motion.m2m.aligner.eval_policy \
    ...
    --action-chunk-length 16
```
116 |
|
117 |
+
## Contributing and Building on Being-H0
|
|
|
|
|
118 |
|
119 |
+
We encourage researchers and practitioners to leverage Being-H0 as a foundation for their own creative experiments and applications. Whether you're adapting Being-H0 to new robotic platforms, exploring novel hand manipulation tasks, or extending the model to new domains, our modular codebase is designed to support your innovations. We welcome contributions of all kinds - from bug fixes and documentation improvements to new features and model architectures. By building on Being-H0 together, we can advance the field of dexterous vision-language-action modeling and enable robots to understand and replicate the rich complexity of human hand movements. Join us in making robotic manipulation more intuitive, capable, and accessible to all.
|

## Citation

If you find our work useful, please consider citing us and giving a star to our repository! 🎉🎉🎉