Robotics
Transformers
Safetensors
English
VLA
Hume-vla commited on
Commit
56837e2
·
verified ·
1 Parent(s): 44388c7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +76 -3
README.md CHANGED
@@ -1,3 +1,76 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - IPEC-COMMUNITY/libero_object_no_noops_lerobot
5
+ language:
6
+ - en
7
+ base_model:
8
+ - lerobot/pi0
9
+ pipeline_tag: robotics
10
+ library_name: transformers
11
+ tags:
12
+ - VLA
13
+ ---
14
+ # Model Card for Hume-Libero_Object
15
+
16
+
17
+ <!-- Provide a quick summary of what the model is/does. -->
18
+ A Dual-System Visual-Language-Action model with System-2 thinking trained on Libero-Object.
19
+ - Paper: [https://arxiv.org/abs/2505.21432](https://arxiv.org/abs/2505.21432)
20
+ - Homepage: [https://hume-vla.github.io](https://hume-vla.github.io)
21
+ - Codebase: [🦾 Hume: A Dual-System VLA with System2 Thinking](https://github.com/hume-vla/hume) ![GitHub Repo stars](https://img.shields.io/github/stars/hume-vla/hume)
22
+
23
+
24
+ ## Uses
25
+
26
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
27
+ - If you want to reproduce the results in paper, follow the [instruction](https://github.com/hume-vla/hume/tree/main/experiments/libero)
28
+ - If you want to directly use the model:
29
+ ```python
30
+ from hume import HumePolicy
31
+ import numpy as np
32
+
33
+ # load policy
34
+ hume = HumePolicy.from_pretrained("/path/to/checkpoints")
35
+
36
+ # config Test-Time Computing args
37
+ hume.init_infer(
38
+ infer_cfg=dict(
39
+ replan_steps=8,
40
+ s2_replan_steps=16,
41
+ s2_candidates_num=5,
42
+ noise_temp_lower_bound=1.0,
43
+ noise_temp_upper_bound=1.0,
44
+ time_temp_lower_bound=0.9,
45
+ time_temp_upper_bound=1.0,
46
+ post_process_action=True,
47
+ device="cuda",
48
+ )
49
+ )
50
+
51
+ # prepare observations
52
+ observation = {
53
+ "observation.images.image": np.zeros((1,224,224,3), dtype = np.uint8), # (B, H, W, C)
54
+ "observation.images.wrist_image": np.zeros((1,224,224,3), dtype = np.uint8), # (B, H, W, C)
55
+ "observation.state": np.zeros((1, 7)), # (B, state_dim)
56
+ "task": ["Lift the papper"],
57
+ }
58
+
59
+ # Infer the action
60
+ action = hume.infer(observation) # (B, action_dim)
61
+
62
+ ```
63
+
64
+ ## Citation
65
+
66
+ <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
67
+
68
+
69
+ ```BibTeX
70
+ @article{song2025hume,
71
+ title={Hume: Introducing System-2 Thinking in Visual-Language-Action Model},
72
+ author={Anonimous Authors},
73
+ journal={arXiv preprint arXiv:2505.21432},
74
+ year={2025}
75
+ }
76
+ ```