gr00t model - ๐งช phosphobot training pipeline
- Dataset: cmercier/test_one_pen_2_sept
- Wandb run id: None
Error Traceback
We faced an issue while training your model.
Traceback (most recent call last):
File "/root/src/helper.py", line 139, in train_gr00t_on_modal
trainer.train(
File "/root/phosphobot/am/gr00t.py", line 1215, in train
asyncio.run(
File "/opt/conda/lib/python3.11/asyncio/runners.py", line 190, in run
return runner.run(main)
^^^^^^^^^^^^^^^^
File "/opt/conda/lib/python3.11/asyncio/runners.py", line 118, in run
return self._loop.run_until_complete(task)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/opt/conda/lib/python3.11/asyncio/base_events.py", line 654, in run_until_complete
return future.result()
^^^^^^^^^^^^^^^
File "/root/phosphobot/am/gr00t.py", line 1425, in _call_training_script
raise RuntimeError(error_msg)
RuntimeError: Training process failed with exit code 2:
2025-09-02 17:57:22.182764: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1515] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered
2025-09-02 17:57:22.190674: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 AVX512F AVX512_VNNI FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
2025-09-02 17:57:23.541518: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT
`use_fast` is set to `True` but the image processor class does not have a fast version. Falling back to the slow version.
โญโ Unrecognized options โโโโโโโโโโโโโโโโโโโโโโโโโโโฎ
โ Unrecognized options: --use-fast โ
โ โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ โ
โ For full helptext, run gr00t_finetune.py --help โ
โฐโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโฏ
Training parameters
{
"validation_dataset_name": null,
"batch_size": 107,
"num_epochs": 10,
"save_steps": 1000,
"learning_rate": 0.0001,
"data_dir": "/tmp/outputs/data",
"validation_data_dir": "/tmp/outputs/validation_data",
"output_dir": "/tmp/outputs/train",
"use_fast": "False"
}
๐ Get Started: docs.phospho.ai
๐ค Get your robot: robots.phospho.ai