---
pipeline_tag: robotics
tags:
- lerobot
---
SmolVLA: A vision-language-action model for affordable and efficient robotics

[Paper](https://huggingface.co/papers/2506.01844)

Designed by Hugging Face.

This model has 450M parameters in total.
You can use inside the [LeRobot library](https://github.com/huggingface/lerobot).

Install smolvla extra dependencies:
```bash
pip install -e ".[smolvla]"
```

Example of finetuning the smolvla pretrained model (`smolvla_base`):
```bash
python lerobot/scripts/train.py \
--policy.path=lerobot/smolvla_base \
--dataset.repo_id=danaaubakirova/svla_so100_task1_v3 \
--batch_size=64 \
--steps=200000
```

Example of finetuning the smolvla neural network with pretrained VLM and action expert
intialized from scratch:
```bash
python lerobot/scripts/train.py \
--policy.type=smolvla \
--dataset.repo_id=danaaubakirova/svla_so100_task1_v3 \
--batch_size=64 \
--steps=200000
```

Example of using the smolvla pretrained model outside LeRobot training framework:
```python
policy = SmolVLAPolicy.from_pretrained("lerobot/smolvla_base")
```