---
library_name: lerobot
tags:
- model_hub_mixin
- pytorch_model_hub_mixin
- robotics
- dot
license: apache-2.0
datasets:
- lerobot/pusht
pipeline_tag: robotics
---
# Model Card for the Decoder Only Transformer (DOT) Policy on the PushT images dataset

Read more about the model and implementation details in the [DOT Policy repository](https://github.com/IliaLarchenko/dot_policy).

This model is trained using the [LeRobot library](https://huggingface.co/lerobot) and achieves state-of-the-art behavior cloning results on the PushT images dataset: a 74.2% success rate and a 0.936 average max reward, compared to ~69% for the previous state-of-the-art models (Diffusion Policy and VQ-BeT perform about the same).

This result is achieved without checkpoint selection and is easy to reproduce.

You can use this model by installing LeRobot from [this branch](https://github.com/IliaLarchenko/lerobot/tree/dot).
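One way to set this up is sketched below, assuming a standard editable install works for this fork (the exact steps and extras may differ):

```bash
# Clone the fork on the dot branch and install it in editable mode
git clone -b dot https://github.com/IliaLarchenko/lerobot.git
cd lerobot
pip install -e .

# Running the PushT simulation locally will likely also need the pusht extra:
# pip install -e ".[pusht]"
```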
To train the model:

```bash
python lerobot/scripts/train.py \
  --policy.type=dot \
  --dataset.repo_id=lerobot/pusht \
  --env.type=pusht \
  --env.task=PushT-v0 \
  --output_dir=outputs/train/pusht_images \
  --batch_size=24 \
  --log_freq=1000 \
  --eval_freq=10000 \
  --save_freq=50000 \
  --offline.steps=1000000 \
  --seed=100000 \
  --wandb.enable=true \
  --num_workers=24 \
  --use_amp=true \
  --device=cuda \
  --policy.return_every_n=2
```
To evaluate the model:

```bash
python lerobot/scripts/eval.py \
  --policy.path=IliaLarchenko/dot_pusht_images \
  --env.type=pusht \
  --env.task=PushT-v0 \
  --eval.n_episodes=1000 \
  --eval.batch_size=100 \
  --seed=1000000
```
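To evaluate a checkpoint from your own training run instead of the published weights, `--policy.path` can also point at a local checkpoint directory. The example below is a sketch: the exact checkpoint layout (step subdirectories, `last` symlink) depends on your LeRobot version and the `--save_freq` you used.

```bash
# Hypothetical local path produced by the training command above;
# adjust it to match your output_dir and checkpoint step.
python lerobot/scripts/eval.py \
  --policy.path=outputs/train/pusht_images/checkpoints/last/pretrained_model \
  --env.type=pusht \
  --env.task=PushT-v0 \
  --eval.n_episodes=100 \
  --eval.batch_size=50 \
  --seed=1000000
```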
Model size:

- Total parameters: 14.1M
- Trainable parameters: 2.9M