	---
	license: mit
	datasets:
	- allenai/objaverse
	- allenai/objaverse-xl
	- qizekun/OrienText300K
	- tiange/Cap3D
	language:
	- en
	---


	# PointSO

	Paper: https://arxiv.org/abs/2502.13143
	<br>
	Project Page: https://qizekun.github.io/sofar/
	<br>
	GitHub Code: https://github.com/qizekun/SoFar

	We introduce the concept of semantic orientation, which represents object orientation conditioned on open-vocabulary language.
	PointSO is part of [SoFar](https://github.com/qizekun/SoFar), a 6-DoF system for spatial reasoning and robotic manipulation.
	The released weights are available on [Hugging Face](https://huggingface.co/qizekun/PointSO), and the code is in the [orientation](https://github.com/qizekun/SoFar/tree/main/orientation) folder.
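
	One way to fetch the released weights is the Hugging Face Hub CLI. The helper below is a minimal sketch, not part of the SoFar repository: the function name `download_pointso`, the `DRY_RUN` switch, and the default target directory `orientation/checkpoints` are assumptions made for illustration, and it assumes `huggingface-cli` is installed (for example via `pip install -U "huggingface_hub[cli]"`).

	```bash
	# Hypothetical helper (not from the SoFar repo): download the PointSO weights.
	# $1 is the target directory; the default below is an assumed location.
	download_pointso() {
	    dest="${1:-orientation/checkpoints}"
	    if [ "${DRY_RUN:-0}" = "1" ]; then
	        # Only print the command, so it can be inspected without network access.
	        echo "huggingface-cli download qizekun/PointSO --local-dir $dest"
	    else
	        huggingface-cli download qizekun/PointSO --local-dir "$dest"
	    fi
	}

	# Usage (commented out so that sourcing this sketch has no side effects):
	# download_pointso orientation/checkpoints
	```

	Running it with `DRY_RUN=1` first shows the exact command before anything is downloaded.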

	## Pretrain
	Download the Point-MAE pretrained weights to use as initialization.
	```bash
	wget https://github.com/Pang-Yatian/Point-MAE/releases/download/main/pretrain.pth -P orientation/
	```
	Prepare the OrienText300K dataset following [DATASET.md](https://github.com/qizekun/SoFar/blob/main/datasets/DATASET.md), then launch pretraining.
	```bash
	cd orientation
	sh train_ddp.sh
	```
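
	Because pretraining starts from the Point-MAE checkpoint, it can help to confirm that the download succeeded before launching a long job. The following is a minimal sketch under stated assumptions: `check_init_ckpt` is a hypothetical helper, and it assumes the training script looks for `pretrain.pth` in the `orientation/` folder, which is where the `wget` command above saves it.

	```bash
	# Hypothetical preflight check (not from the SoFar repo).
	# Succeeds if the checkpoint file exists and is non-empty, fails otherwise.
	check_init_ckpt() {
	    ckpt="${1:-pretrain.pth}"   # assumed path, relative to orientation/
	    if [ -s "$ckpt" ]; then
	        echo "found: $ckpt"
	    else
	        echo "missing or empty: $ckpt" >&2
	        return 1
	    fi
	}

	# Usage from inside orientation/ (commented out to avoid side effects):
	# check_init_ckpt pretrain.pth && sh train_ddp.sh
	```

	Chaining the check with `&&` means training only starts when the checkpoint is actually present.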

	## Finetune
	Prepare the Open6DOR finetuning dataset following [DATASET.md](https://github.com/qizekun/SoFar/blob/main/datasets/DATASET.md).
	The dataset is generated in Isaac Sim using a variety of assets from Open6DOR.
	Finetuning PointSO significantly improves performance on the Open6DOR rotation track and 6-DoF track.
	We recommend using the finetuned model for the Open6DOR V2 evaluation.
	```bash
	cd orientation
	sh train_ddp_ft.sh
	```