update paper link.
Browse files
README.md
CHANGED
@@ -9,7 +9,7 @@ base_model:
|
|
9 |
This model was introduced in the paper [**GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents**](https://aka.ms/GUI-Actor).
|
10 |
It is developed based on [Qwen2-VL-7B-Instruct ](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here (coming soon)]().
|
11 |
|
12 |
-
For more details on model design and evaluation, please check: [π Project Page](https://aka.ms/GUI-Actor) | [π» Github Repo](https://github.com/microsoft/GUI-Actor) | [π Paper]().
|
13 |
|
14 |
| Model Name | Hugging Face Link |
|
15 |
|--------------------------------------------|--------------------------------------------|
|
@@ -126,10 +126,10 @@ print(f"Predicted click point: [{round(px, 4)}, {round(py, 4)}]")
|
|
126 |
title={GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents},
|
127 |
author={Qianhui Wu and Kanzhi Cheng and Rui Yang and Chaoyun Zhang and Jianwei Yang and Huiqiang Jiang and Jian Mu and Baolin Peng and Bo Qiao and Reuben Tan and Si Qin and Lars Liden and Qingwei Lin and Huan Zhang and Tong Zhang and Jianbing Zhang and Dongmei Zhang and Jianfeng Gao},
|
128 |
year={2025},
|
129 |
-
eprint={},
|
130 |
archivePrefix={arXiv},
|
131 |
primaryClass={cs.CV},
|
132 |
-
url={},
|
133 |
}
|
134 |
```
|
135 |
|
|
|
9 |
This model was introduced in the paper [**GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents**](https://aka.ms/GUI-Actor).
|
10 |
It is developed based on [Qwen2-VL-7B-Instruct ](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct), augmented by an attention-based action head and finetuned to perform GUI grounding using the dataset [here (coming soon)]().
|
11 |
|
12 |
+
For more details on model design and evaluation, please check: [π Project Page](https://aka.ms/GUI-Actor) | [π» Github Repo](https://github.com/microsoft/GUI-Actor) | [π Paper](https://www.arxiv.org/pdf/2506.03143).
|
13 |
|
14 |
| Model Name | Hugging Face Link |
|
15 |
|--------------------------------------------|--------------------------------------------|
|
|
|
126 |
title={GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents},
|
127 |
author={Qianhui Wu and Kanzhi Cheng and Rui Yang and Chaoyun Zhang and Jianwei Yang and Huiqiang Jiang and Jian Mu and Baolin Peng and Bo Qiao and Reuben Tan and Si Qin and Lars Liden and Qingwei Lin and Huan Zhang and Tong Zhang and Jianbing Zhang and Dongmei Zhang and Jianfeng Gao},
|
128 |
year={2025},
|
129 |
+
eprint={2506.03143},
|
130 |
archivePrefix={arXiv},
|
131 |
primaryClass={cs.CV},
|
132 |
+
url={https://www.arxiv.org/pdf/2506.03143},
|
133 |
}
|
134 |
```
|
135 |
|