jadechoghari
committed on
Commit
•
07fb5ea
1
Parent(s):
d068b7e
Update README.md
README.md
CHANGED
@@ -5,7 +5,7 @@ pipeline_tag: image-text-to-text
 
 Ferret-UI is the first UI-centric multimodal large language model (MLLM) designed for referring, grounding, and reasoning tasks.
 Built on Gemma-2B and Llama-3-8B, it is capable of executing complex UI tasks.
-This is the Gemma-2B version of ferret-ui. It follows from [this paper](https://arxiv.org/pdf/2404.05719) by Apple.
+This is the **Gemma-2B** version of ferret-ui. It follows from [this paper](https://arxiv.org/pdf/2404.05719) by Apple.
 
 
 ## How to Use 🤗📱