Update README.md
README.md CHANGED
@@ -110,20 +110,7 @@ Here are the tasks `Florence-2` could perform:
 <details>
 <summary> Click to expand </summary>
 
-### OCR
 
-```python
-prompt = "<OCR>"
-run_example(prompt)
-```
-
-### OCR with Region
-OCR with region output format:
-{'\<OCR_WITH_REGION>': {'quad_boxes': [[x1, y1, x2, y2, x3, y3, x4, y4], ...], 'labels': ['text1', ...]}}
-```python
-prompt = "<OCR_WITH_REGION>"
-run_example(prompt)
-```
 
 ### Caption
 ```python
@@ -143,6 +130,16 @@ prompt = "<MORE_DETAILED_CAPTION>"
 run_example(prompt)
 ```
 
+### Caption to Phrase Grounding
+caption to phrase grounding task requires additional text input, i.e. caption.
+
+Caption to phrase grounding results format:
+{'\<CAPTION_TO_PHRASE_GROUNDING>': {'bboxes': [[x1, y1, x2, y2], ...], 'labels': ['', '', ...]}}
+```python
+task_prompt = "<CAPTION_TO_PHRASE_GROUNDING>"
+results = run_example(task_prompt, text_input="A green car parked in front of a yellow building.")
+```
+
 ### Object Detection
 
 OD results format:
@@ -172,14 +169,20 @@ prompt = "<REGION_PROPOSAL>"
 run_example(prompt)
 ```
 
-### Caption to Phrase Grounding
-caption to phrase grounding task requires additional text input, i.e. caption.
-
-Caption to phrase grounding results format:
-{'\<CAPTION_TO_PHRASE_GROUNDING>': {'bboxes': [[x1, y1, x2, y2], ...], 'labels': ['', '', ...]}}
-```python
-task_prompt = "<CAPTION_TO_PHRASE_GROUNDING>"
-results = run_example(task_prompt, text_input="A green car parked in front of a yellow building.")
+
+### OCR
+
+```python
+prompt = "<OCR>"
+run_example(prompt)
+```
+
+### OCR with Region
+OCR with region output format:
+{'\<OCR_WITH_REGION>': {'quad_boxes': [[x1, y1, x2, y2, x3, y3, x4, y4], ...], 'labels': ['text1', ...]}}
+```python
+prompt = "<OCR_WITH_REGION>"
+run_example(prompt)
 ```
 
 for More detailed examples, please refer to [notebook](https://huggingface.co/microsoft/Florence-2-large/blob/main/sample_inference.ipynb)
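Note: every snippet in this diff calls a `run_example` helper defined earlier in the model card. For readers landing on this commit directly, here is a minimal sketch of such a helper, assuming the `microsoft/Florence-2-large` checkpoint; the sample image URL and generation settings are illustrative, not part of this change.

```python
# Minimal sketch of the `run_example` helper the snippets above assume.
import requests
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "microsoft/Florence-2-large"
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)

# Illustrative sample image; any RGB image works.
url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/tasks/car.jpg"
image = Image.open(requests.get(url, stream=True).raw).convert("RGB")

def run_example(task_prompt, text_input=None):
    # The prompt is the task token alone, or the task token followed by the
    # extra text input (e.g. the caption for <CAPTION_TO_PHRASE_GROUNDING>).
    prompt = task_prompt if text_input is None else task_prompt + text_input
    inputs = processor(text=prompt, images=image, return_tensors="pt")
    generated_ids = model.generate(
        input_ids=inputs["input_ids"],
        pixel_values=inputs["pixel_values"],
        max_new_tokens=1024,
        num_beams=3,
    )
    generated_text = processor.batch_decode(generated_ids, skip_special_tokens=False)[0]
    # post_process_generation parses the raw text into the per-task dicts
    # shown in the diff (bboxes/labels, quad_boxes/labels, ...).
    parsed = processor.post_process_generation(
        generated_text, task=task_prompt, image_size=(image.width, image.height)
    )
    print(parsed)
    return parsed
```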
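The result dicts quoted above are plain Python structures keyed by the task token, so consuming them is just dictionary access. As a hedged illustration (assuming a PIL `image` and `results` shaped exactly as documented in the diff), the two box formats can be overlaid like this:

```python
# Sketch: overlay Florence-2 region outputs on the input image.
from PIL import ImageDraw

def draw_bboxes(image, results):
    # <CAPTION_TO_PHRASE_GROUNDING> returns axis-aligned boxes [x1, y1, x2, y2].
    data = results["<CAPTION_TO_PHRASE_GROUNDING>"]
    draw = ImageDraw.Draw(image)
    for (x1, y1, x2, y2), label in zip(data["bboxes"], data["labels"]):
        draw.rectangle([x1, y1, x2, y2], outline="red", width=3)
        draw.text((x1, y1), label, fill="red")
    return image

def draw_quad_boxes(image, results):
    # <OCR_WITH_REGION> returns quadrilaterals [x1, y1, x2, y2, x3, y3, x4, y4]:
    # four corner points, so a polygon is needed rather than a rectangle.
    data = results["<OCR_WITH_REGION>"]
    draw = ImageDraw.Draw(image)
    for quad, text in zip(data["quad_boxes"], data["labels"]):
        draw.polygon(quad, outline="blue")
        draw.text((quad[0], quad[1]), text, fill="blue")
    return image
```

For example, `draw_bboxes(image.copy(), results)` after the grounding snippet above renders each grounded phrase.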