Update README.md

update code example

README.md CHANGED
@@ -30,21 +30,21 @@ You can use the raw model for optical character recognition (OCR) on text images
 Here is how to use this model in PyTorch:

 ```python
-from transformers import
+from transformers import MgpstrProcessor, MgpstrForSceneTextRecognition
 import requests
 from PIL import Image

-processor =
-model =
+processor = MgpstrProcessor.from_pretrained('alibaba-damo/mgp-str-base')
+model = MgpstrForSceneTextRecognition.from_pretrained('alibaba-damo/mgp-str-base')

 # load image from the IIIT-5k dataset
 url = "https://i.postimg.cc/ZKwLg2Gw/367-14.png"
 image = Image.open(requests.get(url, stream=True).raw).convert("RGB")

-pixel_values = processor(image, return_tensors="pt").pixel_values
-
-generated_text = processor.batch_decode(
+pixel_values = processor(images=image, return_tensors="pt").pixel_values
+outputs = model(pixel_values)
+
+generated_text = processor.batch_decode(outputs.logits)['generated_text']
 ```

 ### BibTeX entry and citation info
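For reference, the updated example reads end to end as follows. This is a consolidated sketch of the `+` side of the hunk above; the `torch.no_grad()` context and the final `print` are illustrative additions and are not part of the README.

```python
import requests
import torch
from PIL import Image
from transformers import MgpstrProcessor, MgpstrForSceneTextRecognition

# load the processor and model from the Hub
processor = MgpstrProcessor.from_pretrained('alibaba-damo/mgp-str-base')
model = MgpstrForSceneTextRecognition.from_pretrained('alibaba-damo/mgp-str-base')

# load image from the IIIT-5k dataset
url = "https://i.postimg.cc/ZKwLg2Gw/367-14.png"
image = Image.open(requests.get(url, stream=True).raw).convert("RGB")

# preprocess the image and run inference (no gradients needed)
pixel_values = processor(images=image, return_tensors="pt").pixel_values
with torch.no_grad():
    outputs = model(pixel_values)

# decode the logits into the recognized text
generated_text = processor.batch_decode(outputs.logits)['generated_text']
print(generated_text)
```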