OCR detection gets worse with cropped images

#49

by Asdmkasomo - opened 16 days ago

16 days ago

Hi all, asking for some help here. We are trying to recognise text from a document, but when we crop the document, GOT OCR can no longer detect the words.
It detects them fine on the original, uncropped image.
Is there a size we should resize the cropped image to so that it matches what the model expects?

Thaipitak

14 days ago

Think about it this way: when you crop the image, the number of pixels is reduced. By the time the crop has gone through the model's feature extraction, some features are lost, which makes detection fail. Try resizing (upscaling) the cropped image before passing it to the model; even nearest-neighbor resizing is fine. The model should then be able to detect the text, since it has been trained on characters of various sizes.
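As a rough sketch of that suggestion, the snippet below upscales a crop with Pillow using nearest-neighbor resampling while keeping the aspect ratio. The helper names `upscaled_size` and `upscale_crop` are made up for illustration, and the `target_long_side=1024` default is only an assumption, not a value confirmed in this thread; adjust it to whatever works for your documents.

```python
def upscaled_size(width, height, target_long_side=1024):
    """Return (new_width, new_height) so the longer side equals target_long_side.

    Aspect ratio is preserved. Images whose longer side is already at least
    target_long_side are returned unchanged (this sketch never downscales).
    target_long_side=1024 is an assumed default, not a documented requirement.
    """
    long_side = max(width, height)
    if long_side >= target_long_side:
        return width, height
    scale = target_long_side / long_side
    return max(1, round(width * scale)), max(1, round(height * scale))


def upscale_crop(path_in, path_out, target_long_side=1024):
    """Upscale the cropped image at path_in and save the result to path_out."""
    # Pillow is imported here so the size helper above has no dependencies.
    from PIL import Image

    with Image.open(path_in) as img:
        new_size = upscaled_size(img.width, img.height, target_long_side)
        # Nearest-neighbor, as suggested above; other filters could be tried too.
        img.resize(new_size, resample=Image.NEAREST).save(path_out)
```

You would then run OCR on the saved file instead of the raw crop, for example after `upscale_crop("crop.png", "crop_upscaled.png")` (both file names are placeholders).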
