Post
118
The POINTS-Reader, a vision-language model for end-to-end document conversion, is a powerful, distillation-free Vision-Language Model that sets new SoTA benchmarks. The demo is now available on HF (Extraction, Preview, Documentation). The input consists of a fixed prompt and a document image, while the output contains only a string (the text extracted from the document image). 🔥🤗
✦ Space/App: prithivMLmods/POINTS-Reader-OCR (Going live soon)
↗️ Model: tencent/POINTS-Reader
🤗 The app is done and ready to go brrrr with zero GPU. Thankyou @merve
.
.
.
To know more about it, visit the app page or the respective model page!!
✦ Space/App: prithivMLmods/POINTS-Reader-OCR (Going live soon)
↗️ Model: tencent/POINTS-Reader
🤗 The app is done and ready to go brrrr with zero GPU. Thankyou @merve
.
.
.
To know more about it, visit the app page or the respective model page!!