NAMAA-Space
/

Qari-OCR-0.3-SNAPSHOT-VL-2B-Instruct-merged

@@ -1,13 +1,13 @@
 ---
 library_name: transformers
-tags: []
 ---
 # Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->
 ## Model Details
@@ -17,20 +17,20 @@ tags: []
 This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
-- **Developed by:** [More Information Needed]
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
-- **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
 - **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
 ### Model Sources [optional]
 <!-- Provide the basic links for the model. -->
 - **Repository:** [More Information Needed]
-- **Paper [optional]:** [More Information Needed]
 - **Demo [optional]:** [More Information Needed]
 ## Uses
@@ -41,7 +41,7 @@ This is the model card of a 🤗 transformers model that has been pushed on the
 <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
-[More Information Needed]
 ### Downstream Use [optional]
@@ -53,7 +53,7 @@ This is the model card of a 🤗 transformers model that has been pushed on the
 <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
-[More Information Needed]
 ## Bias, Risks, and Limitations
@@ -79,7 +79,7 @@ Use the code below to get started with the model.
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
-[More Information Needed]
 ### Training Procedure
@@ -174,7 +174,16 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
 **BibTeX:**
-[More Information Needed]
 **APA:**
@@ -196,15 +205,4 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
 ## Model Card Contact
-[More Information Needed]
-```
-@misc{QariOCR2025,
-  title={QARI-OCR: High-Fidelity Arabic Text Recognition through Multimodal Large Language Model Adaptation},
-  author={Ahmed Wasfy, Omer Nacar, Abdelakreem Elkhateb, Mahmoud Reda, Omar Elshehy, Adel Ammar, Wadii Boulila},
-  year={2025},
-  archivePrefix={arXiv},
-  url={https://arxiv.org/abs/2506.02295},
-  note={Accessed: 2025-03-03}
-}
-```

 ---
 library_name: transformers
+tags:
+- image-to-text
 ---
 # Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->
+This model is designed for Arabic Optical Character Recognition (OCR).
 ## Model Details
 This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
+- **Developed by:** Ahmed Wasfy, Omer Nacar, Abdelakreem Elkhateb, Mahmoud Reda, Omar Elshehy, Adel Ammar, Wadii Boulila
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
+- **Model type:** Vision-Language Model for OCR
+- **Language(s) (NLP):** Arabic
 - **License:** [More Information Needed]
+- **Finetuned from model [optional]:** Qwen2-VL-2B-Instruct
 ### Model Sources [optional]
 <!-- Provide the basic links for the model. -->
 - **Repository:** [More Information Needed]
+- **Paper:** [QARI-OCR: High-Fidelity Arabic Text Recognition through Multimodal Large Language Model Adaptation](https://huggingface.co/papers/2506.02295)
 - **Demo [optional]:** [More Information Needed]
 ## Uses
 <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+This model can be directly used for recognizing Arabic text in images.
 ### Downstream Use [optional]
 <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+This model is specifically designed for Arabic text and might not perform well on other languages.
 ## Bias, Risks, and Limitations
 <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+Trained on specialized synthetic datasets.
 ### Training Procedure
 **BibTeX:**
+```
+@misc{QariOCR2025,
+  title={QARI-OCR: High-Fidelity Arabic Text Recognition through Multimodal Large Language Model Adaptation},
+  author={Ahmed Wasfy, Omer Nacar, Abdelakreem Elkhateb, Mahmoud Reda, Omar Elshehy, Adel Ammar, Wadii Boulila},
+  year={2025},
+  archivePrefix={arXiv},
+  url={https://arxiv.org/abs/2506.02295},
+  note={Accessed: 2025-03-03}
+}
+```
 **APA:**
 ## Model Card Contact
+[More Information Needed]