---
license: apache-2.0
datasets:
  - allenai/olmOCR-mix-0225
  - prithivMLmods/Opendoc1-Analysis-Recognition
  - prithivMLmods/Opendoc2-Analysis-Recognition
  - prithivMLmods/Openpdf-Analysis-Recognition
pipeline_tag: image-text-to-text
---


Training Details

| Parameter | Value |
|---|---|
| Dataset Size | 274,209 samples (Modular Combination of Datasets) |
| Model Architecture | `Qwen2_5_VLForConditionalGeneration` |
| Hardware | 2 × NVIDIA A100 SXM (32 vCPUs) |
| Total Disk | 170,000 MB |
| Training Time | 9,020 seconds (~2.51 hours) |
| Learning Rate | 1e-5 |
| Scheduler | Linear Decay |
| Warmup Steps | 750 |
| Precision | bfloat16 |
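
To make the learning-rate settings above concrete, here is a minimal sketch of a linear-decay schedule with warmup, using the peak learning rate (1e-5) and warmup steps (750) from the table. The total number of optimizer steps is not stated in this card, so `total_steps` is a hypothetical parameter, and the function name is illustrative. The sketch assumes linear warmup from zero followed by linear decay to zero, which is a common form of this schedule but not confirmed as the exact one used in training.

```python
def linear_warmup_decay_lr(
    step: int,
    total_steps: int,          # hypothetical: not given in the model card
    peak_lr: float = 1e-5,     # "Learning Rate" from the table
    warmup_steps: int = 750,   # "Warmup Steps" from the table
) -> float:
    """Return the learning rate at a given optimizer step.

    Assumed shape: linear ramp from 0 to peak_lr over the warmup steps,
    then linear decay from peak_lr down to 0 at total_steps.
    """
    if step < warmup_steps:
        # Warmup phase: scale linearly from 0 up to the peak.
        return peak_lr * step / max(1, warmup_steps)
    # Decay phase: scale linearly from the peak down to 0, clamped at 0.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Illustrative only: 10,000 total steps is an assumed value.
    for s in (0, 375, 750, 5375, 10_000):
        print(s, linear_warmup_decay_lr(s, total_steps=10_000))
```

With these assumptions, the rate reaches its peak exactly at step 750 and falls to zero at the final step; a different `total_steps` changes only the slope of the decay phase.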

The open image-text response dataset will be released soon.

References