Hungarian PDF pages from Common Crawl. Annotated with synthetic QAs by Llama 3.3 70B.
Jonathan Li
jlli
AI & ML interests
None yet
Recent Activity
updated
a collection
14 days ago
Hungarian Document Datasets
updated
a collection
14 days ago
Hungarian Document Datasets
updated
a collection
14 days ago
Hungarian Document Datasets
Organizations
Collections
1
models
0
None public yet
datasets
10
jlli/HuDocVQA
Viewer
•
Updated
•
22.4k
•
76
jlli/HuDocVQA-manual
Viewer
•
Updated
•
54
•
50
jlli/HuCCPDF
Viewer
•
Updated
•
113k
•
255
jlli/Hungarian_CCPDF_SynQA_v2
Viewer
•
Updated
•
24.3k
•
73
•
1
jlli/Hungarian_CCPDF_SynQA
Viewer
•
Updated
•
19.3k
•
69
jlli/SynthDog_hu2
Viewer
•
Updated
•
40k
•
32
jlli/JDocQA-binary
Viewer
•
Updated
•
1.38k
•
38
jlli/JDocQA-nonbinary
Viewer
•
Updated
•
7.54k
•
37
jlli/HungarianDocQA-OCR
Viewer
•
Updated
•
54
•
49
•
1
jlli/SynthDog_hu
Viewer
•
Updated
•
20.5k
•
34