Dataset and model for DocLayout-YOLO
zhiyuan zhao
juliozhao
AI & ML interests
Document Understanding; Large Multimodal Models
Recent Activity
upvoted
a
paper
2 months ago
MinerU2.5: A Decoupled Vision-Language Model for Efficient
High-Resolution Document Parsing
liked
a model
2 months ago
opendatalab/MinerU2.5-2509-1.2B