Sidewalk Models and datasets for sidewalk detection or segmentation tobiasc/segformer-b0-finetuned-segments-sidewalk Image Segmentation • 0.0B • Updated Mar 23, 2023 • 15 • 1
tobiasc/segformer-b0-finetuned-segments-sidewalk Image Segmentation • 0.0B • Updated Mar 23, 2023 • 15 • 1
OCR priyank-m/text_recognition_en_zh_clean Viewer • Updated Dec 16, 2022 • 1.4M • 109 • 4 priyank-m/MJSynth_text_recognition Viewer • Updated Jul 4, 2023 • 8.92M • 335 • 6 priyank-m/IAM_words_text_recognition Viewer • Updated Sep 7, 2022 • 115k • 90 • 6 priyank-m/trdg_wikipedia_en_text_recognition Viewer • Updated Mar 16 • 106k • 18 • 1
DPO dataset argilla/ultrafeedback-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 60.9k • 6k • 145 mlabonne/orpo-dpo-mix-40k Viewer • Updated Oct 17, 2024 • 44.2k • 701 • 285 zake7749/kyara-chinese-preference-rl-dpo-s0-30K Viewer • Updated Sep 7, 2024 • 30.2k • 24 • 3
Multimodal Dataset COT HuggingFaceM4/ChartQA Viewer • Updated Mar 5, 2024 • 32.7k • 6.67k • 44 Luckyjhg/Geo170K Viewer • Updated Feb 19 • 177k • 300 • 35
Multimodal LLM MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models Paper • 2408.02718 • Published Aug 5, 2024 • 62 LLaVA-OneVision: Easy Visual Task Transfer Paper • 2408.03326 • Published Aug 6, 2024 • 61
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models Paper • 2408.02718 • Published Aug 5, 2024 • 62
Sidewalk Models and datasets for sidewalk detection or segmentation tobiasc/segformer-b0-finetuned-segments-sidewalk Image Segmentation • 0.0B • Updated Mar 23, 2023 • 15 • 1
tobiasc/segformer-b0-finetuned-segments-sidewalk Image Segmentation • 0.0B • Updated Mar 23, 2023 • 15 • 1
Multimodal Dataset COT HuggingFaceM4/ChartQA Viewer • Updated Mar 5, 2024 • 32.7k • 6.67k • 44 Luckyjhg/Geo170K Viewer • Updated Feb 19 • 177k • 300 • 35
OCR priyank-m/text_recognition_en_zh_clean Viewer • Updated Dec 16, 2022 • 1.4M • 109 • 4 priyank-m/MJSynth_text_recognition Viewer • Updated Jul 4, 2023 • 8.92M • 335 • 6 priyank-m/IAM_words_text_recognition Viewer • Updated Sep 7, 2022 • 115k • 90 • 6 priyank-m/trdg_wikipedia_en_text_recognition Viewer • Updated Mar 16 • 106k • 18 • 1
Multimodal LLM MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models Paper • 2408.02718 • Published Aug 5, 2024 • 62 LLaVA-OneVision: Easy Visual Task Transfer Paper • 2408.03326 • Published Aug 6, 2024 • 61
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models Paper • 2408.02718 • Published Aug 5, 2024 • 62
DPO dataset argilla/ultrafeedback-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 60.9k • 6k • 145 mlabonne/orpo-dpo-mix-40k Viewer • Updated Oct 17, 2024 • 44.2k • 701 • 285 zake7749/kyara-chinese-preference-rl-dpo-s0-30K Viewer • Updated Sep 7, 2024 • 30.2k • 24 • 3