Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Ruochen Xu's picture

1 4 5

Ruochen Xu

ruochenx

tianchez's profile picture

Liaojiajia's profile picture

kyusonglee's profile picture

·

xrc10
ruochenx

AI & ML interests

NLP, Multimodal

Organizations

ruochenx 's collections 6

Models and datasets for sidewalk detection or segmentation

tobiasc/segformer-b0-finetuned-segments-sidewalk

Image Segmentation • 0.0B • Updated Mar 23, 2023 • 15 • 1

priyank-m/text_recognition_en_zh_clean

Viewer • Updated Dec 16, 2022 • 1.4M • 109 • 4
priyank-m/MJSynth_text_recognition

Viewer • Updated Jul 4, 2023 • 8.92M • 335 • 6
priyank-m/IAM_words_text_recognition

Viewer • Updated Sep 7, 2022 • 115k • 90 • 6
priyank-m/trdg_wikipedia_en_text_recognition

Viewer • Updated Mar 16 • 106k • 18 • 1

argilla/ultrafeedback-binarized-preferences-cleaned

Viewer • Updated Dec 11, 2023 • 60.9k • 6k • 145
mlabonne/orpo-dpo-mix-40k

Viewer • Updated Oct 17, 2024 • 44.2k • 701 • 285
zake7749/kyara-chinese-preference-rl-dpo-s0-30K

Viewer • Updated Sep 7, 2024 • 30.2k • 24 • 3

Multimodal Dataset COT

HuggingFaceM4/ChartQA

Viewer • Updated Mar 5, 2024 • 32.7k • 6.67k • 44
Luckyjhg/Geo170K

Viewer • Updated Feb 19 • 177k • 300 • 35

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Paper • 2408.02718 • Published Aug 5, 2024 • 62
LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6, 2024 • 61

priyank-m/chinese_text_recognition

Viewer • Updated Sep 21, 2022 • 500k • 205 • 25

Models and datasets for sidewalk detection or segmentation

tobiasc/segformer-b0-finetuned-segments-sidewalk

Image Segmentation • 0.0B • Updated Mar 23, 2023 • 15 • 1

Multimodal Dataset COT

HuggingFaceM4/ChartQA

Viewer • Updated Mar 5, 2024 • 32.7k • 6.67k • 44
Luckyjhg/Geo170K

Viewer • Updated Feb 19 • 177k • 300 • 35

priyank-m/text_recognition_en_zh_clean

Viewer • Updated Dec 16, 2022 • 1.4M • 109 • 4
priyank-m/MJSynth_text_recognition

Viewer • Updated Jul 4, 2023 • 8.92M • 335 • 6
priyank-m/IAM_words_text_recognition

Viewer • Updated Sep 7, 2022 • 115k • 90 • 6
priyank-m/trdg_wikipedia_en_text_recognition

Viewer • Updated Mar 16 • 106k • 18 • 1

MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models

Paper • 2408.02718 • Published Aug 5, 2024 • 62
LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6, 2024 • 61

argilla/ultrafeedback-binarized-preferences-cleaned

Viewer • Updated Dec 11, 2023 • 60.9k • 6k • 145
mlabonne/orpo-dpo-mix-40k

Viewer • Updated Oct 17, 2024 • 44.2k • 701 • 285
zake7749/kyara-chinese-preference-rl-dpo-s0-30K

Viewer • Updated Sep 7, 2024 • 30.2k • 24 • 3

priyank-m/chinese_text_recognition

Viewer • Updated Sep 21, 2022 • 500k • 205 • 25

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs