Chaoyue Tang's picture

Chaoyue Tang

tcy6

·

tcy6
tcy6

AI & ML interests

Multimodal

Recent Activity

upvoted a collection about 1 month ago

updated a dataset about 2 months ago

openbmb/VisRAG-Ret-Test-SlideVQA

updated a dataset about 2 months ago

openbmb/VisRAG-Ret-Test-PlotQA

View all activity

Organizations

tcy6's activity

upvoted a collection about 1 month ago

DeepSeek-R1

8 items • Updated Jan 21 • 624

upvoted a collection 3 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 7 days ago • 460

upvoted a paper 5 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 155

upvoted a paper 7 months ago

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Paper • 2410.10594 • Published Oct 14, 2024 • 27

upvoted a collection 7 months ago

MiniCPM RAG Suite

Embedding, re-ranking, generation -- the cornerstone of RAG. • 6 items • Updated Mar 3 • 12

upvoted 3 papers 7 months ago

RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning

Paper • 2409.14674 • Published Sep 23, 2024 • 44

A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?

Paper • 2409.15277 • Published Sep 23, 2024 • 38

Phantom of Latent for Large Language and Vision Models

Paper • 2409.14713 • Published Sep 23, 2024 • 30

upvoted 2 papers 8 months ago

CodeRAG-Bench: Can Retrieval Augment Code Generation?

Paper • 2406.14497 • Published Jun 20, 2024 • 2

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Paper • 2409.03810 • Published Sep 5, 2024 • 36