Leng Sicong PRO

Sicong

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

Through the Valley: Path to Effective Long CoT Training for Small Language Models

upvoted a paper 17 days ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

updated a dataset 23 days ago

Sicong/caption_rl

liked a dataset 4 months ago

TIGER-Lab/VisualWebInstruct

Viewer • Updated Apr 10 • 1.91M • 1.3k • 34

liked a Space 5 months ago

MiniMaxVL01

💬

Generate responses using text and images

liked 3 datasets 8 months ago

liked a model 10 months ago

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • 4B • Updated Sep 26, 2024 • 1.07M • 690

liked a Space 11 months ago

550

Vision Arena (Testing VLMs side-by-side)

🖼

Analyze images to detect and label objects

liked a Space 12 months ago

691

Qwen2 72B Instruct

💻

Chat with Qwen2-72B-instruct using a system prompt

liked a model 12 months ago

internlm/internlm-xcomposer2d5-7b

Visual Question Answering • Updated Jul 22, 2024 • 2.75k • 206

liked a Space about 1 year ago

149

VideoLLaMA2

🎥

Media understanding

liked a dataset about 1 year ago

DAMO-NLP-SG/Multi-Source-Video-Captioning

Viewer • Updated Jun 17, 2024 • 1.5k • 63 • 8

liked a Space about 1 year ago

191

Video LLaMA

🚀

Upload a video or image to get conversational explanations

liked a model about 1 year ago

DAMO-NLP-SG/VideoLLaMA2-7B

Visual Question Answering • 8B • Updated Aug 13, 2024 • 2.79k • 41

liked a dataset over 1 year ago

OpenGVLab/InternVid

Viewer • Updated Aug 13, 2024 • 21.3M • 421 • 81

liked a model over 1 year ago

TinyLlama/TinyLlama-1.1B-Chat-v1.0

Text Generation • 1B • Updated Mar 17, 2024 • 1.08M • 1.31k