-
38
Llama 3.2V 11B Cot
💬Generate descriptions and answers by combining text and images
-
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text • Updated • 6.16k • 148 -
Xkev/LLaVA-CoT-100k
Viewer • Updated • 98.6k • 2.42k • 80 -
LLaVA-o1: Let Vision Language Models Reason Step-by-Step
Paper • 2411.10440 • Published • 122
Guowei Xu
Xkev
AI & ML interests
None yet
Recent Activity
updated
a Space
10 days ago
Xkev/Llama-3.2V-11B-cot
new activity
17 days ago
Xkev/LLaVA-CoT-100k:Greetings! I have made a R1 format fork of this dataset!
upvoted
a
paper
about 1 month ago
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth
Approach
Organizations
None yet