Series model of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"
Yi Ding
Tuwhy
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
8 days ago
The Entropy Mechanism of Reinforcement Learning for Reasoning Language
Models
upvoted
a
paper
8 days ago
Skywork Open Reasoner 1 Technical Report
Organizations
Collections
3
models
6

Tuwhy/Llama-3.2V-11B-Sherlock-iter2
Image-Text-to-Text
•
Updated
•
48
•
1

Tuwhy/Llama-3.2V-11B-Sherlock-SFT
Image-Text-to-Text
•
Updated
•
14

Tuwhy/Llama-3.2V-11B-Sherlock-Offline
Image-Text-to-Text
•
Updated
•
15

Tuwhy/Llama-3.2V-11B-Sherlock-iter1
Image-Text-to-Text
•
Updated
•
10

Tuwhy/InternVL2.5-8B-MIRage
Image-Text-to-Text
•
Updated
•
4
•
1

Tuwhy/Qwen2-VL-7B-MIRage
Image-Text-to-Text
•
Updated
•
10
•
1