Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity
Zhang Xingjian
Zhang199
AI & ML interests
Large Multimodal Models
Recent Activity
new activity
2 days ago
Zhang199/TinyLLaVA-Video-R1:Extend length of video which can be processed?
updated
a model
13 days ago
Zhang199/TinyLLaVA-Qwen2-0.5B-SigLIP
updated
a model
24 days ago
Zhang199/EDGE-GRPO-Qwen-1.5B
Organizations
None yet