Avi66
AI & ML interests
ML Research, LLMs, Applications
MultiModality
Recent Activity
Updated a collection about 1 month ago: TTS
Updated a collection about 1 month ago: Papers
Updated a collection about 2 months ago: Papers
Organizations
Vlm
- XiaomiMiMo/MiMo-VL-7B-RL
  Image-Text-to-Text • 8B • Updated • 17.8k • 161
- mradermacher/Janus-Pro-7B-LM-GGUF
  7B • Updated • 1k • 36
- deepseek-ai/deepseek-vl2-tiny
  Image-Text-to-Text • 3B • Updated • 58.6k • 217
- RedHatAI/Llama-3.2-11B-Vision-Instruct-FP8-dynamic
  Text Generation • 11B • Updated • 1.33k • 24
Spaces
Papers
- ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
  Paper • 2504.11536 • Published • 62
- Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
  Paper • 2505.24726 • Published • 273
- Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
  Paper • 2503.12605 • Published • 35
- MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
  Paper • 2506.13585 • Published • 266
Tamil llm
TTS