view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! 5 days ago • 34
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents Paper • 2506.03143 • Published 7 days ago • 44
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 21 days ago • 143
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • 8 days ago • 44
view article Article CodeAgents + Structure: A Better Way to Execute Actions By akseljoonas and 1 other • 14 days ago • 51
view article Article Everything You Need to Know about Knowledge Distillation By Kseniase and 1 other • Mar 6 • 26