1
TongUI
💬
Identify and mark clickable elements on screenshots based on queries
Open source our work TongUI: Building Generalized GUI Agents by Learning from Multimodal Web Tutorials; https://github.com/TongUI-agent/TongUI-agent
Identify and mark clickable elements on screenshots based on queries