-
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
Paper • 2504.14239 • Published • 13 -
Reallm-Labs/InfiGUI-R1-3B
Image-Text-to-Text • 4B • Updated • 1.19k • 3 -
Reallm-Labs/android_control_train
Viewer • Updated • 13.6k • 53 -
Reallm-Labs/android_control_test
Updated • 35
AI & ML interests
None defined yet.
Recent Activity
-
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
Paper • 2504.14239 • Published • 13 -
Reallm-Labs/InfiGUI-R1-3B
Image-Text-to-Text • 4B • Updated • 1.19k • 3 -
Reallm-Labs/android_control_train
Viewer • Updated • 13.6k • 53 -
Reallm-Labs/android_control_test
Updated • 35
The comprehensive model fusion strategies, including SFT fusion, DPO fusion, and new merging.
models
7

Reallm-Labs/Infi-MMR-3B
4B
•
Updated
•
10

Reallm-Labs/InfiGFusion-14B
Updated

Reallm-Labs/InfiFusion-14B
Updated
•
1

Reallm-Labs/InfiGUI-R1-3B
Image-Text-to-Text
•
4B
•
Updated
•
1.19k
•
3

Reallm-Labs/InfiR-1B-Instruct
1B
•
Updated
•
5
•
2

Reallm-Labs/InfiR-1B-Base
1B
•
Updated
•
8
•
2

Reallm-Labs/InfiGUIAgent-2B-Stage1
Image-Text-to-Text
•
2B
•
Updated
•
31
•
2