Evaluating Agentic Search with Agent-as-a-Judge
AI & ML interests
Natural language processing, language models, language agents
Recent Activity
models
44
osunlp/GUI-Drag-7B
8B
•
Updated
•
21
•
1
osunlp/GUI-Drag-3B
4B
•
Updated
•
21
•
1
osunlp/WebJudge-7B
Image-Text-to-Text
•
8B
•
Updated
•
125
•
7
osunlp/SAE_BioCLIP_24K_ViT-B-16_iNat21
Updated
•
17
•
1
osunlp/UGround-V1-7B
Image-Text-to-Text
•
8B
•
Updated
•
590
•
19
osunlp/UGround
Image-Text-to-Text
•
7B
•
Updated
•
305
•
24
osunlp/Dreamer-7B-Classifieds
Image-Text-to-Text
•
8B
•
Updated
•
11
•
1
osunlp/Dreamer-7B-Shopping
Image-Text-to-Text
•
8B
•
Updated
•
16
•
1
osunlp/Dreamer-7B-Reddit
Image-Text-to-Text
•
8B
•
Updated
•
14
•
1
osunlp/Dreamer-7B
Image-Text-to-Text
•
8B
•
Updated
•
21
•
5
datasets
20
osunlp/Online-Mind2Web
Viewer
•
Updated
•
300
•
386
•
17
osunlp/Mind2Web-2
Viewer
•
Updated
•
130
•
218
•
14
osunlp/Mind2Web
Viewer
•
Updated
•
253
•
1.78k
•
114
osunlp/GUI-Drag-dataset
Preview
•
Updated
•
44
•
1
osunlp/WebGuard
Viewer
•
Updated
•
6k
•
71
•
2
osunlp/AutoSDT-5K
Viewer
•
Updated
•
5.15k
•
130
•
4
osunlp/UGround-V1-Data-Box
Viewer
•
Updated
•
488k
•
1.5k
•
9
osunlp/UGround-V1-Data
Viewer
•
Updated
•
1.23M
•
4.57k
•
21
osunlp/Dreamer-V1-Data
Viewer
•
Updated
•
3.12M
•
1.08k
•
3
osunlp/HippoRAG_2
Preview
•
Updated
•
348
•
4