Evaluating Agentic Search with Agent-as-a-Judge
AI & ML interests: Natural language processing, language models, language agents
models (42)

osunlp/WebJudge-7B • Image-Text-to-Text • 8B • Updated • 80 • 5
osunlp/SAE_BioCLIP_24K_ViT-B-16_iNat21 • Updated • 9
osunlp/UGround-V1-7B • Image-Text-to-Text • 8B • Updated • 3.7k • 17
osunlp/UGround • Image-Text-to-Text • 7B • Updated • 117 • 23
osunlp/Dreamer-7B-Classifieds • Image-Text-to-Text • 8B • Updated • 8 • 1
osunlp/Dreamer-7B-Shopping • Image-Text-to-Text • 8B • Updated • 12 • 1
osunlp/Dreamer-7B-Reddit • Image-Text-to-Text • 8B • Updated • 12 • 1
osunlp/Dreamer-7B • Image-Text-to-Text • 8B • Updated • 2.99k • 4
osunlp/Dreamer-72B • Image-Text-to-Text • 73B • Updated • 18 • 2
osunlp/UGround-V1-2B • Image-Text-to-Text • 2B • Updated • 2.12k • 8
datasets (18)

osunlp/Mind2Web-2 • Preview • Updated • 4
osunlp/AutoSDT-5K • Viewer • Updated • 5.15k • 152 • 3
osunlp/UGround-V1-Data-Box • Viewer • Updated • 488k • 772 • 5
osunlp/UGround-V1-Data • Viewer • Updated • 1.23M • 2.55k • 13
osunlp/Online-Mind2Web • Viewer • Updated • 300 • 377 • 10
osunlp/Dreamer-V1-Data • Viewer • Updated • 3.12M • 603 • 2
osunlp/HippoRAG_2 • Preview • Updated • 2.39k • 1
osunlp/ScienceAgentBench • Viewer • Updated • 102 • 766 • 15
osunlp/SMolInstruct • Updated • 1.34k • 38
osunlp/TravelPlanner • Viewer • Updated • 1.23k • 4.63k • 60