Dongchan Shin
ShinDC
AI & ML interests
NLP
Recent Activity
upvoted
a
paper
about 1 month ago
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent
Trajectories
upvoted
a
paper
about 1 month ago
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real
Computer Environments
upvoted
a
paper
about 1 month ago
Spider 2.0: Evaluating Language Models on Real-World Enterprise
Text-to-SQL Workflows
Organizations
ShinDC's activity
No public activity