Huan Sun

huansun

http://web.cse.ohio-state.edu/~sun.397/

huansun's activity

upvoted a paper 7 days ago

RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments

Paper • 2505.21936 • Published 14 days ago • 1

upvoted a paper 4 months ago

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published Feb 20 • 46

authored a paper 8 months ago

AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs

Paper • 2410.05295 • Published Oct 3, 2024 • 12

liked a Space 8 months ago

UGround

authored a paper 9 months ago

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark

Paper • 2409.02813 • Published Sep 4, 2024 • 32

upvoted a paper about 1 year ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23, 2024 • 42

authored a paper about 1 year ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23, 2024 • 42

authored a paper over 1 year ago

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 35

upvoted 2 papers over 1 year ago

Mind2Web: Towards a Generalist Agent for the Web

Paper • 2306.06070 • Published Jun 9, 2023 • 19

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 35

liked a model over 1 year ago

osunlp/TableLlama

Text Generation • Updated Dec 7, 2023 • 1.84k • 29

liked a dataset over 1 year ago

osunlp/TableInstruct

Preview • Updated Mar 22, 2024 • 284 • 27

upvoted a paper over 1 year ago

MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning

Paper • 2309.05653 • Published Sep 11, 2023 • 10

authored a paper almost 2 years ago

AgentBench: Evaluating LLMs as Agents

Paper • 2308.03688 • Published Aug 7, 2023 • 25

liked a dataset almost 2 years ago

osunlp/AttrScore

Viewer • Updated Jun 29, 2023 • 1.8M • 100 • 11

authored a paper almost 2 years ago

Mind2Web: Towards a Generalist Agent for the Web

Paper • 2306.06070 • Published Jun 9, 2023 • 19