Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Boxi Yu's picture
2 2

Boxi Yu

Bertsekas
·
https://boxiyu.github.io/
  • BoshCavendish
  • BoxiYu

AI & ML interests

Coding Agent, Automated Operator

Recent Activity

authored a paper 3 days ago
How Should I Build A Benchmark? Revisiting Code-Related Benchmarks For LLMs
authored a paper 3 days ago
UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
upvoted a paper 4 days ago
UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
View all activity

Organizations

None yet

Bertsekas 's datasets 2

Bertsekas/SWE-Bench_Lite_UTBoost

Viewer • Updated 6 days ago • 300

Bertsekas/SWE-Bench_Verified_UTBoost

Viewer • Updated 6 days ago • 500
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs