Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

xbench

community
https://xbench.org/
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Lucky2022  authored a paper 6 days ago
xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations
lyangpku  updated a dataset 7 days ago
xbench/ScienceQA
lyangpku  updated a dataset 7 days ago
xbench/DeepSearch
View all activity

x's profile picture Dookie's profile picture Kaiyuan Chen's profile picture Xiaobo Hu's profile picture

Lucky2022 
authored a paper 6 days ago

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

Paper • 2506.13651 • Published 9 days ago • 9
lyangpku 
updated 2 datasets 7 days ago

xbench/ScienceQA

Viewer • Updated 7 days ago • 100 • 228 • 7

xbench/DeepSearch

Viewer • Updated 7 days ago • 100 • 305 • 5
lyangpku 
published 2 datasets 29 days ago

xbench/DeepSearch

Viewer • Updated 7 days ago • 100 • 305 • 5

xbench/ScienceQA

Viewer • Updated 7 days ago • 100 • 228 • 7
Lucky2022 
authored 2 papers 4 months ago

CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding

Paper • 2405.02384 • Published May 3, 2024

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective

Paper • 2502.17262 • Published Feb 24 • 21
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs