Hugging Face
testcase-eval
non-profit
AI & ML interests
None defined yet.
Recent Activity
yilunzhao authored a paper 3 days ago
SciVer: Evaluating Foundation Models for Multimodal Scientific Claim Verification
yilunzhao authored a paper 4 days ago
Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure
yilunzhao authored a paper 4 days ago
MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation
Team members: 2
testcase-evaluate's models
None public yet