ceval
community
https://cevalbenchmark.com
Followers: 19
AI & ML interests
We focus on Chinese-language evaluation of foundation models.
Recent Activity
yuzhen17 authored a paper 8 days ago: Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning
jxhe authored a paper 9 days ago: SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
yuzhen17 authored a paper 9 days ago: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
Team members: 2
Models: 0 (none public yet)
Datasets: 1
ceval/ceval-exam • Updated Mar 25 • 13.9k • 14.4k • 264