YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Tasks | Version | Filter | n-shot | Metric | Value | Stderr | |
---|---|---|---|---|---|---|---|
arc_challenge | 1 | none | 25 | acc | 0.1809 | ± | 0.0112 |
none | 25 | acc_norm | 0.2201 | ± | 0.0121 | ||
truthfulqa_mc2 | 2 | none | 0 | acc | 0.4543 | ± | 0.0154 |
winogrande | 1 | none | 5 | acc | 0.5154 | ± | 0.014 |
hellaswag | 1 | none | 10 | acc | 0.2822 | ± | 0.0045 |
none | 10 | acc_norm | 0.3009 | ± | 0.0046 |
0.26024912280701756
Tasks | Version | Filter | n-shot | Metric | Value | Stderr | |
---|---|---|---|---|---|---|---|
abstract_algebra | 0 | none | 5 | acc | 0.3100 | ± | 0.0465 |
anatomy | 0 | none | 5 | acc | 0.2667 | ± | 0.0382 |
astronomy | 0 | none | 5 | acc | 0.1776 | ± | 0.0311 |
business_ethics | 0 | none | 5 | acc | 0.2200 | ± | 0.0416 |
clinical_knowledge | 0 | none | 5 | acc | 0.2528 | ± | 0.0267 |
college_biology | 0 | none | 5 | acc | 0.2153 | ± | 0.0344 |
college_chemistry | 0 | none | 5 | acc | 0.2300 | ± | 0.0423 |
college_computer_science | 0 | none | 5 | acc | 0.3400 | ± | 0.0476 |
college_mathematics | 0 | none | 5 | acc | 0.3200 | ± | 0.0469 |
college_medicine | 0 | none | 5 | acc | 0.2370 | ± | 0.0324 |
college_physics | 0 | none | 5 | acc | 0.1961 | ± | 0.0395 |
computer_security | 0 | none | 5 | acc | 0.2700 | ± | 0.0446 |
conceptual_physics | 0 | none | 5 | acc | 0.2383 | ± | 0.0279 |
econometrics | 0 | none | 5 | acc | 0.2982 | ± | 0.0430 |
electrical_engineering | 0 | none | 5 | acc | 0.2552 | ± | 0.0363 |
elementary_mathematics | 0 | none | 5 | acc | 0.2513 | ± | 0.0223 |
formal_logic | 0 | none | 5 | acc | 0.1667 | ± | 0.0333 |
global_facts | 0 | none | 5 | acc | 0.1600 | ± | 0.0368 |
high_school_biology | 0 | none | 5 | acc | 0.3000 | ± | 0.0261 |
high_school_chemistry | 0 | none | 5 | acc | 0.2167 | ± | 0.0290 |
high_school_computer_science | 0 | none | 5 | acc | 0.2300 | ± | 0.0423 |
high_school_european_history | 0 | none | 5 | acc | 0.2242 | ± | 0.0326 |
high_school_geography | 0 | none | 5 | acc | 0.3283 | ± | 0.0335 |
high_school_government_and_politics | 0 | none | 5 | acc | 0.3627 | ± | 0.0347 |
high_school_macroeconomics | 0 | none | 5 | acc | 0.3513 | ± | 0.0242 |
high_school_mathematics | 0 | none | 5 | acc | 0.2630 | ± | 0.0268 |
high_school_microeconomics | 0 | none | 5 | acc | 0.3067 | ± | 0.0300 |
high_school_physics | 0 | none | 5 | acc | 0.2583 | ± | 0.0357 |
high_school_psychology | 0 | none | 5 | acc | 0.3174 | ± | 0.0200 |
high_school_statistics | 0 | none | 5 | acc | 0.4722 | ± | 0.0340 |
high_school_us_history | 0 | none | 5 | acc | 0.2353 | ± | 0.0298 |
high_school_world_history | 0 | none | 5 | acc | 0.2616 | ± | 0.0286 |
human_aging | 0 | none | 5 | acc | 0.2108 | ± | 0.0274 |
human_sexuality | 0 | none | 5 | acc | 0.2977 | ± | 0.0401 |
international_law | 0 | none | 5 | acc | 0.2645 | ± | 0.0403 |
jurisprudence | 0 | none | 5 | acc | 0.2130 | ± | 0.0396 |
logical_fallacies | 0 | none | 5 | acc | 0.2331 | ± | 0.0332 |
machine_learning | 0 | none | 5 | acc | 0.2857 | ± | 0.0429 |
management | 0 | none | 5 | acc | 0.1748 | ± | 0.0376 |
marketing | 0 | none | 5 | acc | 0.1838 | ± | 0.0254 |
medical_genetics | 0 | none | 5 | acc | 0.3000 | ± | 0.0461 |
miscellaneous | 0 | none | 5 | acc | 0.2720 | ± | 0.0159 |
moral_disputes | 0 | none | 5 | acc | 0.2457 | ± | 0.0232 |
moral_scenarios | 0 | none | 5 | acc | 0.2391 | ± | 0.0143 |
nutrition | 0 | none | 5 | acc | 0.2255 | ± | 0.0239 |
philosophy | 0 | none | 5 | acc | 0.1961 | ± | 0.0226 |
prehistory | 0 | none | 5 | acc | 0.2284 | ± | 0.0234 |
professional_accounting | 0 | none | 5 | acc | 0.2553 | ± | 0.0260 |
professional_law | 0 | none | 5 | acc | 0.2458 | ± | 0.0110 |
professional_medicine | 0 | none | 5 | acc | 0.4485 | ± | 0.0302 |
professional_psychology | 0 | none | 5 | acc | 0.2516 | ± | 0.0176 |
public_relations | 0 | none | 5 | acc | 0.2727 | ± | 0.0427 |
security_studies | 0 | none | 5 | acc | 0.3551 | ± | 0.0306 |
sociology | 0 | none | 5 | acc | 0.2587 | ± | 0.0310 |
us_foreign_policy | 0 | none | 5 | acc | 0.2100 | ± | 0.0409 |
virology | 0 | none | 5 | acc | 0.2229 | ± | 0.0324 |
world_religions | 0 | none | 5 | acc | 0.2105 | ± | 0.0313 |
- Downloads last month
- 18
Inference API (serverless) does not yet support model repos that contain custom code.