Collection of data from public OpenReview reviews
Sumuk Shashidhar PRO
sumuks
AI & ML interests
Evaluations, Reasoning, Long Term Planning
Recent Activity
Updated a dataset 29 minutes ago: sumuks/yourbench_mmlu_astronomy_reporduction
Published a dataset about 7 hours ago: sumuks/yourbench_mmlu_astronomy_reporduction
Updated a dataset about 7 hours ago: sumuks/yourbench_mmlu_reporduction
Organizations
Collections 4
Papers 2
Spaces 1
Models 15

sumuks/purple-wintermute-0.2-7b • 80
sumuks/purple-wintermute-0.2-72b • 4
sumuks/purple-wintermute-0.1-7b • 13
sumuks/qwen2.5-7b-utility-evaluator-r128
sumuks/qwen2.5-72b-utility-evaluator-r128
sumuks/qwen2.5-72b-mvp-1-openreviewer • 2
sumuks/qwen2.5-7b-utility-evaluator-r256-test
sumuks/qwen2.5-72b-openreviewer-mvp-1-full-review-r128 • 13
sumuks/qwen2.5-7b-idea-review-mvp-1 • 11
sumuks/full_review • 6
Datasets 118
sumuks/yourbench_mmlu_astronomy_reporduction • 509
sumuks/yourbench_mmlu_reporduction • 43
sumuks/tempora_yourbench_traces • 603k • 318
sumuks/yourbench-wizard-example • 4 • 28
sumuks/yourbench-example-v4 • 76 • 22
sumuks/yourbench-example-v3 • 74 • 25
sumuks/yourbench-example-test • 76 • 30
sumuks/tempora_yourbench_traces_lighteval • 151k • 28
sumuks/yourbench-example • 36 • 29
sumuks/yb_tempora_experiments • 53.1k • 93