Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Ai2
Team
non-profit
Verified
https://allenai.org/
allen_ai
allenai
Activity Feed
Follow
4,084
AI & ML interests
Building breatkthrough AI to solve the world's biggest problems.
Recent Activity
valpy
authored
a paper
2 months ago
2 OLMo 2 Furious
valpy
authored
a paper
2 months ago
IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance
valpy
authored
a paper
2 months ago
RewardBench 2: Advancing Reward Model Evaluation
View all activity
Articles
Introducing the Open Chain of Thought Leaderboard
Apr 23, 2024
•
35
Team members
189
+155
+142
+121
+111
+91
allenai
's datasets
284
Sort: Recently updated
allenai/PRISM
Viewer
•
Updated
Jun 7
•
412k
•
235
•
5
allenai/SimpleToM-rich
Viewer
•
Updated
Jun 7
•
4.59k
•
65
•
1
allenai/reward-bench-2
Viewer
•
Updated
Jun 4
•
1.87k
•
13.6k
•
24
allenai/sciriff-yesno
Viewer
•
Updated
Jun 3
•
2.24k
•
643
allenai/blog-images
Viewer
•
Updated
Jun 2
•
2
•
27.7k
allenai/WildChat-4M-Full
Updated
May 30
•
6
allenai/WildChat-4M
Updated
May 30
•
7
•
2
allenai/qasper-yesno
Viewer
•
Updated
May 29
•
649
•
526
allenai/olmOCR-pes2o-0225
Viewer
•
Updated
May 16
•
7.87M
•
106
•
4
allenai/discoverybench
Viewer
•
Updated
May 10
•
264
•
281
•
12
allenai/reward-bench-results
Updated
May 7
•
2.97k
•
3
allenai/DataDecide-data-recipes
Updated
May 6
•
1k
•
8
allenai/olmo-2-0425-1b-preference-mix
Viewer
•
Updated
Apr 30
•
378k
•
49
•
4
allenai/DataDecide-eval-results
Viewer
•
Updated
Apr 16
•
1.41M
•
236
•
4
allenai/sqa_reranking_eval
Viewer
•
Updated
Apr 15
•
2.43k
•
25
•
2
allenai/tulu-3-do-anything-now-eval
Viewer
•
Updated
Apr 11
•
300
•
28
•
1
allenai/tulu-3-harmbench-eval
Viewer
•
Updated
Apr 11
•
320
•
28
allenai/tulu-3-trustllm-jailbreaktrigger-eval
Viewer
•
Updated
Apr 11
•
400
•
22
allenai/super
Viewer
•
Updated
Mar 21
•
801
•
255
•
4
allenai/lmarena-100k-long-sample-prompts
Viewer
•
Updated
Mar 19
•
386
•
11
•
1
allenai/tulu-3-sft-olmo-2-mixture-0225
Viewer
•
Updated
Mar 14
•
866k
•
1.16k
•
14
allenai/multipref
Viewer
•
Updated
Mar 12
•
31.4k
•
8.09k
•
22
allenai/DataDecide-eval-instances
Viewer
•
Updated
Mar 10
•
1.17k
•
138
•
2
allenai/chatbot-area-preference-dissection
Viewer
•
Updated
Mar 10
•
5.24k
•
14
•
3
allenai/WildBench
Viewer
•
Updated
Mar 4
•
2.3k
•
1.39k
•
37
allenai/CoSyn-point
Viewer
•
Updated
Feb 28
•
69.1k
•
320
•
11
allenai/CoSyn-400K
Viewer
•
Updated
Feb 28
•
408k
•
2.3k
•
40
allenai/olmOCR-mix-0225
Viewer
•
Updated
Feb 25
•
259k
•
1.86k
•
162
allenai/pixmo-docs
Viewer
•
Updated
Feb 24
•
255k
•
2.13k
•
33
allenai/pixmo-docs-0223
Updated
Feb 23
•
6
Previous
1
2
3
4
5
...
10
Next