Collection related to the paper, "Training a Generally Curious Agent" (Project page: https://paprika-llm.github.io/)
Fahim Tajwar
ftajwar
AI & ML interests
LLMs, RLHF
Recent Activity
updated
a dataset
1 day ago
self-label-zanette-lab/big_math_full_dataset
published
a dataset
1 day ago
self-label-zanette-lab/big_math_full_dataset
updated
a dataset
7 days ago
self-label-zanette-lab/big_math_filtered_easy
Organizations
models
3
datasets
6
ftajwar/deduplicated_dapo_dataset
Viewer
•
Updated
•
17.4k
•
56
ftajwar/dapo_easy_one_third_sorted_by_frequency_of_majority_answer
Viewer
•
Updated
•
5.8k
•
53
ftajwar/dapo_easy_one_third_sorted_by_pass_rate
Viewer
•
Updated
•
5.8k
•
50
ftajwar/srt_test_dataset
Viewer
•
Updated
•
273
•
62
ftajwar/paprika_SFT_dataset
Viewer
•
Updated
•
17.2k
•
9
•
3
ftajwar/paprika_preference_dataset
Viewer
•
Updated
•
5.26k
•
13
•
1