Hugging Face
Tianjian Li
dogtooth
2 followers · 6 following
https://tianjianl.github.io
truthbutcher
AI & ML interests
None yet
Recent Activity
Upvoted a paper (25 days ago): J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning
Authored a paper (28 days ago): SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning
Upvoted a paper (about 1 month ago): SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning
Organizations
Papers (2)
arxiv:2505.02363
arxiv:2310.00840
Models (0)
None public yet
Datasets (218)
Sort: Recently updated
dogtooth/helpsteer2_binarized_filtered • Viewer • Updated Apr 5 • 2.51k • 25
dogtooth/Big-Math-RL-Verified • Viewer • Updated Apr 3 • 1.52M • 14
dogtooth/default_project_dev_test • Viewer • Updated Mar 26 • 4k • 33
dogtooth/Big-Math-Selected-500 • Viewer • Updated Mar 25 • 3.5k • 9
dogtooth/Big-Math-RL-Verified-Chinese • Viewer • Updated Mar 6 • 251k • 29
dogtooth/mmlu • Viewer • Updated Mar 5 • 14.2k • 69
dogtooth/boolq • Viewer • Updated Mar 5 • 3.27k • 30
dogtooth/gpqa • Viewer • Updated Mar 5 • 448 • 39
dogtooth/math_qa • Viewer • Updated Mar 5 • 2.99k • 32
dogtooth/wiqa • Viewer • Updated Mar 5 • 3k • 21