Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tianjian Li's picture
1 7 8

Tianjian Li

dogtooth
Fishtiks's profile picture jackzhang's profile picture
·
https://tianjianl.github.io
  • truthbutcher

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago
J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning
authored a paper 28 days ago
SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning
upvoted a paper about 1 month ago
SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning
View all activity

Organizations

Johns Hopkins University's profile picture

Papers 2

arxiv:2505.02363
arxiv:2310.00840

models 0

None public yet

datasets 218

dogtooth/helpsteer2_binarized_filtered

Viewer • Updated Apr 5 • 2.51k • 25

dogtooth/Big-Math-RL-Verified

Viewer • Updated Apr 3 • 1.52M • 14

dogtooth/default_project_dev_test

Viewer • Updated Mar 26 • 4k • 33

dogtooth/Big-Math-Selected-500

Viewer • Updated Mar 25 • 3.5k • 9

dogtooth/Big-Math-RL-Verified-Chinese

Viewer • Updated Mar 6 • 251k • 29

dogtooth/mmlu

Viewer • Updated Mar 5 • 14.2k • 69

dogtooth/boolq

Viewer • Updated Mar 5 • 3.27k • 30

dogtooth/gpqa

Viewer • Updated Mar 5 • 448 • 39

dogtooth/math_qa

Viewer • Updated Mar 5 • 2.99k • 32

dogtooth/wiqa

Viewer • Updated Mar 5 • 3k • 21
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs