The collection for the Project "Simple Reinforcement Learning for Reasoning"
HKUST NLP Group
university
Collections (6)
Models (35)
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL: 12 downloads, 1 like
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL-Zero: 26 downloads, 2 likes
hkust-nlp/preselect-fasttext-classifier
hkust-nlp/qwen2.5-7b-coder_codeio_stage1: 16 downloads
hkust-nlp/qwen2.5-7b-coder_codeio: 21 downloads
hkust-nlp/qwen2.5-7b-coder_codeio_pp_stage1: 25 downloads
hkust-nlp/qwen2.5-7b-coder_codeio_pp: 32 downloads, 3 likes
hkust-nlp/llama3.1-8b_codeio_stage1: 18 downloads
hkust-nlp/llama3.1-8b_codeio: 22 downloads
hkust-nlp/llama3.1-8b_codeio_pp_stage1: 17 downloads
Datasets (21)
hkust-nlp/PreSelect-100B: 54.5M rows, 182 downloads, 1 like
hkust-nlp/CodeIO-PyEdu-Reasoning: 433 downloads, 31 likes
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw: 60 downloads
hkust-nlp/SynCSE-partial-NLI: 263k rows, 63 downloads, 2 likes
hkust-nlp/SynCSE-scratch-NLI: 276k rows, 87 downloads, 2 likes
hkust-nlp/gsm8k-fix: 7.47k rows, 90 downloads, 2 likes
hkust-nlp/dart-math-uniform: 591k rows, 105 downloads, 9 likes
hkust-nlp/vrt-baseline: 591k rows, 57 downloads, 1 like
hkust-nlp/dart-math-hard: 585k rows, 122 downloads, 13 likes
hkust-nlp/dart-math-pool-gsm8k-query-info: 7.47k rows, 60 downloads, 2 likes