The *RLMT* collection. Coming soon!
Princeton NLP group
princeton-nlp
AI & ML interests
None yet
Recent Activity
new activity
11 days ago
HuggingFaceTB/FineMath-Llama-3B:Hyperparameters
updated
a collection
2 months ago
RLMT Experiments
updated
a collection
2 months ago
RLMT Experiments
Organizations
SimPO
This collections contains a list of SimPO and baseline models.
-
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation • 9B • Updated • 1.29k • • 170 -
princeton-nlp/gemma-2-9b-it-DPO
Text Generation • 9B • Updated • 36 • • 9 -
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation • 8B • Updated • 24 • • 1 -
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation • 8B • Updated • 151 •
RLMT Experiments
The *RLMT* collection. Coming soon!
SimPO
This collections contains a list of SimPO and baseline models.
-
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation • 9B • Updated • 1.29k • • 170 -
princeton-nlp/gemma-2-9b-it-DPO
Text Generation • 9B • Updated • 36 • • 9 -
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation • 8B • Updated • 24 • • 1 -
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation • 8B • Updated • 151 •
models
306
princeton-nlp/warm-start__grpo__nothink__Qwen2.5-7B-Instruct
8B
•
Updated
•
27
princeton-nlp/warm-start__grpo__nothink__Llama-3.1-8B-Instruct
8B
•
Updated
•
7
princeton-nlp/warm-start__grpo__nothink__Qwen2.5-7B
8B
•
Updated
•
10
princeton-nlp/warm-start__grpo__nothink__Llama-3.1-8B
8B
•
Updated
•
5
princeton-nlp/warm-start__grpo__think__Qwen2.5-7B-Instruct
8B
•
Updated
•
8
princeton-nlp/warm-start__grpo__think__Llama-3.1-8B-Instruct
8B
•
Updated
•
7
princeton-nlp/warm-start__grpo__think__Qwen2.5-7B
8B
•
Updated
•
16
princeton-nlp/warm-start__grpo__think__Llama-3.1-8B
8B
•
Updated
•
8
princeton-nlp/zero__grpo__nothink__Qwen2.5-7B
8B
•
Updated
•
8
princeton-nlp/zero__grpo__nothink__Llama-3.1-8B
8B
•
Updated
•
6
datasets
47
princeton-nlp/rl_tulu3_wildchat-if_prompts
Viewer
•
Updated
•
7.79k
•
48
•
3
princeton-nlp/gemini_2.5_flash_0417_sft-data
Viewer
•
Updated
•
6k
•
57
•
1
princeton-nlp/prolong-data-512K
Updated
•
10.1k
•
11
princeton-nlp/SWE-bench_Lite
Viewer
•
Updated
•
323
•
32.4k
•
50
princeton-nlp/SWE-bench
Viewer
•
Updated
•
21.5k
•
20.2k
•
128
princeton-nlp/SWE-bench_Verified
Viewer
•
Updated
•
500
•
596k
•
233
princeton-nlp/TextbooksBySubject
Viewer
•
Updated
•
129
•
65
princeton-nlp/TextbookChapters
Viewer
•
Updated
•
77.9k
•
53
•
10
princeton-nlp/SWE-bench_Multimodal
Viewer
•
Updated
•
612
•
1.15k
•
21
princeton-nlp/fineweb_edu-swahili-translated
Viewer
•
Updated
•
137k
•
50
•
2