Hugging Face
princeton-nlp's Collections
SimPO
Updated Mar 16
This collection contains a list of SimPO and baseline models.
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation
•
Updated
Aug 2, 2024
•
147k
•
164
princeton-nlp/gemma-2-9b-it-DPO
Text Generation
•
Updated
Jul 18, 2024
•
987
•
9
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation
•
Updated
Jun 17, 2024
•
916
•
1
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation
•
Updated
Jun 17, 2024
•
1.03k
princeton-nlp/Llama-3-Base-8B-SFT-KTO
Text Generation
•
Updated
Jun 17, 2024
•
920
princeton-nlp/Llama-3-Base-8B-SFT-ORPO
Text Generation
•
Updated
Jun 17, 2024
•
881
princeton-nlp/Llama-3-Base-8B-SFT-RDPO
Text Generation
•
Updated
Jun 17, 2024
•
894
princeton-nlp/Llama-3-Base-8B-SFT-SimPO
Text Generation
•
Updated
May 24, 2024
•
1.04k
•
1
princeton-nlp/Llama-3-Base-8B-SFT
Text Generation
•
Updated
Jun 17, 2024
•
11.9k
•
4
princeton-nlp/Llama-3-Instruct-8B-SimPO
Text Generation
•
Updated
Jun 17, 2024
•
1.06k
•
58
princeton-nlp/Llama-3-Instruct-8B-IPO
Text Generation
•
Updated
Jun 17, 2024
•
871
princeton-nlp/Llama-3-Instruct-8B-KTO
Text Generation
•
Updated
Jun 17, 2024
•
868
princeton-nlp/Llama-3-Instruct-8B-ORPO
Text Generation
•
Updated
Jun 17, 2024
•
846
princeton-nlp/Llama-3-Instruct-8B-RDPO
Text Generation
•
Updated
Jun 17, 2024
•
843
princeton-nlp/Llama-3-Instruct-8B-DPO
Text Generation
•
Updated
Jun 17, 2024
•
874
princeton-nlp/Mistral-7B-Instruct-RDPO
Text Generation
•
Updated
Jun 17, 2024
•
854
princeton-nlp/Mistral-7B-Instruct-DPO
Text Generation
•
Updated
Jun 17, 2024
•
858
princeton-nlp/Mistral-7B-Instruct-IPO
Text Generation
•
Updated
Jun 17, 2024
•
866
princeton-nlp/Mistral-7B-Instruct-KTO
Text Generation
•
Updated
Jun 17, 2024
•
870
princeton-nlp/Mistral-7B-Instruct-SimPO
Text Generation
•
Updated
Jun 17, 2024
•
877
•
2
princeton-nlp/Mistral-7B-Instruct-ORPO
Text Generation
•
Updated
Jun 17, 2024
•
845
princeton-nlp/Mistral-7B-Base-SFT-IPO
Text Generation
•
Updated
Jun 17, 2024
•
883
princeton-nlp/Mistral-7B-Base-SFT-KTO
Text Generation
•
Updated
Jun 17, 2024
•
908
princeton-nlp/Mistral-7B-Base-SFT-DPO
Text Generation
•
Updated
Jun 17, 2024
•
913
princeton-nlp/Mistral-7B-Base-SFT-RDPO
Text Generation
•
Updated
Jun 17, 2024
•
876
princeton-nlp/Mistral-7B-Base-SFT-SimPO
Text Generation
•
Updated
Jun 17, 2024
•
970
princeton-nlp/llama3-ultrafeedback
Viewer
•
Updated
Jul 18, 2024
•
61.8k
•
825
•
18
princeton-nlp/Mistral-7B-Base-SFT-CPO
Text Generation
•
Updated
Sep 30, 2024
•
873
•
1
princeton-nlp/Mistral-7B-Base-SFT-RRHF
Text Generation
•
Updated
Sep 30, 2024
•
851
princeton-nlp/Mistral-7B-Base-SFT-SLiC-HF
Text Generation
•
Updated
Jul 7, 2024
•
870
princeton-nlp/Mistral-7B-Instruct-CPO
Text Generation
•
Updated
Jul 7, 2024
•
860
princeton-nlp/Mistral-7B-Instruct-RRHF
Text Generation
•
Updated
Jul 7, 2024
•
850
princeton-nlp/Mistral-7B-Instruct-SLiC-HF
Text Generation
•
Updated
Jul 7, 2024
•
850
princeton-nlp/Llama-3-Base-8B-SFT-CPO
Text Generation
•
Updated
Jul 7, 2024
•
876
princeton-nlp/Llama-3-Base-8B-SFT-RRHF
Text Generation
•
Updated
Jul 7, 2024
•
836
princeton-nlp/Llama-3-Base-8B-SFT-SLiC-HF
Text Generation
•
Updated
Jul 7, 2024
•
879
princeton-nlp/Llama-3-Instruct-8B-CPO
Text Generation
•
Updated
Jul 7, 2024
•
843
princeton-nlp/Llama-3-Instruct-8B-RRHF
Text Generation
•
Updated
Jul 7, 2024
•
847
princeton-nlp/Llama-3-Instruct-8B-SLiC-HF
Text Generation
•
Updated
Jul 7, 2024
•
853
princeton-nlp/Llama-3-Instruct-8B-RRHF-v0.2
Text Generation
•
Updated
Jul 7, 2024
•
13
princeton-nlp/Llama-3-Instruct-8B-SLiC-HF-v0.2
Text Generation
•
Updated
Jul 7, 2024
•
838
princeton-nlp/Llama-3-Instruct-8B-DPO-v0.2
Text Generation
•
Updated
Jul 7, 2024
•
857
princeton-nlp/Llama-3-Instruct-8B-IPO-v0.2
Text Generation
•
Updated
Jul 7, 2024
•
852
princeton-nlp/Llama-3-Instruct-8B-CPO-v0.2
Text Generation
•
Updated
Jul 7, 2024
•
840
princeton-nlp/Llama-3-Instruct-8B-KTO-v0.2
Text Generation
•
Updated
Jul 7, 2024
•
844
princeton-nlp/Llama-3-Instruct-8B-ORPO-v0.2
Text Generation
•
Updated
Jul 7, 2024
•
832
princeton-nlp/Llama-3-Instruct-8B-RDPO-v0.2
Text Generation
•
Updated
Jul 7, 2024
•
900
princeton-nlp/Llama-3-Instruct-8B-SimPO-v0.2
Text Generation
•
Updated
Jul 7, 2024
•
954
•
6
princeton-nlp/llama3-ultrafeedback-armorm
Viewer
•
Updated
Jul 18, 2024
•
61.8k
•
541
•
17