jasperyeoh2/mistral-dpo-peft

preference-optimization

instruction-tuning


jasperyeoh2 committed on Apr 25

Commit cae8908 · verified · 1 Parent(s): d6e35e1

Create README.md

Files changed (1)

README.md +10 -0

README.md ADDED

	@@ -0,0 +1,10 @@

+---
+datasets:
+- jasperyeoh2/pairrm-preference-dataset
+- GAIR/lima
+base_model:
+- mistralai/Mistral-7B-Instruct-v0.2
+tags:
+- PEFT
+- DPO
+---
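
The front matter added in this commit is plain YAML with three list-valued keys (`datasets`, `base_model`, `tags`). As a minimal sketch of how tooling might read it, the snippet below parses exactly that shape using only the Python standard library. The helper name `parse_front_matter` is hypothetical and not part of any library; it handles only top-level keys followed by `- item` lists, so a real project would more likely use a full YAML parser.

```python
def parse_front_matter(text):
    """Parse a minimal front matter block: top-level keys with '- item' lists.

    Hypothetical helper for illustration only; it does not support scalars,
    nesting, quoting, or any other YAML feature.
    """
    lines = text.strip().splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("front matter must start with '---'")

    meta = {}
    key = None
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":
            # Closing delimiter: the metadata block is complete.
            return meta
        if not stripped:
            continue
        if stripped.startswith("- "):
            if key is None:
                raise ValueError("list item found before any key")
            meta[key].append(stripped[2:].strip())
        elif stripped.endswith(":"):
            key = stripped[:-1].strip()
            meta[key] = []
        else:
            raise ValueError(f"unsupported line: {line!r}")
    raise ValueError("front matter is not closed by '---'")


# The README.md content added in this commit, without the diff '+' markers.
README = """---
datasets:
- jasperyeoh2/pairrm-preference-dataset
- GAIR/lima
base_model:
- mistralai/Mistral-7B-Instruct-v0.2
tags:
- PEFT
- DPO
---
"""

metadata = parse_front_matter(README)
print(metadata["base_model"])
```

Each key maps to a list here because every field in this commit is written as a YAML list, including the single-entry `base_model`.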