nielsr HF Staff

Add pipeline tag, library name (#1)

a045060 verified 3 days ago

.gitattributes (1.57 kB): Upload model files for Guided by Gut paper (DeepSeek-R1-Distill-Qwen-1.5B-LIMO_ConfReward3_last10step), 13 days ago
README.md (2.42 kB): Add pipeline tag, library name (#1), 3 days ago
config.json (867 Bytes): Upload model files for Guided by Gut paper (DeepSeek-R1-Distill-Qwen-1.5B-LIMO_ConfReward3_last10step), 13 days ago
generation_config.json (186 Bytes): Upload model files for Guided by Gut paper (DeepSeek-R1-Distill-Qwen-1.5B-LIMO_ConfReward3_last10step), 13 days ago
model-00001-of-00002.safetensors (5 GB, xet): Upload model files for Guided by Gut paper (DeepSeek-R1-Distill-Qwen-1.5B-LIMO_ConfReward3_last10step), 13 days ago
model-00002-of-00002.safetensors (2.11 GB, xet): Upload model files for Guided by Gut paper (DeepSeek-R1-Distill-Qwen-1.5B-LIMO_ConfReward3_last10step), 13 days ago
model.safetensors.index.json (27.8 kB): Upload model files for Guided by Gut paper (DeepSeek-R1-Distill-Qwen-1.5B-LIMO_ConfReward3_last10step), 13 days ago
special_tokens_map.json (485 Bytes): Upload model files for Guided by Gut paper (DeepSeek-R1-Distill-Qwen-1.5B-LIMO_ConfReward3_last10step), 13 days ago
tokenizer.json (11.4 MB, xet): Upload model files for Guided by Gut paper (DeepSeek-R1-Distill-Qwen-1.5B-LIMO_ConfReward3_last10step), 13 days ago
tokenizer_config.json (6.77 kB): Upload model files for Guided by Gut paper (DeepSeek-R1-Distill-Qwen-1.5B-LIMO_ConfReward3_last10step), 13 days ago