Output bug
#22 opened about 22 hours ago
by
DazWilliams
Example Prompts
1
#21 opened about 22 hours ago
by
agat
duplicated bos_token when using apply_chat_template with Tokenizer
1
#20 opened 2 days ago
by
irvingjr
tokenizer.model
#19 opened 5 days ago
by
Lozai
Update README.md
#18 opened 7 days ago
by
tekno-power
<think> tag is missing in the latest revision
2
#17 opened 8 days ago
by
ajsqr
微调DeepSeek-R1打造SQL语言转自然语言视频教程
#16 opened 10 days ago
by
leo009

One more "0" in model-00001-of-000002.safetensors?
#15 opened 10 days ago
by
PPrimo
Excellent models !!! - Plans for Mistral Nemo and/or Gemma 2 Distills ?
#14 opened 14 days ago
by
DavidAU

Adding Evaluation Results
#12 opened 20 days ago
by
Mikhil-jivus
Missing multilanguage capabilities
5
#11 opened 21 days ago
by
h4rz3rk4s3
run in colab t4
#9 opened 24 days ago
by
rakmik
Adding Evaluation Results
#8 opened 25 days ago
by
T145

Add pipeline tag, link to paper
#7 opened 27 days ago
by
nielsr

Do the distilled models also have 128K context?
1
#4 opened about 1 month ago
by
Troyanovsky
How was this quantized?
1
#3 opened about 1 month ago
by
imq
missing special_tokens_map.json file
#2 opened about 1 month ago
by
vince62s
