PowerInfer
/

SmallThinker-4BA0.6B-Instruct

Text Generation

feature-extraction

Model card Files Files and versions Community

yixinsong commited on 16 days ago

Commit

b51db6d

·

verified ·

1 Parent(s): 8ca0b32

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -36,6 +36,8 @@ For the MMLU evaluation, we use a 0-shot CoT setting.
 Note：i9 14900、1+13 8ge4 use 4 threads，others use the number of threads that can achieve the maximum speed. All models here have been quantized to q4_0.
 ## Model Card
 <div align="center">

 Note：i9 14900、1+13 8ge4 use 4 threads，others use the number of threads that can achieve the maximum speed. All models here have been quantized to q4_0.
+You can deploy SmallThinker with offloading support using [PowerInfer](https://github.com/SJTU-IPADS/PowerInfer/tree/main/smallthinker)
 ## Model Card
 <div align="center">