Update README.md
Browse files
README.md
CHANGED
@@ -50,7 +50,7 @@ print(output[0].outputs[0].text)
|
|
50 |
## Serving
|
51 |
Then we can serve with the following command:
|
52 |
```
|
53 |
-
vllm serve pytorch/Phi-4-mini-instruct-
|
54 |
```
|
55 |
|
56 |
# Inference with Transformers
|
|
|
50 |
## Serving
|
51 |
Then we can serve with the following command:
|
52 |
```
|
53 |
+
vllm serve pytorch/Phi-4-mini-instruct-float8dq --tokenizer microsoft/Phi-4-mini-instruct -O3
|
54 |
```
|
55 |
|
56 |
# Inference with Transformers
|