Update README.md
Browse files
README.md
CHANGED
@@ -211,12 +211,12 @@ python -m sglang.launch_server --model-path HuggingFaceTB/SmolLM3-3B
|
|
211 |
#### vLLM
|
212 |
|
213 |
```bash
|
214 |
-
vllm serve HuggingFaceTB/SmolLM3-3B
|
215 |
```
|
216 |
|
217 |
#### Setting `chat_template_kwargs`
|
218 |
|
219 |
-
You can specify `chat_template_kwargs` such as `enable_thinking`
|
220 |
|
221 |
```bash
|
222 |
curl http://localhost:8000/v1/chat/completions -H "Content-Type: application/json" -d '{
|
|
|
211 |
#### vLLM
|
212 |
|
213 |
```bash
|
214 |
+
vllm serve HuggingFaceTB/SmolLM3-3B --enable-auto-tool-choice --tool-call-parser=hermes
|
215 |
```
|
216 |
|
217 |
#### Setting `chat_template_kwargs`
|
218 |
|
219 |
+
You can specify `chat_template_kwargs` such as `enable_thinking` to a deployed model by passing the `chat_template_kwargs` parameter in the API request.
|
220 |
|
221 |
```bash
|
222 |
curl http://localhost:8000/v1/chat/completions -H "Content-Type: application/json" -d '{
|