ReasoningParser
#3
by
victors2709
- opened
Hello.
I tried to start with
vllm/vllm-openai:v0.10.1.1 \
--model LLM360/K2-Think \
--tensor-parallel-size 2\
--disable-log-requests \
--reasoning-parser qwen3 \
--enable-auto-tool-choice --tool-call-parser hermes \
--enable-prompt-tokens-details \
--port 8002
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] EngineCore failed to start.
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] Traceback (most recent call last):
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] File "/usr/local/lib/python3.12/dist-packages/vllm/v1/engine/core.py", line 691, in run_engine_core
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] engine_core = EngineCoreProc(*args, **kwargs)
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] File "/usr/local/lib/python3.12/dist-packages/vllm/v1/engine/core.py", line 492, in __init__
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] super().__init__(vllm_config, executor_class, log_stats,
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] File "/usr/local/lib/python3.12/dist-packages/vllm/v1/engine/core.py", line 96, in __init__
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] self.structured_output_manager = StructuredOutputManager(vllm_config)
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] File "/usr/local/lib/python3.12/dist-packages/vllm/v1/structured_output/__init__.py", line 72, in __init__
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] self.reasoner = reasoner_cls(tokenizer=self.tokenizer)
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] File "/usr/local/lib/python3.12/dist-packages/vllm/reasoning/qwen3_reasoning_parser.py", line 43, in __init__
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] raise RuntimeError(
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] RuntimeError: Qwen3 reasoning parser could not locate think start/end tokens in the tokenizer!
The base model for K2-Think is Qwen2.5-32B. <think>
and </think>
do not appear as special tokens as they do in Qwen3.
aaryamonvikram
changed discussion status to
closed