ReasoningParser

#3
by victors2709 - opened

Hello.
I tried to start with

 vllm/vllm-openai:v0.10.1.1 \
 --model LLM360/K2-Think \
 --tensor-parallel-size 2\
 --disable-log-requests \
 --reasoning-parser qwen3 \
 --enable-auto-tool-choice --tool-call-parser hermes \
 --enable-prompt-tokens-details \
 --port 8002
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] EngineCore failed to start.
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] Traceback (most recent call last):
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]   File "/usr/local/lib/python3.12/dist-packages/vllm/v1/engine/core.py", line 691, in run_engine_core
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]     engine_core = EngineCoreProc(*args, **kwargs)
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]   File "/usr/local/lib/python3.12/dist-packages/vllm/v1/engine/core.py", line 492, in __init__
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]     super().__init__(vllm_config, executor_class, log_stats,
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]   File "/usr/local/lib/python3.12/dist-packages/vllm/v1/engine/core.py", line 96, in __init__
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]     self.structured_output_manager = StructuredOutputManager(vllm_config)
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]                                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]   File "/usr/local/lib/python3.12/dist-packages/vllm/v1/structured_output/__init__.py", line 72, in __init__
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]     self.reasoner = reasoner_cls(tokenizer=self.tokenizer)
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]                     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]   File "/usr/local/lib/python3.12/dist-packages/vllm/reasoning/qwen3_reasoning_parser.py", line 43, in __init__
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700]     raise RuntimeError(
(EngineCore_0 pid=270) ERROR 09-10 07:52:08 [core.py:700] RuntimeError: Qwen3 reasoning parser could not locate think start/end tokens in the tokenizer!
LLM360 org

The base model for K2-Think is Qwen2.5-32B. <think> and </think> do not appear as special tokens as they do in Qwen3.

aaryamonvikram changed discussion status to closed

Sign up or log in to comment