Future Plans for Multi-Token Prediction Support?
#4, opened by NaiveYan
As referenced in this discussion, multi-token prediction (MTP) may improve DeepSeek-R1's efficiency. The current README does not list any parameters for speculative decoding. (Separately, it also omits the parameters for reasoning outputs and tool calling.) Could you confirm whether MTP support is planned?