Future Plans for Multi-Token Prediction Support?

#4
by NaiveYan - opened

As referenced in this discussion, multi-token prediction (MTP) may improve DeepSeek-R1's inference efficiency. The current README does not list any parameters for speculative decoding. (Separately, the parameters for reasoning outputs and tool calling are also missing.) Could you confirm whether MTP support is planned?
