No base model

#2
by ricardo-rei - opened

Is there going to be a release for the base model?

+1, especially considering the quote mentioned at the beginning of the README:

Qwen3-Next-80B-A3B-Base outperforms Qwen3-32B-Base on downstream tasks with 10% of the total training cost and with 10 times inference throughput for context over 32K tokens.

Then I checked the collection for Qwen3-Next-80B-A3B-Base, and... nothing 😔

+1 pls release the base model
