No base model
#2 opened by ricardo-rei
Is there going to be a release for the base model?
+1
+1
+1, especially considering the quote mentioned at the beginning of the README:
Qwen3-Next-80B-A3B-Base outperforms Qwen3-32B-Base on downstream tasks with 10% of the total training cost and with 10 times inference throughput for context over 32K tokens.
Then I checked the collection for Qwen3-Next-80B-A3B-Base, and... nothing.
+1
+1
+1 pls release the base model