No base model

by ricardo-rei - opened 29 days ago

Discussion

ricardo-rei

29 days ago

Is there going to be a release for the base model?

Downtown-Case

29 days ago

Ernigma

29 days ago

drmcbride

29 days ago

pszemraj

27 days ago

+1, especially considering the quote mentioned at the beginning of the README:

Qwen3-Next-80B-A3B-Base outperforms Qwen3-32B-Base on downstream tasks with 10% of the total training cost and with 10 times inference throughput for context over 32K tokens.

Then I checked the collection for Qwen3-Next-80B-A3B-Base, and... nothing 😔

maybeMayank

23 days ago

guohoujian

23 days ago

Aly87

14 days ago

+1 pls release the base model
