Qwen
/

Qwen3-8B

Text Generation

text-generation-inference

Model card Files Files and versions

Resources

View closed (3)

Why is max_position_embeddings set to 40K in config.json but original_max_position_embeddings is 32K in the README? Which one should be used?

#17 opened about 18 hours ago by

License file missing under repository

#16 opened 2 days ago by

Add assistant mask support to Qwen3-8B

#14 opened 7 days ago by

This model is a benchmark for 8B models

#13 opened 13 days ago by

aqqqx

#12 opened 17 days ago by

Multilingual Prompt vs Single Language Prompt

#11 opened 28 days ago by

Error while trying to Deploy this model

#9 opened 29 days ago by

READ

#8 opened about 1 month ago by

New 8B model much slower than old 7B model when running on vLLM.

#6 opened about 1 month ago by

Qwen3-8B supported AWQ quantization

#5 opened about 1 month ago by

Collections of Bad Cases User Reviews and Comments of Qwen3 8B model

#4 opened about 1 month ago by

🚀[Fine-tuning] Implementation and Best Practices for Qwen3 CPT/SFT/DPO/GRPO Training👋

#3 opened about 1 month ago by

【Evaluation】Best practice for evaluating Qwen3 !!

#2 opened about 2 months ago by

Add languages tag

#1 opened about 2 months ago by

de-francophones