Qwen/Qwen3-4B-FP8
Tags: Text Generation, Transformers, Safetensors, qwen3, conversational, text-generation-inference, fp8, arxiv:2309.00071, arxiv:2505.09388
License: apache-2.0
Remove vLLM FP8 Limitation #2
by simon-mo, opened Apr 29
base: refs/heads/main ← from: refs/pr/2
Files changed: +0 -23
simon-mo
Qwen org
Apr 29
This has been fixed as of the latest v0.8.5 release.
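For readers who want to confirm their environment already includes that release, here is a minimal sketch of a version check. It assumes the release in question is vLLM v0.8.5 (the thread title names vLLM) and that the package is installed under the distribution name `vllm`. The helper names `parse_version` and `vllm_has_fp8_fix` are hypothetical and are not part of vLLM.

```python
import re
from importlib import metadata

# Minimum release named in the comment above (assumed to refer to vLLM).
MIN_VLLM = (0, 8, 5)


def parse_version(text):
    """Return the leading numeric components of a version string as a tuple.

    Suffixes such as ".post1" or "rc1" are ignored, so a pre-release of
    0.8.5 is treated the same as the final release.
    """
    match = re.match(r"v?(\d+(?:\.\d+)*)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    return tuple(int(part) for part in match.group(1).split("."))


def vllm_has_fp8_fix(installed=None):
    """Report whether the given (or installed) vLLM version is at least 0.8.5."""
    if installed is None:
        try:
            installed = metadata.version("vllm")
        except metadata.PackageNotFoundError:
            # vLLM is not installed in this environment.
            return False
    return parse_version(installed) >= MIN_VLLM


if __name__ == "__main__":
    print("vLLM >= 0.8.5:", vllm_has_fp8_fix())
```

Passing an explicit string, for example `vllm_has_fp8_fix("0.8.4")`, checks a version without needing vLLM installed.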
Remove vLLM FP8 Limitation
5bb0cb13
jklj077 changed pull request status to merged, Apr 30