Qwen/Qwen3-4B-FP8
Tags: Text Generation, Transformers, Safetensors, qwen3, conversational, text-generation-inference, fp8, arxiv:2309.00071, arxiv:2505.09388
License: apache-2.0
Commit History for Qwen3-4B-FP8 (branch: main)
5dad03d (verified) | Update README.md | littlebird13 committed 22 days ago
f3ecd40 | update tokenizer_config.json | feihu.hf committed 24 days ago
d968998 (verified) | Remove vLLM FP8 Limitation (#2) | jklj077, simon-mo committed on Apr 30
0dcffbe (verified) | Update README.md | yangapku committed on Apr 29
884ae87 (verified) | Update README.md | yangapku committed on Apr 29
bcd75a3 (verified) | Update README.md | yangapku committed on Apr 28
35fec96 (verified) | Update README.md | littlebird13 committed on Apr 28
1ef33a9 (verified) | Update README.md | jklj077 committed on Apr 28
97f8501 (verified) | Delete special_tokens_map.json | littlebird13 committed on Apr 28
be2fe05 (verified) | Delete added_tokens.json | littlebird13 committed on Apr 28
c1919f6 (verified) | Update README.md | littlebird13 committed on Apr 28
e66e5a4 (verified) | Update generation_config.json | littlebird13 committed on Apr 28
1d3f2ab (verified) | Update README.md | littlebird13 committed on Apr 28
ae9c71f (verified) | Upload folder using huggingface_hub | littlebird13 committed on Apr 28
3fdd654 (verified) | initial commit | littlebird13 committed on Apr 28