Could you give me a reason why you ignore the kv_a_proj_with_mqa layer when quantizing this model?
1 · #10 opened 11 days ago by superahn
Frequent interruptions during reasoning with vllm 0.8.1
#9 opened about 1 month ago by alwinzhang
Stuck when running on 8xH100
1 · #8 opened about 2 months ago by Thai
Accuracy test
#1 opened 2 months ago by zhnagchenchne