
THUDM/GLM-4-32B-0414

Text Generation · Transformers · Safetensors · Chinese · English · glm4 · conversational
Community discussions (17)
Please release an AWQ version

#17 opened 4 days ago by classdemo

A quick test using an M1 Max (64 GB) and Word

#16 opened 11 days ago by gptlocalhost

Awesome model! Can we get a version with a larger context window?

#15 opened 12 days ago by seall0

It supports the Serbo-Croatian language very well!

#13 opened 15 days ago by JLouisBiz

GPTQ or AWQ quants

#12 opened 15 days ago by guialfaro

Great job, and thanks for this model.

#11 opened 16 days ago by Dampfinchen

Recommended sampling parameters?

#10 opened 18 days ago by AaronFeng753

Can we have some more popular benchmarks?

#8 opened 19 days ago by rombodawg

The model is the best for coding.

#7 opened 22 days ago by AekDevDev

When running on a single GPU, I get an error saying the VRAM is insufficient; when using multiple GPUs on a single machine, I hit many other errors. My vLLM version is 0.8.4.

#6 opened 22 days ago by hanson888

BitsAndBytes quantization inference error

#5 opened 22 days ago by chengfy

A bug when using function calling with vllm==0.8.4

#4 opened 23 days ago by waple

SimpleQA scores are WAY off

#3 opened 24 days ago by phil111

Need an FP8 version for inference

#2 opened 25 days ago by iwaitu

RuntimeError: CUDA error: device-side assert triggered

#1 opened 25 days ago by DsnTgr