dranger003/c4ai-command-r-plus-iMat.GGUF

Text Generation · GGUF · imatrix · conversational
Community · 19 discussions
How about a quantized version that fits in 16 GB of memory, like WizardLM?

3
#19 opened about 1 year ago by Zibri

Will you redo the quants after your BPE PR gets merged?

2
#18 opened about 1 year ago by ggnoy

I'm generating an imatrix using `groups_merged.txt`; want me to run any tests?

19
#15 opened about 1 year ago by jukofyork

Can we get a Q4 without the IMat?

2
#14 opened about 1 year ago by yehiaserag

Fails on 104b-iq2_xxs.gguf with llama.cpp

4
#12 opened about 1 year ago by telehan

Invalid split files?

3
#11 opened about 1 year ago by SabinStargem

Unable to load in Ollama built from the PR branch

3
#10 opened about 1 year ago by gigq

Is IQ1_S broken? If so, why list it here?

1
#9 opened about 1 year ago by stduhpf

Fast work by the people on the llama.cpp team

3
#8 opened about 1 year ago by qaraleza

For a context of at least 32K tokens, which version on a 2×16GB GPU config?

1
#3 opened about 1 year ago by Kalemnor

What does iMat mean?

15
#2 opened about 1 year ago by AS1200