rozek
/

LLaMA-2-7B-32K-Instruct_GGUF

Text Generation

text-generation-inference

togethercomputer

Model card Files Files and versions Community

Resources

View closed (0)

Error when trying to ask a question - ggml_allocr_alloc: not enough space in the buffer (needed 178227200, largest block available 19333120)

#3 opened over 1 year ago by

Get a Segementation fault when loading the model

#2 opened over 1 year ago by

Quantizations for llama.cpp

#1 opened over 1 year ago by