how to convert this model in gguf format.

#2
by vicky4s4s - opened

This model possible to create in llama.cpp??

DevQuasar org

Not sure I understand the question.
These quants are created with llama.cpp

This cohere original llm model possible to convert gguf format.

Which package used to convert this format?.

Beacause, some cohere architecture not support in llama. Cpp.

DevQuasar org

This is created with llama.cpp
This arch is supported in llama.cpp:
https://github.com/ggml-org/llama.cpp/blob/master/convert_hf_to_gguf.py#L3923

Thank you so much.

Hi, You quantization in CohereLabs/c4ai-command-r7b-12-2024 this model right?.

ValueError Traceback (most recent call last)
in <cell line: 0>()
2 model_path = "vicky4s4s/cohere_7b_gguf"
3
----> 4 model = Llama(model_path=model_path, n_gpu_layers=100)

/usr/local/lib/python3.11/dist-packages/llama_cpp/llama.py in init(self, model_path, n_gpu_layers, split_mode, main_gpu, tensor_split, rpc_servers, vocab_only, use_mmap, use_mlock, kv_overrides, seed, n_ctx, n_batch, n_ubatch, n_threads, n_threads_batch, rope_scaling_type, pooling_type, rope_freq_base, rope_freq_scale, yarn_ext_factor, yarn_attn_factor, yarn_beta_fast, yarn_beta_slow, yarn_orig_ctx, logits_all, embedding, offload_kqv, flash_attn, no_perf, last_n_tokens_size, lora_base, lora_scale, lora_path, numa, chat_format, chat_handler, draft_model, tokenizer, type_k, type_v, spm_infill, verbose, **kwargs)
366
367 if not os.path.exists(model_path):
--> 368 raise ValueError(f"Model path does not exist: {model_path}")
369
370 self._model = self._stack.enter_context(

ValueError: Model path does not exist: vicky4s4s/cohere_7b_gguf

Cohere models llama.cpp used to convert after, model load time this error will be raise. how to fix this one. please let me know any idea or give the complete instructions how to convert this models proper way in step by step.

DevQuasar org

Sorry I'm not sure I understand what do you want to achieve.
Here's the provided gguf q2 quant inference proof

Screenshot 2025-04-20 at 10.25.54 AM.png

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment