How can I convert this model to GGUF format? Is it possible to create this model with llama.cpp?
Not sure I understand the question.
These quants were created with llama.cpp.
Is it possible to convert the original Cohere LLM model to GGUF format? Which package is used for this conversion? I'm asking because some Cohere architectures are not supported in llama.cpp.
This was created with llama.cpp. The architecture is supported in llama.cpp:
https://github.com/ggml-org/llama.cpp/blob/master/convert_hf_to_gguf.py#L3923
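
For reference, the usual flow is to download the original Hugging Face checkpoint and then run that `convert_hf_to_gguf.py` script on the downloaded directory. A minimal sketch in Python (the output filename is illustrative, you need a local clone of llama.cpp with its Python requirements installed, and the gated repo may require `huggingface-cli login` first):

```python
import subprocess
from huggingface_hub import snapshot_download  # pip install huggingface_hub

# 1. Download the original (non-GGUF) checkpoint from the Hub.
model_dir = snapshot_download(repo_id="CohereLabs/c4ai-command-r7b-12-2024")

# 2. Run llama.cpp's converter on the downloaded directory.
#    Adjust the script path to wherever you cloned llama.cpp.
subprocess.run(
    [
        "python", "llama.cpp/convert_hf_to_gguf.py",
        model_dir,
        "--outfile", "command-r7b-f16.gguf",  # hypothetical output name
        "--outtype", "f16",
    ],
    check=True,
)
```

The resulting f16 GGUF can then be quantized further with llama.cpp's `llama-quantize` tool if you want smaller files.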
Thank you so much.
Hi, you quantized the CohereLabs/c4ai-command-r7b-12-2024 model, right?
```
ValueError                                Traceback (most recent call last)
in <cell line: 0>()
      2 model_path = "vicky4s4s/cohere_7b_gguf"
      3
----> 4 model = Llama(model_path=model_path, n_gpu_layers=100)

/usr/local/lib/python3.11/dist-packages/llama_cpp/llama.py in __init__(self, model_path, n_gpu_layers, split_mode, main_gpu, tensor_split, rpc_servers, vocab_only, use_mmap, use_mlock, kv_overrides, seed, n_ctx, n_batch, n_ubatch, n_threads, n_threads_batch, rope_scaling_type, pooling_type, rope_freq_base, rope_freq_scale, yarn_ext_factor, yarn_attn_factor, yarn_beta_fast, yarn_beta_slow, yarn_orig_ctx, logits_all, embedding, offload_kqv, flash_attn, no_perf, last_n_tokens_size, lora_base, lora_scale, lora_path, numa, chat_format, chat_handler, draft_model, tokenizer, type_k, type_v, spm_infill, verbose, **kwargs)
    366
    367         if not os.path.exists(model_path):
--> 368             raise ValueError(f"Model path does not exist: {model_path}")
    369
    370         self._model = self._stack.enter_context(

ValueError: Model path does not exist: vicky4s4s/cohere_7b_gguf
```
After converting Cohere models with llama.cpp, this error is raised at model load time. How can I fix it? Please share any ideas, or complete step-by-step instructions for converting these models the proper way.
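
For what it's worth, the traceback itself points at the cause: `Llama(model_path=...)` expects a path to a GGUF file on disk, and `"vicky4s4s/cohere_7b_gguf"` is a Hugging Face repo ID, not a local file, so the `os.path.exists` check fails. A minimal sketch of a fix, assuming the repo contains a GGUF file (the filename below is hypothetical; check the repo's Files tab for the real one):

```python
from huggingface_hub import hf_hub_download  # pip install huggingface_hub
from llama_cpp import Llama

# Download the GGUF file from the Hub into the local cache and get its path.
# NOTE: the filename is an assumption -- look up the actual .gguf filename
# in the repo before running this.
local_path = hf_hub_download(
    repo_id="vicky4s4s/cohere_7b_gguf",
    filename="cohere_7b.Q4_K_M.gguf",  # hypothetical filename
)

# Llama() wants a filesystem path, not a repo ID.
model = Llama(model_path=local_path, n_gpu_layers=100)
```

Recent versions of llama-cpp-python also provide `Llama.from_pretrained(repo_id=..., filename=...)`, which does the download step for you.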