fail

#30
by ratboy1 - opened
🥲 Failed to load the model

Failed to load model

error loading model: missing tensor 'blk.0.ffn_down_exps.weight'

I'm facing the same issue when running on a GH200 with the CUDA Deep Learning image: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/cuda-dl-base

Same error here:

$ ollama run mixtral:8x7b-ins-v0.1-q8 "hi"
Error: llama runner process has terminated: error loading model: missing tensor 'blk.0.ffn_down_exps.weight'

$ ollama show mixtral:8x7b-ins-v0.1-q8
  Model
    architecture        llama
    parameters          46.7B
    context length      32768
    embedding length    4096
    quantization        Q8_0
  Parameters
    stop    "[INST]"
    stop    "[/INST]"

$ ollama --version
ollama version is 0.6.2
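
In case it helps anyone debugging this: one possible cause (an assumption, not confirmed anywhere in this thread) is that the GGUF file stores each expert as a separate tensor, e.g. `blk.0.ffn_down.0.weight`, while the loader expects a single merged tensor such as `blk.0.ffn_down_exps.weight`. Below is a minimal sketch that pulls the tensor name out of the error line and classifies a list of tensor names. The helper names `missing_tensor` and `expert_layout` are hypothetical, the per-expert naming pattern is my assumption, and you would have to dump the tensor names from the file yourself with whatever GGUF inspection tool you have.

```python
import re

# Assumption: merged-expert files use names like "blk.0.ffn_down_exps.weight",
# while older split-expert files use names like "blk.0.ffn_down.3.weight".
MERGED = re.compile(r"^blk\.(\d+)\.ffn_(gate|down|up)_exps\.weight$")
LEGACY = re.compile(r"^blk\.(\d+)\.ffn_(gate|down|up)\.(\d+)\.weight$")


def missing_tensor(error_line):
    """Return the tensor name quoted in a "missing tensor '...'" error, or None."""
    match = re.search(r"missing tensor '([^']+)'", error_line)
    return match.group(1) if match else None


def expert_layout(tensor_names):
    """Classify tensor names as 'merged', 'legacy-split', or 'no-experts'."""
    if any(MERGED.match(name) for name in tensor_names):
        return "merged"
    if any(LEGACY.match(name) for name in tensor_names):
        return "legacy-split"
    return "no-experts"


if __name__ == "__main__":
    line = ("Error: llama runner process has terminated: error loading model: "
            "missing tensor 'blk.0.ffn_down_exps.weight'")
    print(missing_tensor(line))
    # Hypothetical tensor names, for illustration only.
    print(expert_layout(["blk.0.ffn_down.0.weight", "blk.0.ffn_down.1.weight"]))
```

If the file turns out to be "legacy-split", re-downloading or re-converting the model with a current converter may be worth trying, though I have not verified that this fixes the error reported here.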

Same error.
