fail
#30
by
ratboy1
- opened
🥲 Failed to load the model
Failed to load model
error loading model: missing tensor 'blk.0.ffn_down_exps.weight'
I'm facing the same issue when running GH200 using the CUDA Deep Learning image. https://catalog.ngc.nvidia.com/orgs/nvidia/containers/cuda-dl-base
the same error
$ ollama run mixtral:8x7b-ins-v0.1-q8 "hi"
Error: llama runner process has terminated: error loading model: missing tensor 'blk.0.ffn_down_exps.weight'
$ ollama show mixtral:8x7b-ins-v0.1-q8
Model
architecture llama
parameters 46.7B
context length 32768
embedding length 4096
quantization Q8_0
Parameters
stop "[INST]"
stop "[/INST]"
$ ollama --version
ollama version is 0.6.2
same error