The code needed to run this model, as well as the base model itself, is not ready yet.
It is uploaded merely to help with testing.
See https://github.com/ggerganov/llama.cpp/pull/8151