wen GGUF
#9 opened by sukkritsharma
I want to use this with llama.cpp, since my current stack already uses nomic-embed-text-v1.5.Q8_0.gguf. When can we expect GGUF variants?
I imagine this will require changes to llama.cpp/gpt4all before GGUFs are possible, so I would suggest creating an issue there.
zpn changed discussion status to closed
I would open the issue in the main llama.cpp repo: https://github.com/ggerganov/llama.cpp. Support for all modern BERT-based models (not just the embedding ones) will likely be desired.
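For reference, once llama.cpp gains architecture support, producing a GGUF typically follows its standard conversion workflow. This is only a sketch: the script and binary names come from llama.cpp's existing tooling, and the paths/model directory are placeholders, not an announced conversion for this model:

```shell
# Convert the Hugging Face checkpoint to a GGUF file (fp16)
python convert_hf_to_gguf.py /path/to/model --outfile model-f16.gguf --outtype f16

# Quantize to Q8_0, matching the format used in the stack above
./llama-quantize model-f16.gguf model-Q8_0.gguf Q8_0

# Sanity-check by computing an embedding
./llama-embedding -m model-Q8_0.gguf -p "hello world"
```

None of this will work until the model's architecture is actually recognized by the converter, which is why the upstream issue is the right first step.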