dots.llm1.inst
you must use the latest llama.cpp (support was merged today!)
https://huggingface.co/rednote-hilab/dots.llm1.inst
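For anyone building from source, here is a minimal sketch of pulling and building the latest llama.cpp so the newly merged dots.llm1 support is available. The repository URL and CMake flow are llama.cpp's standard ones; the local checkout path is an assumption.

```python
# Minimal sketch: fetch and build the latest llama.cpp so the newly
# merged dots.llm1 support is available.
import subprocess

REPO = "https://github.com/ggml-org/llama.cpp"
CHECKOUT = "llama.cpp"  # hypothetical local directory, adjust to taste

# Clone the latest sources (dots.llm1 support was merged upstream today).
subprocess.run(["git", "clone", REPO, CHECKOUT], check=True)

# Configure and build with CMake, the project's standard build flow.
subprocess.run(["cmake", "-B", "build"], cwd=CHECKOUT, check=True)
subprocess.run(["cmake", "--build", "build", "--config", "Release", "-j"],
               cwd=CHECKOUT, check=True)
```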
I'm so excited about dots.llm1, and I already informed @mradermacher about it in https://huggingface.co/mradermacher/BabyHercules-4x150M-GGUF/discussions/6#684e9961e30cadf2f6406b44, but he seems to be so busy he doesn't even respond there anymore. Let's hope @mradermacher can update to the latest llama.cpp soon. I already merged the latest changes into our llama.cpp fork, but unfortunately only @mradermacher is able to update all the workers to the latest llama.cpp version.
good luck with the update :)
I do read things, eventually :) I also tend to get excited only after I've tried something :) But yeah, it's certainly an exciting model. Maybe I can run a Q4 on my CPU for blazing fast speed...
Q4 is already available, but other quants are not :)
Arruhm... uh... ehm... only an imatrix q4 will do, yes.
As a quick status update: we successfully converted dots.llm1.inst to GGUF and computed its imatrix. We are now computing its static quants, followed by its imatrix quants. dots.llm1.base is on track as well.
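For reference, here is a hedged sketch of that pipeline driving llama.cpp's standard tools. The file names, paths, and calibration corpus are assumptions for illustration, not the exact setup used on the workers.

```python
# Hedged sketch of the convert -> imatrix -> quantize pipeline using
# llama.cpp's standard tools, run from a built llama.cpp checkout.
import subprocess

MODEL_DIR = "dots.llm1.inst"            # hypothetical local HF checkout
F16 = "dots.llm1.inst-f16.gguf"         # converted full-precision GGUF
IMATRIX = "dots.llm1.inst.imatrix"      # importance-matrix output
CALIB = "calibration.txt"               # representative calibration text

# 1. Convert the HF checkpoint to GGUF with llama.cpp's converter.
subprocess.run(["python", "convert_hf_to_gguf.py", MODEL_DIR,
                "--outfile", F16, "--outtype", "f16"], check=True)

# 2. Compute the importance matrix over the calibration text.
subprocess.run(["./build/bin/llama-imatrix", "-m", F16,
                "-f", CALIB, "-o", IMATRIX], check=True)

# 3. Static quant: no imatrix weighting.
subprocess.run(["./build/bin/llama-quantize",
                F16, "dots.llm1.inst.Q4_K_M.gguf", "Q4_K_M"], check=True)

# 4. imatrix quant: the same type, weighted by the importance matrix.
subprocess.run(["./build/bin/llama-quantize", "--imatrix", IMATRIX,
                F16, "dots.llm1.inst.i1-Q4_K_M.gguf", "Q4_K_M"], check=True)
```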
Please monitor https://hf.tst.eu/status.html for the latest status updates, or regularly check the model summary pages at https://hf.tst.eu/model#dots.llm1.inst-GGUF and https://hf.tst.eu/model#dots.llm1.base-GGUF for quants to appear.
great news!