New discussion

again GGUF?

#46 opened about 1 month ago by
kalle07

.onnx support

#42 opened 5 months ago by
arjunsah21

gguf version?

#41 opened 5 months ago by
kalle07

Hosting model w/ Triton

#39 opened 8 months ago by
here4data

GPU requirements

👀 3
#31 opened 10 months ago by
fedorn

ms_macro eval

#27 opened 11 months ago by
skfrost19

Matryoshka Embedding in v2?

1
#25 opened 11 months ago by
dulacp

Normalising embeddings

1
#24 opened 11 months ago by
wenhao89

Model Loading Problem

1
#19 opened 12 months ago by
khushwant04

Max Length of 32768

1
#18 opened 12 months ago by
hugginghugging

Is this model aligned/censored?

#17 opened about 1 year ago by
xiliny

Customized fine-tuning by users

17
#13 opened about 1 year ago by
fwj

Upload to Ollama

19
#12 opened about 1 year ago by
nonetrix

Does this work with vLLM?

#9 opened about 1 year ago by
nickandbro