--- library_name: sentence-transformers pipeline_tag: sentence-similarity tags: - sentence-transformers - sentence-similarity - feature-extraction - embedder - embedding - models - GGUF - Bert - Nomic - Gist - Granite - BGE - Jina - Qwen - text-embeddings-inference - RAG - Rerank - similarity - PDF - Parsing - Parser misc: - text-embeddings-inference language: - en - de architecture: --- # All models tested with ALLM(AnythingLLM) with LM-Studio as server, all models should be work with ollama the setup for local documents described below is allmost the same, GPT4All has only one model (nomic), and koboldcpp is not build in right now but in development
(sometimes the results are more truthful if the “chat with document only” option is used)
BTW embedder is only a part of a good RAG
give me a ❤️, if you like ;)

My short impression: Working well, all other its up to you! Some models are very similar! (jina and qwen based you can add manual to LM-Studio, set model "gear wheel" below "overide domain type")
With the same setting, these embedders found same 6-7 snippets out of 10 from a book. This means that only 3-4 snippets were different, but I didn't test it extensively.
Further tests have shown that the following models are suitable for complex tasks (German-text, but should be similar in English). Jina-DE, nomic was not that good.