Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
adlumal 
posted an update 5 days ago
Post
2385
I benchmarked embedding APIs for speed, compared local vs hosted models, and tuned USearch for sub-millisecond retrieval on 143k chunks using only CPU. The post walks through the results, trade-offs, and what I learned about embedding API terms of service.
The main motivation for using USearch is that CPU compute is cheap and easy to scale.

Blog post: https://huggingface.co/blog/adlumal/lightning-fast-vector-search-for-legal-documents

You missed the most important disadvantage of proprietary closed embedding models served in SaaS form - vendor lock-in. You stop paying for the service and you end up with almost useless vector database - you can't produce new vectors for query in RAG (so your RAG stops working), you can't switch the model and leave all the vectors to work with another free model - their latent spaces are incompatible. Only thing you can do with it is to compare already owned vectors with each other, pitty. Proprietary embedding models usage should be considered only with great care.