ONNX version of intfloat/e5-small-v2
This is a sentence-transformers model: It maps sentences & paragraphs to a N dimensional dense vector space and can be used for tasks like clustering or semantic search.
The model conversion was made with onnx-convert tool with the following parameters:
python convert.sh --model_id intfloat/e5-small-v2 --quantize QInt8 --optimize 2
There are two versions of model available:
model.onnx
- Float32 version, with optimize=2model_opt2_QInt8.onnx
- QInt8 quantized version, with optimize=2
License
Apache 2.0
- Downloads last month
- 76
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.