This Model2Vec model was created with Tokenlearn, using nomic-embed-text-v2-moe as the base model, and was trained on roughly 3.5M English and Portuguese passages. I have yet to run any formal benchmarks, but it easily outperforms potion-multilingual-128M on my custom Portuguese evaluation workload.
The output dimension is 512.
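As a quick sanity check, the sketch below confirms the 512-dimensional output and loads potion-multilingual-128M side by side. The sentence pair is illustrative only (this is not a benchmark), and the baseline repo id `minishlab/potion-multilingual-128M` is assumed:

```python
import numpy as np
from model2vec import StaticModel

# This model and the baseline mentioned above (repo id assumed)
model = StaticModel.from_pretrained("cnmoro/static-nomic-eng-ptbr")
baseline = StaticModel.from_pretrained("minishlab/potion-multilingual-128M")

# Illustrative English/Portuguese translation pair
pair = ["The weather is nice today.", "O tempo está agradável hoje."]

emb = model.encode(pair)
print(emb.shape)  # (2, 512): the 512-dimensional output

def cosine(a, b):
    # Cosine similarity between two 1-D vectors
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

base = baseline.encode(pair)
print("this model:", cosine(emb[0], emb[1]))
print("baseline:  ", cosine(base[0], base[1]))
```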
## Usage

Load this model using the `from_pretrained` method:
```python
from model2vec import StaticModel

# Load a pretrained Model2Vec model
model = StaticModel.from_pretrained("cnmoro/static-nomic-eng-ptbr")

# Compute text embeddings
embeddings = model.encode(["Example sentence"])
```
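Since the model is bilingual, a natural use is cross-lingual similarity search. The sketch below ranks Portuguese passages against an English query by cosine similarity; the query, passages, and scoring helper are illustrative and not part of the model2vec API:

```python
import numpy as np
from model2vec import StaticModel

model = StaticModel.from_pretrained("cnmoro/static-nomic-eng-ptbr")

query = "How do static embedding models work?"
passages = [
    "Modelos de embedding estáticos pré-computam um vetor por token.",
    "A receita do bolo leva farinha, ovos e açúcar.",
]

# Encode query and passages into the shared 512-dimensional space
q = model.encode([query])[0]
p = model.encode(passages)

# Rank passages by cosine similarity to the query
scores = p @ q / (np.linalg.norm(p, axis=1) * np.linalg.norm(q))
for score, passage in sorted(zip(scores, passages), reverse=True):
    print(f"{score:.3f}  {passage}")
```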
Model tree for cnmoro/static-nomic-eng-ptbr:

- Base model: FacebookAI/xlm-roberta-base
- Finetuned: nomic-ai/nomic-xlm-2048
- Finetuned: nomic-ai/nomic-embed-text-v2-moe