Tokens #1
opened by Recognizeme
Can you tell me if you are still developing the model?
Are you looking to increase the number of tokens?
I am no longer actively developing the model, but it would be pretty straightforward to train it on more tokens! See: https://github.com/JohnGiorgi/DeCLUTR. Based on the results in the paper, I would expect increasing the size of the training set to have a large positive effect on performance.
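For reference, here is a minimal sketch of how continued training on a larger corpus might be kicked off through AllenNLP's Python API, which the DeCLUTR repo builds on (assuming the repo is installed so the `declutr` package is importable). The config path, the `train_data_path` override, and the corpus path are illustrative; check the repo's README for the exact config files and expected data format.

```python
from allennlp.commands.train import train_model_from_file
from allennlp.common.util import import_module_and_submodules

# Register DeCLUTR's custom components (dataset reader, model) with AllenNLP.
import_module_and_submodules("declutr")

# Kick off training on a larger corpus. The config filename and the
# train_data_path override below are illustrative placeholders.
train_model_from_file(
    parameter_filename="training_config/declutr.jsonnet",
    serialization_dir="output",
    overrides='{"train_data_path": "path/to/your/larger/corpus.txt"}',
)
```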
That's a real shame. Your model is one of the best for getting embeddings from scientific text!