Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
36.4
TFLOPS
674
113
484
Tom Aarsen
tomaarsen
Follow
singhrishi's profile picture
ahzasp's profile picture
adamvf's profile picture
735 followers
·
117 following
https://linkedin.com/in/tomaarsen
tomaarsen
tomaarsen
tomaarsen
AI & ML interests
NLP: text embeddings, information retrieval, named entity recognition, few-shot text classification
Articles
Welcome Gemma 2 - Google's new open LLM
Jun 27
•
116
Training and Finetuning Embedding Models with Sentence Transformers v3
May 28
•
146
Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon
Apr 3
•
8
Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval
Mar 22
•
56
🪆 Introduction to Matryoshka Embedding Models
Feb 23
•
46
SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit
Dec 6, 2023
•
5
🕳️ Attention Sinks in LLMs for endless fluency
Oct 9, 2023
•
6
Organizations
tomaarsen
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
sentence-transformers/all-MiniLM-L6-v2
about 7 hours ago
Limit of number of sentences
1
#78 opened about 13 hours ago by
bantonybow
New activity in
blog-explorers/README
about 21 hours ago
[Support] Community Articles
57
#5 opened 6 months ago by
victor
New activity in
conceptofmind/minipile_embeddings
2 days ago
Dataset Viewer issue
#1 opened 2 days ago by
tomaarsen
New activity in
gowitheflow/supervised-multilingual
3 days ago
Sources
1
#3 opened 3 days ago by
tomaarsen
New activity in
jinaai/jina-reranker-v2-base-multilingual
3 days ago
HugginFace text-embeddings-inference (TEI) support
3
#26 opened 4 days ago by
leversberg
New activity in
Alibaba-NLP/gte-Qwen2-1.5B-instruct
4 days ago
Qwen 2.5 1.5B retrain?
4
#12 opened 4 days ago by
tomaarsen
New activity in
jinaai/jina-embeddings-v3
4 days ago
Unable to load the model through sentence_transformerss
8
#22 opened 4 days ago by
adi751
New activity in
Alibaba-NLP/gte-multilingual-base
5 days ago
How to fine-tune?
1
#10 opened 22 days ago by
havardox
Fix broken SentenceTransformer snippet; format code with Python format
#11 opened 5 days ago by
tomaarsen
New activity in
mixedbread-ai/mxbai-rerank-large-v1
11 days ago
Deployment using TEI
3
#7 opened 11 days ago by
WolfAssi285
New activity in
sentence-transformers/embedding-training-data
11 days ago
Issues Loading Dataset
3
#1 opened 11 months ago by
colbertv2
New activity in
sentence-transformers/distiluse-base-multilingual-cased
13 days ago
Language list
5
#2 opened about 2 years ago by
lbourdois
New activity in
mattshumer/Reflection-Llama-3.1-70B
13 days ago
🤔⚔️ David vs Goliath: How Small Models Are Shaking Up the AI Giants Without Billion-Dollar Infrastructures 😬📉
2
#59 opened 13 days ago by
gohelrakesh
New activity in
jinaai/xlm-roberta-flash-implementation
13 days ago
feat: support sentence-transformers
1
#42 opened 13 days ago by
bwang0911
New activity in
zeta-alpha-ai/Zeta-Alpha-E5-Mistral
16 days ago
Integrate with Sentence Transformers (+ third parties like LangChain/Haystack/LlamaIndex, etc.)
1
#1 opened 16 days ago by
tomaarsen
New activity in
nomic-ai/nomic-embed-text-v1
16 days ago
keeping data local
12
#12 opened 7 months ago by
tyler-rankin-opg
New activity in
nvidia/NV-Embed-v2
19 days ago
Improve model metadata
1
#2 opened 19 days ago by
tomaarsen
New activity in
Snowflake/snowflake-arctic-embed-m-v1.5
23 days ago
Uninitialised weights warning when loading with Sentence Transformers
4
#4 opened about 1 month ago by
cpierse
Specify add_pooling_layer=False via configuration instead
#5 opened 23 days ago by
tomaarsen
New activity in
nomic-ai/nomic-embed-text-v1.5
23 days ago
Slow inference performance when using nomic-embed-text-v1.5
5
#34 opened 23 days ago by
umesh-c
New activity in
nvidia/NV-Embed-v2
23 days ago
Remove ` (default)` from MTEB metadata
#1 opened 23 days ago by
tomaarsen
New activity in
jinaai/jina-embeddings-v2-base-en
26 days ago
How to finetune the model using multiple GPUs ?
4
#45 opened 26 days ago by
Space192
New activity in
ai-forever/ru-en-RoSBERTa
26 days ago
Adopt MTEB dataset naming scheme
1
#1 opened 26 days ago by
tomaarsen
New activity in
nomic-ai/nomic-embed-text-v1.5
26 days ago
nomic-embed-text running slowly
2
#32 opened about 1 month ago by
xtreme786
New activity in
Snowflake/snowflake-arctic-embed-m-v1.5
about 1 month ago
Languages?
5
#2 opened about 2 months ago by
Mazyod
New activity in
sentence-transformers/all-MiniLM-L6-v2
about 1 month ago
Similarity Search Type
1
#77 opened about 1 month ago by
frbackup
New activity in
answerdotai/answerai-colbert-small-v1
about 1 month ago
Fix a snippet
1
#1 opened about 1 month ago by
tomaarsen
New activity in
sentence-transformers/all-MiniLM-L6-v2
about 1 month ago
Trying to run this model locally, on my machine
4
#75 opened about 1 month ago by
abdulrafay97
New activity in
mixedbread-ai/deepset-mxbai-embed-de-large-v1
about 2 months ago
fix: config.json
6
#4 opened about 2 months ago by
ouz-m
New activity in
sarkii/MizoEmbed-1
about 2 months ago
Set language to "lus" for Lushai
#2 opened about 2 months ago by
tomaarsen
New activity in
Snowflake/snowflake-arctic-embed-m-v1.5
about 2 months ago
Remove ` (default)` from MTEB scores caused by an MTEB bug
#3 opened about 2 months ago by
tomaarsen
New activity in
BAAI/bge-multilingual-gemma2
about 2 months ago
Integrate with Sentence Transformers (+ third parties like LangChain/Haystack/LlamaIndex, etc.)
1
#1 opened about 2 months ago by
tomaarsen
New activity in
mixedbread-ai/deepset-mxbai-embed-de-large-v1
about 2 months ago
Fix a small typo
1
#3 opened about 2 months ago by
tomaarsen
New activity in
dunzhang/stella_en_1.5B_v5
about 2 months ago
Getting different results for the same examples provided in sample
4
#17 opened about 2 months ago by
sramakintel
New activity in
dunzhang/stella_en_400M_v5
about 2 months ago
Fix query_prompt_name variable name
#9 opened about 2 months ago by
tomaarsen
New activity in
dunzhang/stella_en_1.5B_v5
about 2 months ago
Fix query_prompt_name variable name
#15 opened about 2 months ago by
tomaarsen
New activity in
dunzhang/stella_en_1.5B_v5
2 months ago
Error when loading model KeyError: 'qwen2'
1
#11 opened 2 months ago by
longluu
New activity in
jinaai/jina-reranker-v2-base-multilingual
2 months ago
data did not match any variant of untagged enum PyPreTokenizerTypeWrapper at line 69 column 3
3
#17 opened 2 months ago by
sigridjineth
New activity in
Snowflake/snowflake-arctic-embed-m
2 months ago
Sentence Transformers integration
3
#2 opened 5 months ago by
tomaarsen
New activity in
dunzhang/stella_en_1.5B_v5
2 months ago
Model max_seq_length
6
#6 opened 2 months ago by
shuyuej
New activity in
sentence-transformers/all-MiniLM-L6-v2
2 months ago
Does the model take our data and use our data to improve itself or for some other purpose?
2
#70 opened 2 months ago by
AIProDK
New activity in
Snowflake/snowflake-arctic-embed-m
2 months ago
TypeError: Pooling.__init__() got an unexpected keyword argument 'include_prompt'
1
#13 opened 2 months ago by
Rageshhf
New activity in
dunzhang/stella_en_1.5B_v5
2 months ago
Set 1024 as default dim, update usage snippets, store prompts in config
#1 opened 2 months ago by
tomaarsen
New activity in
dunzhang/stella_en_400M_v5
2 months ago
Set 1024 as default dim, update usage snippets, store prompts in config
#1 opened 2 months ago by
tomaarsen
New activity in
mteb/leaderboard
2 months ago
New Embedding Model for MTEB - Retriever/BIER Benchmark - Applying for refresh
11
#134 opened 2 months ago by
nv-bschifferer
e5-R-mistral-7b for retrieval, apply for refreshing the results
16
#132 opened 3 months ago by
BeastyZ
New activity in
nvidia/NV-Retriever-v1
2 months ago
Remove ` (default)` that has been included due to a bug in MTEB
#1 opened 2 months ago by
tomaarsen
New activity in
SetFit/bbc-news
3 months ago
Update README.md to include correct attribution and source
1
#1 opened 3 months ago by
derekgreene
New activity in
tomaarsen/mining_demo
3 months ago
Librarian Bot: Add language metadata for dataset
#2 opened 3 months ago by
librarian-bot
New activity in
jinaai/jina-clip-v1
3 months ago
How to create embeddings in the browser?
2
#16 opened 3 months ago by
gnoel-ddh
New activity in
mteb/leaderboard
3 months ago
New model and CMTEB leaderboard refresh request
1
#130 opened 3 months ago by
lier007
New activity in
lier007/xiaobu-embedding-v2
3 months ago
Set `library_name` as Sentence Transformers
#1 opened 3 months ago by
tomaarsen
New activity in
Lajavaness/bilingual-embedding-large
3 months ago
Questions about model & architecture
5
#1 opened 3 months ago by
tomaarsen
New activity in
jinaai/jina-embeddings-v2-base-en
3 months ago
Saving and Loading the fine-tuned model
11
#24 opened 11 months ago by
maiia-bocharova
New activity in
gbyuvd/ChemEmbed-v01
3 months ago
Fascinating work!
3
#1 opened 3 months ago by
tomaarsen
New activity in
jinaai/jina-clip-v1
3 months ago
Could not locate the jinaai/jina-clip-implementation--configuration_clip.py inside jinaai/jina-clip-v1.
6
#15 opened 3 months ago by
SergeShirokov
New activity in
sentence-transformers/msmarco-msmarco-distilbert-base-tas-b
3 months ago
What is the best performant "dataset" in MS MARCO Mined Triplets?
2
#1 opened 3 months ago by
kyeongpil
New activity in
mteb/leaderboard
3 months ago
New Embedding Models | Apply for refershing the results
6
#128 opened 3 months ago by
Omartificial-Intelligence-Space
New activity in
Omartificial-Intelligence-Space/Arabic-NLi-Triplet
3 months ago
Add "sentence-transformers" tag
1
#2 opened 3 months ago by
tomaarsen
New activity in
mteb/leaderboard
3 months ago
New model and mteb leaderboard refresh request
2
#129 opened 3 months ago by
lvkaokao
Load more