Tom Aarsen

tomaarsen

https://linkedin.com/in/tomaarsen

AI & ML interests

NLP: text embeddings, information retrieval, named entity recognition, few-shot text classification

Recent Activity

liked a model about 5 hours ago

OmniGen2/OmniGen2

updated a model about 16 hours ago

tomaarsen/splade-cocondenser-msmarco-marginmse-minilm

liked a model about 16 hours ago

NeuML/bioclinical-modernbert-base-embeddings

upvoted 3 collections about 17 hours ago

Router Splade Models

The collection includes Router Splade models that can be loaded thanks to the Sparse Encoder modules of Sentence Transformers • 5 items • Updated 4 days ago • 1

Inference Free Splade Models

The collection includes Inference Free Splade models that can be loaded thanks to the Sparse Encoder modules of Sentence Transformers • 6 items • Updated 4 days ago • 2

Splade Models

The collection includes Splade models from different authors that can be loaded thanks to the Sparse Encoder modules of Sentence Transformers • 14 items • Updated 4 days ago • 1

upvoted an article 1 day ago

Article

Gemma 3n fully available in the open-source ecosystem!

By

and 7 others •

2 days ago

• 71

upvoted an article 3 days ago

Article

Nano-vLLM meets Inference Endpoints

3 days ago

• 5

upvoted a collection 3 days ago

GLiNER-X

The Multilingual Named Entity Recognition (NER) model which is capable of identifying any entity type. • 6 items • Updated 4 days ago • 15

upvoted 2 articles 4 days ago

Article

Code a simple RAG from scratch

Oct 29, 2024

• 112

Article

Introducing Cosmos Predict-2: A Foundation For Your Own World Model

By

and 2 others •

11 days ago

• 8

upvoted a paper 4 days ago

jina-embeddings-v4: Universal Embeddings for Multimodal Multilingual Retrieval

Paper • 2506.18902 • Published 4 days ago • 7

upvoted an article 8 days ago

Article

Groq on Hugging Face Inference Providers 🔥

By

and 4 others •

12 days ago

• 34

upvoted a changelog 12 days ago

Changelog

New Model Filtering Options on the Hub

12 days ago

• 53

upvoted a collection 13 days ago

BioClinical ModernBERT

3 items • Updated 15 days ago • 9

upvoted 2 papers 18 days ago

MIRIAD: Augmenting LLMs with millions of medical query-response pairs

Paper • 2506.06091 • Published 22 days ago • 8

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published 23 days ago • 62

upvoted a collection 23 days ago

Qwen3-Embedding

6 items • Updated 23 days ago • 96

upvoted a paper 23 days ago

ProRank: Prompt Warmup via Reinforcement Learning for Small Language Models Reranking

Paper • 2506.03487 • Published 24 days ago • 3

upvoted a changelog 23 days ago

Changelog

New Inference Providers Dashboard

23 days ago

• 52

upvoted an article 26 days ago

Article

Context Is Gold to Find the Gold Passage: Evaluating and Training Contextual Document Embeddings

By

and 1 other •

26 days ago

• 24

upvoted a collection 26 days ago

ConTEB models

Our models trained with the InSeNT approach. These are the checkpoints that we used to run the evaluations reported in our paper. • 2 items • Updated 26 days ago • 1

upvoted a collection 28 days ago

Amharic Text Embedding Models

Text Embedding and ColBERT models based on Amharic RoBERTa and BERT for Amharic passage retrieval • 10 items • Updated 16 days ago • 4