Tollef J

tollefj

https://folk.ntnu.no/tollefj/

AI & ML interests

Coreference resolution, span prediction, summarization, topic modeling

Recent Activity

liked a model about 2 months ago

black-forest-labs/FLUX.1-Kontext-dev-onnx

liked a model 3 months ago

ResembleAI/chatterbox

liked a model 3 months ago

microsoft/Phi-4-reasoning

liked a model about 2 months ago

black-forest-labs/FLUX.1-Kontext-dev-onnx

Updated Jun 27 • 79

liked 3 models 3 months ago

liked a model 4 months ago

nari-labs/Dia-1.6B

Text-to-Speech • Updated Jun 1 • 109k • 2.71k

upvoted an article 5 months ago

Training and Finetuning Reranker Models with Sentence Transformers v4

Article • Mar 26 • 158

upvoted a paper 5 months ago

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13 • 88

liked 3 models 5 months ago

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 523k • 1.57k

google/gemma-3-12b-it

Image-Text-to-Text • 12B • Updated Mar 21 • 389k • 496

google/gemma-3-4b-it

Image-Text-to-Text • 4B • Updated Mar 21 • 1.18M • 799

upvoted a collection 5 months ago

Gemma 3 Release

Collection

28 items • Updated 12 days ago • 477

commented on Introducing EuroBERT: A High-Performance Multilingual Encoder Model 6 months ago

Why are there so few languages involved in the training of these models? You argue that this data mix was selected "to create a corpus of European and most widely spoken languages, representing a broad range of alphabets and cultures."
But what is the relevance of other alphabets when, for example, you do not include any Nordic languages with large and high-quality datasets?

Prefixing it with "Euro" seems odd in this context. You have selected a tiny fraction of languages, so name it accordingly :-)
It would also make sense to refer to EuroEval https://euroeval.com/leaderboards/

commented on a paper 6 months ago