6 8 8

Jonas Golde

whoisjones

AI & ML interests

Data-efficient transfer learning

Recent Activity

new activity 16 days ago

whoisjones/finerweb-multilabel-classifier-xlmr-4o:Improve model card: Add pipeline tag, paper link, code, description, and usage example

authored a paper 19 days ago

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

submitted a paper 20 days ago

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

View all activity

Organizations

New activity in whoisjones/finerweb-multilabel-classifier-xlmr-4o 16 days ago

Improve model card: Add pipeline tag, paper link, code, description, and usage example

❤️ 1

#1 opened 19 days ago by

nielsr

authored a paper 19 days ago

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

Paper • 2512.13884 • Published 22 days ago • 14

submitted a paper to Daily Papers 20 days ago

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

Paper • 2512.13884 • Published 22 days ago • 14

upvoted a paper 20 days ago

FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition

Paper • 2512.13884 • Published 22 days ago • 14

updated a dataset 22 days ago

whoisjones/fiNERweb-x

Updated 22 days ago • 110

authored 5 papers 22 days ago

BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models

Paper • 2412.15978 • Published Dec 20, 2024 • 1

Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models

Paper • 2504.14366 • Published Apr 19, 2025 • 1

Question Decomposition for Retrieval-Augmented Generation

Paper • 2507.00355 • Published Jul 1, 2025 • 1

Sample-Efficient Language Modeling with Linear Attention and Lightweight Enhancements

Paper • 2511.05560 • Published Nov 4, 2025 • 1

PISA-Bench: The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models

Paper • 2510.24792 • Published Oct 27, 2025

updated a collection 22 days ago

fiNERweb

Collection

A multilingual dataset for NER covering 91 langauges and 25 scripts • 3 items • Updated 22 days ago • 1

updated a dataset 22 days ago

whoisjones/fiNERweb

Updated 22 days ago • 686 • 7

updated a Space about 2 months ago

PISA-Bench - The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models

📚

liked a dataset 2 months ago

risqaliyevds/uzbek_ner

Viewer • Updated Jun 6, 2024 • 19.6k • 25 • 3

liked a model 2 months ago

pythainlp/thainer-corpus-v2-base-model

Token Classification • 0.1B • Updated Mar 23, 2023 • 50.3k • • 14

upvoted a paper 3 months ago

What Layers When: Learning to Skip Compute in LLMs with Residual Gates

Paper • 2510.13876 • Published Oct 13, 2025 • 11

updated a dataset 3 months ago

whoisjones/fiNERweb-x-multi

Updated Sep 29, 2025 • 445

published a dataset 3 months ago

whoisjones/fiNERweb-x-multi

Updated Sep 29, 2025 • 445

Jonas Golde

AI & ML interests

Recent Activity

Organizations

whoisjones's activity

Improve model card: Add pipeline tag, paper link, code, description, and usage example

PISA-Bench - The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models