- ModernGBERT: German-only 1B Encoder Model Trained from Scratch
  Paper • 2505.13136 • Published • 22
- LSX-UniWue/ModernGBERT_1B
  Feature Extraction • 1B • Updated • 1.7k • 10
- LSX-UniWue/ModernGBERT_134M
  Feature Extraction • 0.2B • Updated • 3.35k • 5
- LSX-UniWue/LLaMmlein-Dataset
  Viewer • Updated • 838M • 1.91k • 3
AI & ML interests
German NLP and beyond
- ModernGBERT: German-only 1B Encoder Model Trained from Scratch
  Paper • 2505.13136 • Published • 22
- LSX-UniWue/LLaMmlein2Vec_7B
  Feature Extraction • 7B • Updated • 177
- LSX-UniWue/LLaMmlein2Vec_1B
  Feature Extraction • 1B • Updated
- LSX-UniWue/LLaMmlein2Vec_120M
  Feature Extraction • 0.1B • Updated • 380
https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/
- LLäMmlein: Compact and Competitive German-Only Language Models from Scratch
  Paper • 2411.11171 • Published • 8
- LSX-UniWue/LLaMmlein_7B
  Text Generation • 7B • Updated • 288 • 7
- LSX-UniWue/LLaMmlein_1B
  Text Generation • 1B • Updated • 638 • 1
- LSX-UniWue/LLaMmlein_120M
  Text Generation • 0.1B • Updated • 754
https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/