Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models
-
4
JQL: Judging Quality Across Languages
🦊Filter multilingual data to improve LLM training
-
Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models
Paper • 2505.22232 • Published • 18 -
JQL-AI/JQL-Edu-Heads
Text Ranking • Updated • 2 -
JQL-AI/JQL-LLM-Edu-Annotations
Viewer • Updated • 11.4M • 538 • 2