CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data Paper • 2601.18026 • Published Jan 25
UniSkill: A Dataset for Matching University Curricula to Professional Competencies Paper • 2603.03134 • Published Mar 3
WorkRB: A Community-Driven Evaluation Framework for AI in the Work Domain Paper • 2604.13055 • Published Mar 17
CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations Paper • 2605.26293 • Published 8 days ago • 4
CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations Paper • 2605.26293 • Published 8 days ago • 4
Scaling Reasoning can Improve Factuality in Large Language Models Paper • 2505.11140 • Published May 16, 2025 • 7
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Paper • 2504.07072 • Published Apr 9, 2025 • 9
How Do Hackathons Foster Creativity? Towards AI Collaborative Evaluation of Creativity at Scale Paper • 2503.04290 • Published Mar 6, 2025 • 1
HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings Paper • 2502.15411 • Published Feb 21, 2025 • 3
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18, 2025 • 19
SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems Paper • 2502.12927 • Published Feb 18, 2025 • 1
On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation Paper • 2502.12923 • Published Feb 18, 2025
SnakModel: Lessons Learned from Training an Open Danish Large Language Model Paper • 2412.12956 • Published Dec 17, 2024 • 2
Leveraging Large Language Models for Actionable Course Evaluation Student Feedback to Lecturers Paper • 2407.01274 • Published Jul 1, 2024 • 1
INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge Paper • 2411.19799 • Published Nov 29, 2024 • 17
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages Paper • 2411.16508 • Published Nov 25, 2024 • 10
Evidence > Intuition: Transferability Estimation for Encoder Selection Paper • 2210.11255 • Published Oct 20, 2022
SkillSpan: Hard and Soft Skill Extraction from English Job Postings Paper • 2204.12811 • Published Apr 27, 2022 • 2
Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning Paper • 2205.01381 • Published May 3, 2022