--- title: README emoji: š colorFrom: indigo colorTo: gray sdk: static pinned: false thumbnail: >- https://cdn-uploads.huggingface.co/production/uploads/65e5fd212faa026716fd27bf/P8yHyv6OZSPLpYEPE0XII.png --- # Welcome to GEM Space š«” Greetings from GEM Space, the heart of innovation behind our paper, "FRAGILE MASTERY: ARE DOMAIN-SPECIFIC TRADE-OFFS UNDERMINING ON-DEVICE LANGUAGE MODELS?". Weāre thrilled to invite you into our world of edge AI exploration! This repository, GEM_Testing_Arsenal, is a cornerstone of our efforts to redefine On-Device Language Models (ODLMs) through the Generalized Edge Model (GEM). Keep an eye out for the paper link once itās published! > Link to the Paper: [Click Here](https://arxiv.org/abs/2503.22698) --- ## About Our Paper š ***Abstract***: The deployment of On-Device Language Models (ODLMs) on resource-constrained edge devices demands a delicate balance of efficiency, memory, power, and linguistic skill across diverse tasks. In "FRAGILE MASTERY", we explore the trade-offs between domain-specific optimization and cross-domain robustness, introducing the Generalized Edge Model (GEM). GEM integrates specialization and generalization using a Sparse Cross-Attention Router (SCAR), achieving a cross-domain F1 score of 0.89 with sub-100ms latency on platforms like Raspberry Pi 4 and Pixel 6. Across 47 benchmarks spanning eight domainsāhealthcare, legal, finance, STEM, and moreāGEM boosts general-task performance by 7% over GPT-4 Lite while matching domain-specific results. With new metrics like the Domain Specialization Index (DSI) and a balanced distillation framework cutting catastrophic forgetting by 43%, this work offers a robust foundation for edge AI. [Paper Link](https://arxiv.org/abs/2503.22698) ***Architecture***:
Fig: Our Architecture simplified