Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚 By Isayoften • Aug 26, 2024 • 66
🧠 Traditional Chinese Reasoning Datasets Collection A curated collection of datasets designed to evaluate and train reasoning capabilities in Traditional Chinese across various domains. • 3 items • Updated 24 days ago • 8
📋 Eval Logs Collection Benchmark log generated with Twinkle Eval, recording the model's outputs for each prompt. • 1 item • Updated 24 days ago • 2
Granite 3.1 Language Models Collection A series of language models with 128K context length, trained by IBM and released under the Apache 2.0 license. • 9 items • Updated May 2 • 61
Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs Paper • 2410.10739 • Published Oct 14, 2024 • 2
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 40 items • Updated 4 days ago • 116
Taiwan-Legal-Bench Collection This repository offers a dataset for evaluating legal models based on Taiwan’s laws, including legal questions, provisions, and case law. • 4 items • Updated Dec 9, 2024 • 1
⚖️ Llama-3.2-Taiwan-Legal Collection Models fine-tuned from the lianghsun/Llama-3.2-Taiwan-*B model using datasets related to the laws and judgments of Taiwan. • 3 items • Updated Apr 26 • 1