Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation Paper β’ 2504.17025 β’ Published 14 days ago β’ 16
PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines Paper β’ 2504.14738 β’ Published 17 days ago β’ 5
Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction Paper β’ 2504.15266 β’ Published 16 days ago β’ 3
SilVar-Med: A Speech-Driven Visual Language Model for Explainable Abnormality Detection in Medical Imaging Paper β’ 2504.10642 β’ Published 23 days ago β’ 2
EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models Paper β’ 2504.15133 β’ Published 17 days ago β’ 21
BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation Paper β’ 2504.14538 β’ Published 18 days ago β’ 27
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Paper β’ 2504.17192 β’ Published 14 days ago β’ 105
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. β’ 46 items β’ Updated 9 days ago β’ 607
π§ Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community β’ 21 items β’ Updated 22 days ago β’ 136
Big-Math Collection This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers β’ 4 items β’ Updated 22 days ago β’ 4
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data Paper β’ 2309.11235 β’ Published Sep 20, 2023 β’ 15
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. β’ 121 items β’ Updated Jan 31, 2024 β’ 534