BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper โข 2504.18415 โข Published 10 days ago โข 41
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math Paper โข 2504.21233 โข Published 6 days ago โข 35
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & outperforms all leading quantization methods. โข 29 items โข Updated 5 days ago โข 83
๐ง Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community โข 21 items โข Updated 20 days ago โข 135
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper โข 2502.11089 โข Published Feb 16 โข 156
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper โข 2504.10479 โข Published 21 days ago โข 253
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking Paper โข 2503.00955 โข Published Mar 2 โข 27
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens Paper โข 2502.18890 โข Published Feb 26 โข 30