view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 By tomaarsen and 1 other • 25 days ago • 105
FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents Paper • 2504.13128 • Published Apr 17 • 6
AceMath Collection We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated 4 days ago • 14
CG-RAG: Research Question Answering by Citation Graph Retrieval-Augmented LLMs Paper • 2501.15067 • Published Jan 25 • 1
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 4 days ago • 119
🏟️ Long Code Arena Collection All the resources for our Long Code Arena benchmark! • 13 items • Updated Mar 14 • 6
OLMoE (November 2024) Collection Artifacts for open mixture-of-experts language models. • 13 items • Updated Apr 30 • 30
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 145
view article Article Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick By cxdu • Oct 24, 2024 • 12