🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 12 items • Updated 1 day ago • 76
MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning Paper • 2410.18035 • Published Oct 23, 2024 • 1
Gemma-2-9B-it-Advanced Collection Merges of the advanced research fine tunes of gemma-2 9B it • 3 items • Updated Oct 20, 2024 • 3
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models Paper • 2410.10818 • Published Oct 14, 2024 • 17
VideoChat-Flash Collection Faster and more powerful VideoChat. • 8 items • Updated 2 days ago • 9
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes Paper • 2410.16930 • Published Oct 22, 2024 • 8
TinySQL Collection "Convert English query to a SQL command" models and training data. • 26 items • Updated 24 days ago • 2
Know When to Fuse: Investigating Non-English Hybrid Retrieval in the Legal Domain Paper • 2409.01357 • Published Sep 2, 2024 • 3
DeepSeek-R1-ReDistill Collection Re-distilled DeepSeek R1 models • 4 items • Updated 22 days ago • 14
Social Deduction LLM (AAMAS 2025) Collection Pretrained models for "Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning" (AAMAS 2025 Version) • 3 items • Updated 10 days ago • 2
Sa2VA Model Zoo Collection Huggingace Model Zoo For Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos By Bytedance Seed CV Research • 4 items • Updated 12 days ago • 29
MoDE Collection Collection of pretrained MoDE Diffusion Policies. Variants include finetuned versions for all CALVIN benchmarks and LIBERO 90. • 9 items • Updated Dec 19, 2024 • 2
[MASK] is All You Need Collection Code, dataset, and pretrained model • 6 items • Updated 15 days ago • 9