Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
gabrielmbmb 's Collections
Audio Papers
Math Datasets
Upcycling Papers
Synthetic Data Papers
Upcycling Experiments
LLM Leaderboards

Math Datasets

updated Feb 11

A collection containing math datasets.

Upvote
-

  • nvidia/OpenMathInstruct-2

    Viewer • Updated Nov 25, 2024 • 22M • 13.3k • 173

    Note A math dataset containing 600k unique questions and around 14M question-answer pairs generated using Llama 3.1 405B Instruct and questions from GSM8K and MATH datasets. Paper is super detailed and it contains multiple ablation studies regarding the impact of the size, quality and diversity of the dataset when fine-tuning. https://huggingface.co/papers/2410.01560


  • GAIR/MathPile

    Preview • Updated Apr 3 • 102 • 185

  • GAIR/MathPile_Commercial

    Preview • Updated Apr 3 • 46 • 32

  • Vivacem/MMIQC

    Viewer • Updated Jan 20, 2024 • 2.29M • 132 • 17

  • AI-MO/NuminaMath-CoT

    Viewer • Updated Nov 25, 2024 • 860k • 3.05k • 451

  • ToheartZhang/JiuZhang3.0-Corpus-PT-CoT

    Viewer • Updated May 24, 2024 • 4.38M • 78 • 8

  • ToheartZhang/JiuZhang3.0-Corpus-PT-Tool

    Viewer • Updated May 24, 2024 • 1.6M • 73 • 1

  • HuggingFaceTB/finemath

    Viewer • Updated Feb 6 • 48.3M • 20.3k • 314

  • AI-MO/NuminaMath-1.5

    Viewer • Updated Feb 10 • 896k • 1.99k • 145
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs