izumi-lab's Collections

Miscellaneous Text Datasets for Language Models

updated Feb 20

  • izumi-lab/oscar2301-ja-filter-ja-normal

    Updated Jul 29, 2023 • 31.4M rows • 194 downloads • 5 likes

  • izumi-lab/mc4-ja

    Updated Jul 29, 2023 • 87.4M rows • 371 downloads • 6 likes

  • izumi-lab/mc4-ja-filter-ja-normal

    Updated Jul 29, 2023 • 52.6M rows • 109 downloads • 4 likes

  • izumi-lab/wikinews-ja-20230728

    Updated Jul 29, 2023 • 4.28k rows • 56 downloads • 5 likes

  • izumi-lab/wikipedia-ja-20230720

    Updated Jul 29, 2023 • 1.36M rows • 147 downloads • 12 likes

  • izumi-lab/open-text-books

    Updated Aug 1, 2023 • 150k rows • 154 downloads • 16 likes

  • izumi-lab/pile-modified

    Updated Aug 5, 2023 • 211M rows • 85 downloads • 3 likes

  • izumi-lab/wikinews-en-20230728

    Updated Jul 29, 2023 • 43.2k rows • 39 downloads • 2 likes

  • izumi-lab/wikipedia-en-20230720

    Updated Jul 29, 2023 • 6.65M rows • 147 downloads • 7 likes