Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
society-ethics 's Collections
β›”οΈπŸ”¦ Provenance, Watermarking & Deepfake Detection
πŸ—³οΈ AI for Policymakers
βš–οΈ Showing Biases in ML Systems
πŸ€¬β›” Hate Speech and Filtering
πŸͺͺπŸ”¦Model Cards
πŸ”’β˜‚οΈπŸ§‘β€πŸ€β€πŸ§‘ Privacy and AI
πŸ“Š Benchmarks and Leaderboards
πŸ“šπŸ” Understanding Datasets
πŸ’»πŸ” Understanding Models
πŸ›οΈπŸ“šπŸ–ΌοΈ Open Data: Public Domain and Open Licenses

πŸ“šπŸ” Understanding Datasets

updated Jun 4, 2024
Upvote
6

  • Runtime error
    36
    36

    Text Data Filtering

    πŸ‘


  • Runtime error
    48
    48

    Roots Search Tool

    🌸

    Search through ROOTS corpus using queries


  • Running
    38
    38

    Lilac

    🌷

    Use AI to translate text between languages


  • Build error
    98
    98

    DataMeasurementsTool

    πŸ€—


  • Running
    39
    39

    StarCoder Search

    πŸ”Ž

    Search code snippets in StarCoder dataset


  • Running
    15
    15

    OBELICS Interactive Map

    πŸ‘“

    Browse interactive OBELICS data map


  • Running
    13
    13

    GÆA / gaia / gæa

    🌏


  • Runtime error
    17
    17

    Dataset Explore

    πŸ’»


  • Running
    939
    939

    FineWeb: decanting the web for the finest text data at scale

    🍷

    Generate high-quality web text data for LLM training

Upvote
6
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs