A collection of processed CommonCrawl data as part of the BigBanyanTree initiative. Each dataset is extracted from a random 1% sample of the data.
-
big-banyan-tree/BBT_CommonCrawl_2018
Viewer • Updated • 61.5M • 124 • 3 -
big-banyan-tree/BBT_CommonCrawl_2019
Viewer • Updated • 55.8M • 45 • 2 -
big-banyan-tree/BBT_CommonCrawl_2020
Viewer • Updated • 46.9M • 38 • 2 -
big-banyan-tree/BBT_CommonCrawl_2021
Viewer • Updated • 48.5M • 166 • 2