Running 110 110 TxT360: Trillion Extracted Text π Create a large, deduplicated dataset for LLM pre-training
bunkalab/Phi-3-mini-128k-instruct-LinearBunkaScore-4.6k-DPO Text Generation β’ Updated May 30, 2024 β’ 18 β’ 2
OrdalieTech/Solon-embeddings-large-0.1 Feature Extraction β’ Updated Mar 26, 2024 β’ 11.5k β’ β’ 51
MoritzLaurer/deberta-v3-base-zeroshot-v1 Zero-Shot Classification β’ Updated Nov 29, 2023 β’ 31.5k β’ β’ 38