ALEA Mid- and Post-Train Resources Various Q&A, abstractive/extractive summarization, classification, drafting, prediction, and conversational tasks alea-institute/kl3m-sft-hearings-sample-001 Viewer • Updated Feb 1 • 3.49M • 15 • 1 alea-institute/kl3m-sft-patents Viewer • Updated Feb 3 • 1.36M • 160 • 4 alea-institute/kl3m-sft-usc-ecfr-definitions Viewer • Updated Feb 1 • 66.4k • 6 • 1 alea-institute/kl3m-data-edgar-agreements Viewer • Updated Apr 11 • 1.45M • 648 • 5
kl3m-index KL3M Dataset Indices alea-institute/kl3m-index-edgar-filings Viewer • Updated Feb 15 • 20M • 53 alea-institute/kl3m-index-edgar-filings-10-k Viewer • Updated Feb 16 • 238k • 7 alea-institute/kl3m-index-edgar-filings-10-q Viewer • Updated Feb 16 • 496k • 4 • 1 alea-institute/kl3m-index-edgar-filings-8-k Viewer • Updated Feb 16 • 1.86M • 7
kl3m KL3M models and tokenizers alea-institute/kl3m-001-32k Updated Sep 29, 2024 alea-institute/kl3m-003-64k Updated Oct 9, 2024 alea-institute/kl3m-004-128k-uncased Updated Nov 7, 2024 alea-institute/kl3m-004-128k-uncased-mlm Updated Nov 20, 2024
KL3M Embeddings alea-institute/kl3m-doc-pico-001 Fill-Mask • 0.0B • Updated Apr 11 • 4 alea-institute/kl3m-doc-pico-long-001 Feature Extraction • 0.0B • Updated Apr 11 • 9 alea-institute/kl3m-doc-pico-contracts-001 Fill-Mask • 0.0B • Updated Apr 11 • 7 alea-institute/kl3m-doc-nano-001 Feature Extraction • 0.1B • Updated Apr 11 • 7
kl3m-data alea-institute/kl3m-data-snapshot-20250324 Viewer • Updated Apr 1 • 57.8M • 302 • 1 alea-institute/kl3m-data-fr Viewer • Updated Apr 11 • 3.12M • 101 alea-institute/kl3m-data-edgar-agreements Viewer • Updated Apr 11 • 1.45M • 648 • 5 alea-institute/kl3m-data-pacer-docs Viewer • Updated Apr 11 • 1.37M • 135
ALEA Mid- and Post-Train Resources Various Q&A, abstractive/extractive summarization, classification, drafting, prediction, and conversational tasks alea-institute/kl3m-sft-hearings-sample-001 Viewer • Updated Feb 1 • 3.49M • 15 • 1 alea-institute/kl3m-sft-patents Viewer • Updated Feb 3 • 1.36M • 160 • 4 alea-institute/kl3m-sft-usc-ecfr-definitions Viewer • Updated Feb 1 • 66.4k • 6 • 1 alea-institute/kl3m-data-edgar-agreements Viewer • Updated Apr 11 • 1.45M • 648 • 5
KL3M Embeddings alea-institute/kl3m-doc-pico-001 Fill-Mask • 0.0B • Updated Apr 11 • 4 alea-institute/kl3m-doc-pico-long-001 Feature Extraction • 0.0B • Updated Apr 11 • 9 alea-institute/kl3m-doc-pico-contracts-001 Fill-Mask • 0.0B • Updated Apr 11 • 7 alea-institute/kl3m-doc-nano-001 Feature Extraction • 0.1B • Updated Apr 11 • 7
kl3m-index KL3M Dataset Indices alea-institute/kl3m-index-edgar-filings Viewer • Updated Feb 15 • 20M • 53 alea-institute/kl3m-index-edgar-filings-10-k Viewer • Updated Feb 16 • 238k • 7 alea-institute/kl3m-index-edgar-filings-10-q Viewer • Updated Feb 16 • 496k • 4 • 1 alea-institute/kl3m-index-edgar-filings-8-k Viewer • Updated Feb 16 • 1.86M • 7
kl3m-data alea-institute/kl3m-data-snapshot-20250324 Viewer • Updated Apr 1 • 57.8M • 302 • 1 alea-institute/kl3m-data-fr Viewer • Updated Apr 11 • 3.12M • 101 alea-institute/kl3m-data-edgar-agreements Viewer • Updated Apr 11 • 1.45M • 648 • 5 alea-institute/kl3m-data-pacer-docs Viewer • Updated Apr 11 • 1.37M • 135
kl3m KL3M models and tokenizers alea-institute/kl3m-001-32k Updated Sep 29, 2024 alea-institute/kl3m-003-64k Updated Oct 9, 2024 alea-institute/kl3m-004-128k-uncased Updated Nov 7, 2024 alea-institute/kl3m-004-128k-uncased-mlm Updated Nov 20, 2024