Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated 5 days ago • 73
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 20 items • Updated 14 days ago • 126
👩💻 OlympicCoder Collection Reasoning datasets and models for competitive coding • 4 items • Updated Mar 11 • 16
olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 4 items • Updated 27 days ago • 104
mrm8488/modernbert-embed-base-ft-sts-spanish-matryoshka-768-64 Sentence Similarity • Updated Jan 10 • 986 • 2