Models, tools and other research artifacts used for the training of the first series of Pleias models.

PleIAs
Company
AI & ML interests: Open Science LLMs
Organization Card
PleIAs is a private French AI lab training the next generation of language models for document processing.
PleIAs is committed to open science and has coordinated the release of some of the largest open corpora for pre-training.
For more information, visit our website: https://pleias.fr/
Contact us: [email protected]
Collections: 7
Spaces: 5
Models (29)
PleIAs/pleias_wikidata_2 • 84
PleIAs/350m_fiction • 26
PleIAs/350m_wikidata_low_lr_550k • 10
PleIAs/pleias_wikidata • 176
PleIAs/document_1 • 30
PleIAs/Pleias-Pico • 199 • 31
PleIAs/Pleias-350m-Preview • 687 • 18
PleIAs/KaribuAI • Text Classification • 27 • 3
PleIAs/pleias_350m_rag • 294
PleIAs/pleias_3b_literature • 34
Datasets (50)
PleIAs/post-ocr • 3.4k • 6
PleIAs/Math-PDF • 7.57k • 35
PleIAs/Medical-Commons • 3.56M • 1.08k • 1
PleIAs/French-PD-diverse • 307k • 368 • 2
PleIAs/common_corpus • 470M • 38.1k • 245
PleIAs/verse-wikisource • 208k • 81 • 1
PleIAs/KaribuAI • 447 • 53 • 1
PleIAs/RAG-Evals • 192
PleIAs/FrenchCompariaCategorised • 38.8k • 74
PleIAs/RAG-Resources • 127 • 2