Models specialized in extracting structured information (JSON) from text, PDFs, scans, spreadsheets, etc.
AI & ML interests
Interactive NLP development
Recent Activity
Organization Card
We are a startup building the NuExtract Platform.
We also develop open-source Information Extraction foundation models that we share here. They are often SOTA in their category, and always under MIT license; use them without restrictions π.
spaces
6
Sleeping
36
NuMarkdown 8b Thinking
π
Reasoning model specialized for OCR/Markdown generation.
Runtime error
13
NuExtract 2.0
π
Space for numind/NuExtract-2.0-4B
Runtime error
77
NuExtract 1.5
π
Playground for NuExtract-v1.5
Running
on
T4
35
NuNER_Zero
π»
Identify named entities in text
Paused
71
NuExtract
π
models
34

numind/NuMarkdown-8B-Thinking-GGUF
8B
β’
Updated
β’
907
β’
1

numind/NuExtract-2.0-8B-GGUF
Image-Text-to-Text
β’
8B
β’
Updated
β’
530
β’
1

numind/NuExtract-2.0-4B-GGUF
Image-Text-to-Text
β’
3B
β’
Updated
β’
362
β’
1

numind/NuExtract-2.0-2B-GGUF
Image-Text-to-Text
β’
2B
β’
Updated
β’
387

numind/NuMarkdown-8B-Thinking
Image-to-Text
β’
8B
β’
Updated
β’
38k
β’
215

numind/NuExtract-2.0-8B-GPTQ
Image-Text-to-Text
β’
3B
β’
Updated
β’
108
β’
4

numind/NuExtract-2.0-8B
Image-Text-to-Text
β’
8B
β’
Updated
β’
3.59k
β’
39

numind/NuExtract-2.0-4B
Image-Text-to-Text
β’
4B
β’
Updated
β’
7.34k
β’
21

numind/NuExtract-2.0-2B
Image-Text-to-Text
β’
2B
β’
Updated
β’
4.48k
β’
30

numind/NuExtract-1.5
Text Generation
β’
4B
β’
Updated
β’
159k
β’
239