Models specialized in extracting structured information (JSON) from text, PDFs, scans, spreadsheets, etc.
AI & ML interests
Interactive NLP development
Recent Activity
View all activity
Organization Card
We are a startup building the NuExtract Platform.
We also develop open-source Information Extraction foundation models that we share here. They are often SOTA in their category, and always under MIT license; use them without restrictions 🙂.
spaces
6
Running
30
NuMarkdown 8b Thinking
👁
Reasoning model specialized for OCR/Markdown generation.
Runtime error
11
NuExtract 2.0
🚀
Space for numind/NuExtract-2.0-4B
Runtime error
77
NuExtract 1.5
👀
Playground for NuExtract-v1.5
Running
on
T4
35
NuNER_Zero
💻
Identify and highlight key entities in text
Paused
71
NuExtract
👀
models
30

numind/NuMarkdown-8B-Thinking
Image-to-Text
•
8B
•
Updated
•
7.02k
•
188

numind/NuExtract-2.0-8B-GPTQ
Image-Text-to-Text
•
3B
•
Updated
•
259
•
4

numind/NuExtract-2.0-8B
Image-Text-to-Text
•
8B
•
Updated
•
3.54k
•
30

numind/NuExtract-2.0-4B
Image-Text-to-Text
•
4B
•
Updated
•
2.37k
•
18

numind/NuExtract-2.0-2B
Image-Text-to-Text
•
2B
•
Updated
•
4.45k
•
29

numind/NuExtract-1.5
Text Generation
•
4B
•
Updated
•
104k
•
236

numind/NuExtract-2.0-4B-GPTQ
Image-Text-to-Text
•
1B
•
Updated
•
143
•
2

numind/NuExtract-2-1B-experimental
Text Generation
•
0.9B
•
Updated
•
8
•
1

numind/NuExtract-2-2B-experimental
Text Generation
•
2B
•
Updated
•
32
•
8

numind/NuExtract-2-4B-experimental
Text Generation
•
4B
•
Updated
•
11
•
3