332
OCR
🍍
olmocr / nanonets ocr / qwen2vl ocr / aya vision / rolmocr
Image-Text-to-Text ~ Demo's
olmocr / nanonets ocr / qwen2vl ocr / aya vision / rolmocr
camel doc ocr / core ocr / docscope ocr / monkey ocr
behemoth-3b / skycaptioner /spacethinker / spaceom / coreocr
nanonets ocr / monkey ocr / typhoon ocr / smoldocling
Florence-2-large / Florence-2-base
cosmos reason1 / docscopeocr / visionocr / captioner relaxed
qwen2.5-vl-7b / qwen2.5-vl-3b / abliterated-caption-it
thinking / ocr / reasoning
ocr / thinking - vlm
Multimodal Models [LFM2-VL]
experiment with the tiny vlms here