-
facebook/vjepa2-vitl-fpc64-256
Video Classification • 0.3B • Updated • 71.9k • 171 -
microsoft/xclip-base-patch32
Video Classification • 0.2B • Updated • 190k • 106 -
MCG-NJU/videomae-base
Video Classification • 94.2M • Updated • 91.7k • 49 -
OpenGVLab/VideoMAEv2-Base
Video Classification • 86.2M • Updated • 12.2k • 9
Alban NYANTUDRE
anyantudre
AI & ML interests
ML Engineer 👨🏾💻| Deep Learning (Vision, Language, Speech)
Recent Activity
updated
a collection
25 days ago
Video-models
liked
a model
25 days ago
facebook/PE-Core-L14-336
updated
a collection
25 days ago
Video-models
Organizations
Mooré - Burkina Faso 🇧🇫
Let’s help Mooré language shine in the world of AI 💛🇧🇫
-
anyantudre/MooreSpeechCorpora
Viewer • Updated • 5.54k • 6 • 2 -
Sleeping2
Moore Language Space
📚2Demo Space for Mooré language TTS, ASR and translation
-
anyantudre/moore-speech-contes
Viewer • Updated • 5.96k • 5 • 1 -
Running1
Moore translation Leaderboard
🚗1Text2text Machine Translation for Moore language
Spaces ❤️
My favorite Spaces.
-
Running on CPU Upgrade955
Open VLM Leaderboard
🌎955VLMEvalKit Evaluation Results Collection
-
Running on ZeroFeatured420
moondream1
🌔420Generate code from text prompts
-
Runtime error20
Ovis2 1B
🦫20Small model can do big things.
-
Running on Zero4
VQA Autonomous Driving SmolVLM2
🌖4Visual Question Answering - Autonomous Driving - SmolVLM2
OCR
-
Running on ZeroFeatured260
granite-docling-258M demo
📝260Convert images to structured text and answer questions
-
Runtime error36
Multimodal RAG with Granite Vision
🚀36RAG example using Granite [vision, embedding, instruct]
-
ibm-granite/granite-docling-258M
Image-Text-to-Text • 0.3B • Updated • 190k • 1.07k -
deepseek-ai/DeepSeek-OCR
Image-Text-to-Text • 3B • Updated • 3.37M • 3.03k
🤏🏾 VLMs & LLMs
My favorite small LLMs and VLMs.
-
OpenGVLab/InternVL3-1B
Image-Text-to-Text • 0.9B • Updated • 121k • 77 -
vikhyatk/moondream2
Image-Text-to-Text • 2B • Updated • 2.89M • 1.36k -
microsoft/Florence-2-base
Image-Text-to-Text • 0.2B • Updated • 614k • 327 -
HuggingFaceTB/SmolVLM2-256M-Video-Instruct
Image-Text-to-Text • 0.3B • Updated • 167k • 90
Video-models
-
facebook/vjepa2-vitl-fpc64-256
Video Classification • 0.3B • Updated • 71.9k • 171 -
microsoft/xclip-base-patch32
Video Classification • 0.2B • Updated • 190k • 106 -
MCG-NJU/videomae-base
Video Classification • 94.2M • Updated • 91.7k • 49 -
OpenGVLab/VideoMAEv2-Base
Video Classification • 86.2M • Updated • 12.2k • 9
OCR
-
Running on ZeroFeatured260
granite-docling-258M demo
📝260Convert images to structured text and answer questions
-
Runtime error36
Multimodal RAG with Granite Vision
🚀36RAG example using Granite [vision, embedding, instruct]
-
ibm-granite/granite-docling-258M
Image-Text-to-Text • 0.3B • Updated • 190k • 1.07k -
deepseek-ai/DeepSeek-OCR
Image-Text-to-Text • 3B • Updated • 3.37M • 3.03k
Mooré - Burkina Faso 🇧🇫
Let’s help Mooré language shine in the world of AI 💛🇧🇫
-
anyantudre/MooreSpeechCorpora
Viewer • Updated • 5.54k • 6 • 2 -
Sleeping2
Moore Language Space
📚2Demo Space for Mooré language TTS, ASR and translation
-
anyantudre/moore-speech-contes
Viewer • Updated • 5.96k • 5 • 1 -
Running1
Moore translation Leaderboard
🚗1Text2text Machine Translation for Moore language
🤏🏾 VLMs & LLMs
My favorite small LLMs and VLMs.
-
OpenGVLab/InternVL3-1B
Image-Text-to-Text • 0.9B • Updated • 121k • 77 -
vikhyatk/moondream2
Image-Text-to-Text • 2B • Updated • 2.89M • 1.36k -
microsoft/Florence-2-base
Image-Text-to-Text • 0.2B • Updated • 614k • 327 -
HuggingFaceTB/SmolVLM2-256M-Video-Instruct
Image-Text-to-Text • 0.3B • Updated • 167k • 90
Spaces ❤️
My favorite Spaces.
-
Running on CPU Upgrade955
Open VLM Leaderboard
🌎955VLMEvalKit Evaluation Results Collection
-
Running on ZeroFeatured420
moondream1
🌔420Generate code from text prompts
-
Runtime error20
Ovis2 1B
🦫20Small model can do big things.
-
Running on Zero4
VQA Autonomous Driving SmolVLM2
🌖4Visual Question Answering - Autonomous Driving - SmolVLM2