Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models By nvidia and 3 others • Jul 18 • 47
Hf-native ColVision Models Collection Models that can be used with the native transformers 🤗 implementation instead of colpali-engine. • 3 items • Updated Apr 16 • 7
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 511
Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen • Mar 26 • 158
Running • 3.11k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLMs on large GPU clusters
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 242
Article Self-Hosting LLaMA 3.1 70B (or any ~70B LLM) Affordably By abhinand • Aug 20, 2024 • 21
ColPali: Efficient Document Retrieval with Vision Language Models Paper • 2407.01449 • Published Jun 27, 2024 • 51
Post 5686 I have put together a notebook on Multimodal RAG, where we do not process the documents with hefty pipelines but natively use:
- vidore/colpali for retrieval 📖 it doesn't need indexing with image-text pairs but just images!
- Qwen/Qwen2-VL-2B-Instruct for generation 💬 directly feed images as is to a vision language model with no processing to text!
I used ColPali implementation of the new 🐭 Byaldi library by @bclavie 🤗 https://github.com/answerdotai/byaldi
Link to notebook: https://github.com/merveenoyan/smol-vision/blob/main/ColPali_%2B_Qwen2_VL.ipynb
Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5, 2024 • 288
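The retrieval step in the Multimodal RAG post above ranks page images directly against a text query. As a rough illustration of how that ranking can work, here is a minimal sketch of late-interaction (MaxSim) scoring, the scheme the ColPali paper describes. This is not the Byaldi or colpali-engine API: the function names and the tiny 2-dimensional toy vectors are hypothetical, standing in for the per-token query embeddings and per-patch page embeddings a real model would produce.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Plain dot product between two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def maxsim_score(query_vecs: Sequence[Vector], page_vecs: Sequence[Vector]) -> float:
    """Late-interaction score: for each query vector, take its best-matching
    page vector (max dot product), then sum those maxima over the query."""
    return sum(max(dot(q, p) for p in page_vecs) for q in query_vecs)


def rank_pages(
    query_vecs: Sequence[Vector], pages: Sequence[Sequence[Vector]]
) -> List[Tuple[int, float]]:
    """Return (page_index, score) pairs sorted from best to worst match."""
    scored = [(i, maxsim_score(query_vecs, page)) for i, page in enumerate(pages)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy embeddings (assumption: real embeddings are much larger
# and come from the model, not written by hand).
query = [[1.0, 0.0], [0.0, 1.0]]        # two "query token" vectors
pages = [
    [[1.0, 0.0], [0.5, 0.5]],           # page 0: two "patch" vectors
    [[0.0, 1.0], [1.0, 1.0]],           # page 1: two "patch" vectors
]

ranking = rank_pages(query, pages)
best_page_index = ranking[0][0]
```

In a pipeline like the one in the post, the top-ranked page image would then be passed as is to the vision language model for generation, with no intermediate conversion to text.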