NanoBEIR 🍺 Collection A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 26
reproducing-cross-encoders Collection A set of cross-encoders trained from various backbones and losses for equal comparison • 55 items • Updated 22 days ago • 4
ToMMeR -- Efficient Entity Mention Detection from Large Language Models Paper • 2510.19410 • Published Oct 22, 2025 • 1
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 52
Seq vs Seq: An Open Suite of Paired Encoders and Decoders Paper • 2507.11412 • Published Jul 15, 2025 • 31
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published Sep 30, 2025 • 549
DIP: Unsupervised Dense In-Context Post-training of Visual Representations Paper • 2506.18463 • Published Jun 23, 2025 • 21