Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 22 days ago • 83
Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular • 23 days ago • 108
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models Paper • 2512.07783 • Published Dec 8, 2025 • 36
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing Paper • 2512.02589 • Published Dec 2, 2025 • 68
Wikontic: Constructing Wikidata-Aligned, Ontology-Aware Knowledge Graphs with Large Language Models Paper • 2512.00590 • Published Nov 29, 2025 • 45
CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning Paper • 2511.18659 • Published Nov 24, 2025 • 19
O-Mem: Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents Paper • 2511.13593 • Published Nov 17, 2025 • 25
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe Paper • 2511.16334 • Published Nov 20, 2025 • 92
Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 3 items • Updated 17 days ago • 14
Article Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks • Nov 21, 2025 • 24
GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning Paper • 2511.11653 • Published Nov 10, 2025 • 55
Qari-OCR: A High-Accuracy Model for Arabic Optical Character Collection Built on the powerful Qwen2 VL 2B and fine-tuned on an Arabic OCR dataset, Qari v0.1 de • 7 items • Updated Jun 25, 2025 • 12
Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs Paper • 2511.16664 • Published Nov 20, 2025 • 26
LightOnOCR Collection The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR • 7 items • Updated Nov 13, 2025 • 15
RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services Paper • 2511.07070 • Published Nov 10, 2025 • 19
Jan-v2-VL Collection Jan-v2-VL: a family of VLM focused on reliable, many-step task execution. • 8 items • Updated 10 days ago • 38