MatheMagic: Generating Dynamic Mathematics Benchmarks Robust to Memorization Paper • 2510.05962 • Published Oct 7
DocHPLT: A Massively Multilingual Document-Level Translation Dataset Paper • 2508.13079 • Published Aug 18 • 1
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models Paper • 2409.17892 • Published Sep 26, 2024 • 2
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Paper • 2503.10267 • Published Mar 13 • 2