Scaling DoRA: High-Rank Adaptation via Factored Norms and Fused Kernels Paper • 2603.22276 • Published 22 days ago • 14
ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding Paper • 2603.27064 • Published 18 days ago • 26
SPASM: Stable Persona-driven Agent Simulation for Multi-turn Dialogue Generation Paper • 2604.09212 • Published 5 days ago • 1
Polyglot Teachers: Evaluating Language Models for Multilingual Synthetic Data Generation Paper • 2604.11290 • Published 2 days ago • 1
SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences? Paper • 2604.10718 • Published 3 days ago • 2
Learning Long-term Motion Embeddings for Efficient Kinematics Generation Paper • 2604.11737 • Published 2 days ago • 2
TorchUMM: A Unified Multimodal Model Codebase for Evaluation, Analysis, and Post-training Paper • 2604.10784 • Published 3 days ago • 5
Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration Paper • 2604.11446 • Published 2 days ago • 3
SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context Paper • 2604.11716 • Published 2 days ago • 3
Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind Paper • 2604.11666 • Published 2 days ago • 3
Eliciting Medical Reasoning with Knowledge-enhanced Data Synthesis: A Semi-Supervised Reinforcement Learning Approach Paper • 2604.11547 • Published 2 days ago • 4
Zero-shot World Models Are Developmentally Efficient Learners Paper • 2604.10333 • Published 4 days ago • 6
Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models Paper • 2604.02340 • Published 4 days ago • 6
General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks Paper • 2604.11778 • Published 2 days ago • 6
SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding Paper • 2604.09557 • Published Feb 10 • 8