DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published 5 days ago • 41
Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification Paper • 2512.16921 • Published 17 days ago • 7
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 17 days ago • 82
Insight Miner: A Time Series Analysis Dataset for Cross-Domain Alignment with Natural Language Paper • 2512.11251 • Published 24 days ago • 6
DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models Paper • 2512.15713 • Published 18 days ago • 16
OmniPSD: Layered PSD Generation with Diffusion Transformer Paper • 2512.09247 • Published 26 days ago • 46
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models Paper • 2511.08577 • Published Nov 11, 2025 • 105
view article Article Optimizing Mixture-of-Experts Training: A Cost-Effective, Two-Sided Approach Sep 30, 2025 • 3
Follow the Flow: Fine-grained Flowchart Attribution with Neurosymbolic Agents Paper • 2506.01344 • Published Jun 2, 2025 • 6
THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning Paper • 2509.13761 • Published Sep 17, 2025 • 16
Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images Paper • 2509.07966 • Published Sep 9, 2025 • 4
Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents Paper • 2509.09265 • Published Sep 11, 2025 • 47
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10, 2025 • 190
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training Paper • 2509.03403 • Published Sep 3, 2025 • 22
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR Paper • 2509.02522 • Published Sep 2, 2025 • 25