Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination Paper • 2507.10532 • Published Jul 14, 2025 • 89
Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs Paper • 2505.15210 • Published May 21, 2025 • 18
view article Article OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve May 20, 2025 • 55
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 9 days ago • 250
SIFT: Grounding LLM Reasoning in Contexts via Stickers Paper • 2502.14922 • Published Feb 19, 2025 • 32
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 3 days ago • 549