DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization Paper • 2508.14460 • Published 5 days ago • 74
Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels By drbh and 1 other • 7 days ago • 37
Article MCP for Research: How to Connect AI to Research Tools By dylanebert • 7 days ago • 32
Conformal Prediction of Classifiers with Many Classes based on Noisy Labels Paper • 2501.12749 • Published Jan 22 • 1
Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning Paper • 2508.08221 • Published 13 days ago • 41
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published 16 days ago • 157
Article Build an AI Shopping Assistant with Gradio MCP Servers By freddyaboulton • 25 days ago • 50
gpt-oss Collection OpenAI's gpt-oss-20b and gpt-oss-120b are here! The powerful open models are available in GGUF, original, and 4-bit formats. • 12 items • Updated 3 days ago • 26
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated 18 days ago • 316
Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 20 days ago • 473
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation Paper • 2506.20639 • Published Jun 25 • 29
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 27 days ago • 159
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving Paper • 2507.23726 • Published 24 days ago • 108
Article LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs By davidberenstein1957 and 3 others • Jul 2 • 16