Cong Yu

Benyucong

https://benyucong.github.io/

AI & ML interests

Machine Learning Systems, Database, Quantum Computing

Recent Activity

liked a model 3 months ago

deepseek-ai/DeepSeek-OCR

upvoted a paper about 2 months ago

VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks

Paper • 2511.04662 • Published Nov 6, 2025 • 34

upvoted a paper 2 months ago

A Survey of Data Agents: Emerging Paradigm or Overstated Hype?

Paper • 2510.23587 • Published Oct 27, 2025 • 65

upvoted 9 papers 3 months ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22, 2025 • 114

Fine-Tuning Large Language Models on Quantum Optimization Problems for Circuit Generation

Paper • 2504.11109 • Published Apr 15, 2025 • 2

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 141

Quantum Verifiable Rewards for Post-Training Qiskit Code Assistant

Paper • 2508.20907 • Published Aug 28, 2025 • 1

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 40

QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL

Paper • 2510.00967 • Published Oct 1, 2025 • 11

upvoted a paper 4 months ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1, 2025 • 76

upvoted a paper 6 months ago

Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs

Paper • 2507.09477 • Published Jul 13, 2025 • 86

upvoted a collection 8 months ago

Qwen3

Collection • 84 items • Updated 8 days ago • 1.54k

upvoted an article 10 months ago

What is test-time compute and how to scale it?

Article • Feb 6, 2025 • 110

upvoted an article over 1 year ago

Introduction to 3D Gaussian Splatting

Article • Sep 18, 2023 • 123

upvoted a paper almost 2 years ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 111

upvoted a paper about 2 years ago

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 260