
SihengLi

Siheng99

SihengLi99

AI & ML interests

Artificial Intelligence

Recent Activity

upvoted 2 papers 4 days ago

Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling

Paper • 2512.23959 • Published 8 days ago • 88

Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding

Paper • 2512.17220 • Published 19 days ago • 108

upvoted a paper 3 months ago

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

Paper • 2510.03222 • Published Oct 3, 2025 • 75

authored a paper 3 months ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23, 2025 • 67

upvoted a paper 3 months ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23, 2025 • 67

upvoted a paper 6 months ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19, 2025 • 134

updated a collection 7 months ago

🌸RePO

Collection

RePO: Replay-Enhanced Policy Optimization • 6 items • Updated Jun 6, 2025

updated a model 7 months ago

Siheng99/Qwen3-1.7B-DeepMath-1024samples-RePO

Text Generation • 2B • Updated Jun 6, 2025 • 4

published a model 7 months ago

Siheng99/Qwen3-1.7B-DeepMath-1024samples-RePO

Text Generation • 2B • Updated Jun 6, 2025 • 4

updated a model 7 months ago

Siheng99/Qwen3-1.7B-DeepMath-1024samples-GRPO

Text Generation • 2B • Updated Jun 6, 2025 • 5

published a model 7 months ago

Siheng99/Qwen3-1.7B-DeepMath-1024samples-GRPO

Text Generation • 2B • Updated Jun 6, 2025 • 5

updated a model 7 months ago

Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-RePO

Text Generation • 8B • Updated Jun 6, 2025 • 4

published a model 7 months ago

Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-RePO

Text Generation • 8B • Updated Jun 6, 2025 • 4

updated a model 7 months ago

Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-GRPO

Text Generation • 8B • Updated Jun 6, 2025 • 6

published a model 7 months ago

Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-GRPO

Text Generation • 8B • Updated Jun 6, 2025 • 6
