4 3

Jack

SixPlusSeven13

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

upvoted a paper 2 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

upvoted a paper 2 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

View all activity

Organizations

None yet

upvoted a paper 16 days ago

ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

Paper • 2512.13586 • Published 16 days ago • 87

upvoted 2 papers 2 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28, 2025 • 19

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 83

New activity in AgentGym/AgentGym-RL-Data-ID 4 months ago

Upload webarena_train.json

#3 opened 4 months ago by

SixPlusSeven13

New activity in AgentGym/AgentTraj-L 4 months ago

Update sciworld_train.json

#3 opened 4 months ago by

SixPlusSeven13

New activity in AgentGym/AgentEval 4 months ago

Upload 2 files

#2 opened 4 months ago by

SixPlusSeven13

New activity in AgentGym/AgentTraj-L 4 months ago

Upload searchqa_train.json

#2 opened 4 months ago by

SixPlusSeven13

Upload searchqa_train.json

#2 opened 4 months ago by

SixPlusSeven13

New activity in AgentGym/AgentEval 4 months ago

Upload 2 files

#2 opened 4 months ago by

SixPlusSeven13

Jack

AI & ML interests

Recent Activity

Organizations

SixPlusSeven13's activity

Upload webarena_train.json

Update sciworld_train.json

Upload 2 files

Upload searchqa_train.json

Upload searchqa_train.json

Upload 2 files

🎉 Free Image Generator Now Available!