Le Yu's picture

Le Yu

vanillaOVO

·

https://yule-buaa.github.io/

yule-BUAA

AI & ML interests

None yet

Recent Activity

upvoted a paper 27 days ago

Agentic Reinforced Policy Optimization

upvoted a paper about 1 month ago

Group Sequence Policy Optimization

authored a paper about 1 month ago

RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback

View all activity

Organizations

None yet

Collections 1

Papers 10

arxiv:2507.15024

arxiv:2506.01939

arxiv:2505.10527

arxiv:2505.09388

models 16

vanillaOVO/WizardLM-13B-V1.2

Text Generation • Updated Jun 20, 2024 • 4

vanillaOVO/WizardCoder-Python-13B-V1.0

Text Generation • Updated Jun 20, 2024 • 6

vanillaOVO/WizardMath-13B-V1.0

Text Generation • Updated Jun 20, 2024 • 128 • 1

vanillaOVO/WizardLM-7B-V1.0

Text Generation • Updated Jun 20, 2024 • 77 • 1

vanillaOVO/WizardMath-7B-V1.0

Text Generation • Updated Jun 20, 2024 • 4

vanillaOVO/WizardCoder-Python-7B-V1.0

Text Generation • Updated Jun 20, 2024 • 1.03k • 1

vanillaOVO/roberta_base_glue_ckpts

Updated Apr 9, 2024 • 1

vanillaOVO/supermario_v4

Text Generation • 7B • Updated Apr 4, 2024 • 14 • 1

vanillaOVO/supermario_v3

Text Generation • 7B • Updated Apr 4, 2024 • 14

vanillaOVO/supermario_v2

Text Generation • 7B • Updated Apr 4, 2024 • 9 • 1

datasets 0

None public yet