Wenkai Yang

Keven16

https://keven980716.github.io/

keven980716

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

upvoted a paper about 1 month ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

upvoted a paper about 1 month ago

NVIDIA Nemotron 3: Efficient and Open Intelligence

Organizations

None yet

upvoted a paper 3 days ago

DARC: Decoupled Asymmetric Reasoning Curriculum for LLM Evolution

Paper • 2601.13761 • Published 5 days ago • 15

upvoted 3 papers about 1 month ago

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 35

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published Dec 24, 2025 • 35

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published Dec 18, 2025 • 94

upvoted a paper about 2 months ago

Mixture of Horizons in Action Chunking

Paper • 2511.19433 • Published Nov 24, 2025 • 18

upvoted a paper 3 months ago

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Paper • 2510.27623 • Published Oct 31, 2025 • 13

commented on a paper 3 months ago

Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning

Paper • 2510.24320 • Published Oct 28, 2025 • 21

authored a paper 3 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16, 2025 • 40

upvoted 2 papers 3 months ago

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Paper • 2502.12459 • Published Feb 18, 2025 • 2

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16, 2025 • 40

updated a collection 3 months ago

LaSeR

Collection

Models from the paper "LaSeR: Reinforcement Learning with Last-Token Self-Rewarding" • 5 items • Updated Oct 17, 2025 • 1

commented on a paper 3 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16, 2025 • 40

published a dataset 3 months ago

Keven16/LaSeR_training_data

Viewer • Updated Oct 16, 2025 • 104k • 46 • 2

published 2 models 3 months ago

Keven16/ORZ-7B-LaSeR

8B • Updated Oct 15, 2025 • 3 • 1

Keven16/OctoThinker-3B-Short-LaSeR

4B • Updated Oct 15, 2025 • 3
