
Devin Thang

winvswon78

devininthelab

AI & ML interests

None yet

Recent Activity

new activity 22 days ago

stabilityai/stable-video-diffusion-img2vid-xt: Can SVD be use with DDPMScheduler?

upvoted a paper about 1 month ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

commented on an article about 2 months ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment


Organizations

New activity in stabilityai/stable-video-diffusion-img2vid-xt 22 days ago

Can SVD be use with DDPMScheduler?

#128 opened 22 days ago by winvswon78

upvoted a paper about 1 month ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 92

commented on Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment about 2 months ago

Same question

updated a dataset about 2 months ago

winvswon78/objaverse_1k_human_furniture

Viewer • Updated Nov 10, 2025 • 2k • 1

published a dataset about 2 months ago

winvswon78/objaverse_1k_human_furniture

Viewer • Updated Nov 10, 2025 • 2k • 1

updated a dataset about 2 months ago

winvswon78/objaverse_furniture_human_single_mask

Preview • Updated Nov 10, 2025 • 2

published a dataset about 2 months ago

winvswon78/objaverse_furniture_human_single_mask

Preview • Updated Nov 10, 2025 • 2

updated a dataset about 2 months ago

winvswon78/objaverse_human_furniture_2k

Viewer • Updated Nov 9, 2025 • 15.8k • 4

published a dataset about 2 months ago

winvswon78/objaverse_human_furniture_2k

Viewer • Updated Nov 9, 2025 • 15.8k • 4

updated a dataset about 2 months ago

winvswon78/t2v_epipolar_dpo

Updated Nov 8, 2025

published a dataset about 2 months ago

winvswon78/t2v_epipolar_dpo

Updated Nov 8, 2025

upvoted an article 3 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025

updated a model 4 months ago

winvswon78/Qwen2.5-Math-1.5B-GRPO

Updated Sep 19, 2025

published 2 models 4 months ago

winvswon78/Qwen2.5-Math-1.5B-GRPO

Updated Sep 19, 2025

winvswon78/Qwen2-0.5B-GRPO-test

Updated Sep 19, 2025

upvoted an article 4 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025 • 267

New activity in lmms-lab/EMMA 4 months ago

[bot] Conversion to Parquet

#1 opened 5 months ago by parquet-converter

upvoted a paper 5 months ago

Reconstructing 4D Spatial Intelligence: A Survey

Paper • 2507.21045 • Published Jul 28, 2025 • 35

updated a dataset 5 months ago

winvswon78/emma_stone

Viewer • Updated Aug 5, 2025 • 64 • 54

published a dataset 5 months ago

winvswon78/emma_stone

Viewer • Updated Aug 5, 2025 • 64 • 54
