[Fine-tuning] 🚀SFT/DPO/GRPO support!

#20

by study-hjt - opened Mar 28

Mar 28

•

currently only the training of the thinker part is supported... (text/audio/image/video -> text)

study-hjt changed discussion title from 🚀SFT/DPO/GRPO support! to [Fine-tuning] 🚀SFT/DPO/GRPO support! Apr 25

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment