[Fine-tuning] 🚀SFT/DPO/GRPO support!

#20
by study-hjt - opened

currently only the training of the thinker part is supported... (text/audio/image/video -> text)

here ~ 😊
https://github.com/modelscope/ms-swift/pull/3613

study-hjt changed discussion title from 🚀SFT/DPO/GRPO support! to [Fine-tuning] 🚀SFT/DPO/GRPO support!

Sign up or log in to comment