xirena's picture

xirena

xirena
·

AI & ML interests

DPO, PPO, Pre-training, Fine-tuning, and RLHF Training.

Organizations

Newstar Research ASIA's profile picture