On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published 17 days ago • 157
view article Article ChatML vs Harmony: Understanding the new Format from OpenAI 🔍 By kuotient • 15 days ago • 27
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation • 15B • Updated 25 days ago • 42.8k • 86
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 Image-Text-to-Text • 402B • Updated May 22 • 108k • • 127
meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • 109B • Updated May 22 • 775k • • 1.06k
meta-llama/Llama-4-Maverick-17B-128E-Instruct Image-Text-to-Text • 402B • Updated May 22 • 49.7k • • 395