-
Aligning Teacher with Student Preferences for Tailored Training Data Generation
Paper • 2406.19227 • Published • 26 -
Pre-training Distillation for Large Language Models: A Design Space Exploration
Paper • 2410.16215 • Published • 16 -
Baichuan Alignment Technical Report
Paper • 2410.14940 • Published • 52 -
MiniPLM: Knowledge Distillation for Pre-Training Language Models
Paper • 2410.17215 • Published • 17
By
ByRookie
AI & ML interests
None yet
Recent Activity
liked
a model
22 days ago
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8
liked
a dataset
22 days ago
nvidia/Nemotron-Post-Training-Dataset-v1
liked
a model
22 days ago
MetaStoneTec/XBai-o4