Jonathan Mitchell
jmitch5
AI & ML interests
Generative Modeling
Recent Activity
commented on an article 20 days ago
Efficient LLM Pretraining: Packed Sequences and Masked Attention