Pretrained models from the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"
Zayd Muhammad Kawakibi Zuhri
zaydzuhri
AI & ML interests
I really like watching loss go down
Recent Activity
Updated a model about 5 hours ago: zaydzuhri/tasklets_tokenizer_300
Published a model about 5 hours ago: zaydzuhri/tasklets_tokenizer_300
Updated a dataset about 6 hours ago: zaydzuhri/single-recall-test-128
Organizations
None yet