arxiv:2404.08634
Sunny Sanyal
Sunny111
AI & ML interests
Efficient Training Recipes for Large Models (mostly LLMs)
Recent Activity
upvoted a paper 5 days ago: Pre-training Small Base LMs with Fewer Tokens
liked a model 8 days ago: GuminiResearch/Gumini-1.5B-Base
liked a model 10 days ago: GuminiResearch/Gumini-1B-Base