chuxin-llm/Scaling-Laws-for-Local-SGD-in-LLM-Intermediate-Checkpoints
arXiv: 2409.13198
License: mit
main
Scaling-Laws-for-Local-SGD-in-LLM-Intermediate-Checkpoints/base (12.5 GB)
1 contributor
History: 1 commit
colourful-tree: Upload 40 files (e1b5902, verified), about 1 year ago
base_0.005b   Upload 40 files   about 1 year ago
base_0.012b   Upload 40 files   about 1 year ago
base_0.025b   Upload 40 files   about 1 year ago
base_0.05b    Upload 40 files   about 1 year ago
base_0.1b     Upload 40 files   about 1 year ago
base_0.2b     Upload 40 files   about 1 year ago
base_0.4b     Upload 40 files   about 1 year ago
base_0.8b     Upload 40 files   about 1 year ago
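
Because the base directory totals 12.5 GB across eight model sizes, it may be convenient to fetch only one size. The following is a minimal sketch, assuming the `huggingface_hub` package is installed and that every file for a size sits under `base/base_<size>/`, as the listing above suggests. The helper names (`checkpoint_pattern`, `belongs_to`, `download_checkpoints`) are hypothetical and not part of the repository; the file names inside each directory are not shown on this page, so the pattern simply matches everything under it.

```python
from fnmatch import fnmatch

# Repository id and size suffixes are taken from the listing above.
REPO_ID = "chuxin-llm/Scaling-Laws-for-Local-SGD-in-LLM-Intermediate-Checkpoints"
SIZES = ["0.005b", "0.012b", "0.025b", "0.05b", "0.1b", "0.2b", "0.4b", "0.8b"]


def checkpoint_pattern(size: str) -> str:
    """Return a glob pattern covering every file under base/base_<size>/ (hypothetical helper)."""
    if size not in SIZES:
        raise ValueError(f"unknown size {size!r}; expected one of {SIZES}")
    return f"base/base_{size}/*"


def belongs_to(path: str, size: str) -> bool:
    """Check whether a repository-relative path would be selected for the given size."""
    return fnmatch(path, checkpoint_pattern(size))


def download_checkpoints(size: str, local_dir=None) -> str:
    """Download only one size directory and return the local snapshot path."""
    # Imported lazily so the helpers above work without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=[checkpoint_pattern(size)],
        local_dir=local_dir,
    )


if __name__ == "__main__":
    # Fetches the files under base/base_0.1b/ and prints where they were stored.
    print(download_checkpoints("0.1b"))
```

Restricting the download with `allow_patterns` avoids pulling the other size directories; passing `local_dir` places the files in a chosen folder instead of the default cache.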