aws-neuron/optimum-neuron-cache
License: apache-2.0
6f20ccf
optimum-neuron-cache/neuronxcc-2.15.128.0+56dc5a86/0_REGISTRY/0.0.25/inference/llama/princeton-nlp/Sheared-LLaMA-1.3B
1.8 kB
4 contributors
History: 2 commits
Latest commit: a74c2af (verified), dacorvo (HF Staff), 12 months ago: Synchronizing local compiler cache.
10d86b739c51fa605a82.json    900 Bytes    Synchronizing local compiler cache.    12 months ago
b903bdb1ec594662f8b5.json    900 Bytes    Synchronizing local compiler cache.    12 months ago