Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
13
Follow
AWS Inferentia and Trainium
74
License:
apache-2.0
Model card
Files
Files and versions
Community
279
b48598b
optimum-neuron-cache
/
neuronxcc-2.12.68.0+4480452af
/
0_REGISTRY
/
0.0.21.dev0
/
inference
/
llama
/
princeton-nlp
/
Sheared-LLaMA-1.3B
8 contributors
History:
1 commit
dacorvo
HF staff
Synchronizing local compiler cache.
c9f4999
verified
10 months ago
624ef8314775a5c7b63b.json
Safe
881 Bytes
Synchronizing local compiler cache.
10 months ago
f1c71b95ef4e98e06b6a.json
Safe
881 Bytes
Synchronizing local compiler cache.
10 months ago