Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
13
Follow
AWS Inferentia and Trainium
73
License:
apache-2.0
Model card
Files
Files and versions
Community
276
cc33b0b
optimum-neuron-cache
/
neuronxcc-2.12.68.0+4480452af
/
0_REGISTRY
/
0.0.20
/
inference
/
llama
/
meta-llama
/
Llama-2-70b-chat-hf
8 contributors
History:
2 commits
dacorvo
HF staff
Synchronizing local compiler cache.
608a595
verified
10 months ago
30c73ec5edddf208c905.json
Safe
861 Bytes
Synchronizing local compiler cache.
10 months ago
bb799f26ba52a99d8a74.json
Safe
861 Bytes
Synchronizing local compiler cache.
10 months ago