Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
13
Follow
AWS Inferentia and Trainium
72
License:
apache-2.0
Model card
Files
Files and versions
Community
274
883b35a
optimum-neuron-cache
8 contributors
History:
4004 commits
5cp
Synchronizing local compiler cache.
883b35a
verified
10 months ago
2.10.0.35+3817a0c8c
Upload 2.10.0.35+3817a0c8c/vit/4c6b290b947af928bf1124b41149ee066602a97b0576bb2b92364e02039535c480be73b81f3d90d47842a3acf89aaeb9903818ae32d62e3070f238c7829b58fa/87400e737bae35747a91790622536baf7e48bfc8ac44da05c9dfe03b373c552b8740a037b64e2417cd307eaa8be5586dcae8f6be0b61c0f1cd0d22681379cc3f/MODULE_15288447620435424526+d41d8cd9/compile_flags.txt with huggingface_hub
about 1 year ago
2.11.0.18+fada6114a
Upload 2.11.0.18+fada6114a/vit/ea88be553a835f83559b7c494d4b00971f50f7a13e3aec51ab30386fcf74223c8b7b0ca80faf3b9b844387e7525f1ac570327ba82be2cb5e96ed4068bf9ba637/62d3162b6f000625bc02c2edaa0adab39d15d2309de8505b4b1cdeeaa655b42d953772e355da099f041ee55523420a52912244c3633372466dfbab0a38ba774a/MODULE_9577983374043297799+d41d8cd9/compile_flags.txt with huggingface_hub
about 1 year ago
2.11.0.34+c5231f848
Upload 2.11.0.34+c5231f848/llama/af52d4925feaa72ca214d06a0057935d2e396a299f72a28aa3d1b6a50759bf01fdf2635e9d0adc0f86d2dc6ca5e390a59b0d0e07dadeb26d05700e3377192a0b/a323cfa789ff7ee78a7930ecabb43afd182dd97e27ec302d2747e36c34fc5d6f4e5a455a69b0b133ce38fdfc97e3456e3b0f748dcbf602981a8ad96304e4cad9/MODULE_14970393104740836807+d41d8cd9/model.neff with huggingface_hub
11 months ago
2.11.0.35+4f5279863
Upload 2.11.0.35+4f5279863/llama/e0705a7aca48fd51955f11d7fb5c481bcd9eeba982837347ff2ce1d8bb90599011d6367745b49590dd88cfe8c7c7ad0904ecb35b821408fa33b3d651babd1a7f/b1376a32545109074892b6229d893fa62766a573a9e32640a5df7ad25d2a5c5d0e961836541fa9a75fe61cc9f49f4e6bc41f0edfd243a6408ffb66c49d64ecd1/MODULE_11114785808703757383+d41d8cd9/compile_flags.txt with huggingface_hub
about 1 year ago
2.11.0.6+71b2938aa
Upload 2.11.0.6+71b2938aa/vit/ea88be553a835f83559b7c494d4b00971f50f7a13e3aec51ab30386fcf74223c8b7b0ca80faf3b9b844387e7525f1ac570327ba82be2cb5e96ed4068bf9ba637/2db27bc94dc6bf8d4379677bc453bd53f8cdfc6e19244f94149ad17da04c536e07e62743ea9220eb6832265bb9265015a67d153a0f1c733c9a4d17467ed31bf7/MODULE_9577983374043297799+d41d8cd9/compile_flags.txt with huggingface_hub
about 1 year ago
2.4.0.21+b7621be18
Upload 2.4.0.21+b7621be18/bert/347919dd5f03227cdbec18c00151fbdafcabdce74ee781d248860419bdd8c0ff371426c088963af0429065e8938fec7d5fc746ae2ae765370e39cfd6ce34021c/af45ab8c436d6e6ddc365e34928c9c56858d4b9f7a3cf171ea79526478e7e644cd0b8ef1cffd4806f34fa5d90a8cba71d745b1f888a91d72a58ba50d9d0cf4b7/MODULE_15894782863176321812/MODULE_0_SyncTensorsGraph.92_15894782863176321812_ip-172-31-42-12-ebca967b-44796-5faf4497a3fcf/96630263-e516-4036-bb3b-e7d73d045f6c/MODULE_0_SyncTensorsGraph.92_15894782863176321812_ip-172-31-42-12-ebca967b-44796-5faf4497a3fcf.neff with huggingface_hub
over 1 year ago
2.5.0.28+1be23f232
Upload 2.5.0.28+1be23f232/gpt2/eebca3d92cc30b1c5da3e23694399e7ce638d1150899b985bf3ab75d33b746eb90d7cb1267f9a8e007040dfa830cf4af01392295ddd59e357073153970663dfd/b61d001df7ff7041bf4dd4f8210e6810c1f4762c61ecf9576dbcc0ec27efad9271e92119536ba8004b08c25e4f61114936a4d37e22ece720ad7d533a9cc924e0/MODULE_17318955223217006245/MODULE_6_SyncTensorsGraph.4457_17318955223217006245_ip-172-31-33-155-e9203f9d-10513-6009a1d22629b/c957a403-6b2f-415b-8dc9-f4ade41c51c2/MODULE_6_SyncTensorsGraph.4457_17318955223217006245_ip-172-31-33-155-e9203f9d-10513-6009a1d22629b.neff with huggingface_hub
over 1 year ago
2.6.0.19+3d819e565
Upload 2.6.0.19+3d819e565/marian/b90d8ab9fb434ad3dfba7a568c835e33b545da0b3bafcecce09741a75dcb7f802a8480ec222a5dbbd79e1655e66910c5107037a1aacbf66bdda041a042045d7c/4e3254e4f824724d529aa8d7aafd0624f762f7efabaaf06b249e4d9181109285f5beabfaa02c42d5c924659e069467af7ea35227ea31995b5370d5ab6ac7ef2b/MODULE_5981796870352054802/MODULE_12_SyncTensorsGraph.2325_5981796870352054802_ip-172-31-42-12-fd9a8cff-4156230-5fcfa96137dd7/3d52d296-3c40-476b-a770-428ad0c5f307/compile_flags.txt with huggingface_hub
over 1 year ago
2.7.0.40+f7c6cf2a3
Upload 2.7.0.40+f7c6cf2a3/t5/b5d927c9f7066674483250e96aa3168c7f898daa063e56e4f2e2c3ae6a2b37dfae19c36bf61edf8d8c5aac6d9615bacaef3813b96ef4720b3bda73335bd7c436/6e82f4183d2a55622c22fc3605db4e4b7d70dc6f7049562a7cfd5474358770d9aceb329838418e9789c9ac12cf7a31efa04b65b9070fbedcda008ef54e7a797a/MODULE_3455400400948827543+d41d8cd9/model.hlo.pb with huggingface_hub
over 1 year ago
2.8.0.25+a3ad0f342
Upload 2.8.0.25+a3ad0f342/bert/f06b9913a86b334229e895cea50e39eb892b612c9b6e64b5b1cc7454239f8994fbb2dcfeaf2c89f8b7fd4e1529efb2f930cca05920ad1796cd9d791e7d6daa14/3e33a65cb9a0c9e408b0757261edb020c67d2fd86aa0423a4d42b077b6bd63118ccc5c107e14498013a98de615c37be81056c11ce52f5b7499fd50f81166cf5c/MODULE_4633304630716196125+d41d8cd9/model.hlo.pb with huggingface_hub
over 1 year ago
neuronxcc-2.12.54.0+f631c2365
Synchronizing local compiler cache.
11 months ago
neuronxcc-2.12.68.0+4480452af
Synchronizing local compiler cache.
10 months ago
.gitattributes
262 kB
Synchronizing local compiler cache.
10 months ago
README.md
28 Bytes
initial commit
over 1 year ago
registry.json
27.1 kB
Add TinyLlama/TinyLlama-1.1B-Chat-v0.6 in registry for NeuronHash a323cfa789ff7ee78a7930ecabb43afd182dd97e27ec302d2747e36c34fc5d6f4e5a455a69b0b133ce38fdfc97e3456e3b0f748dcbf602981a8ad96304e4cad9
11 months ago