AMD RyzenAI Models mohitsha/timm-resnet18-onnx-quantized-ryzen Updated Mar 21 mohitsha/transformers-resnet18-onnx-quantized-ryzen Image Classification • Updated Mar 21 • 23 mohitsha/Llama-2-7b-hf-quantized-brevitas Updated Mar 27 mohitsha/opt-125m-quantized-brevitas Text Generation • Updated Mar 27 • 10
FP8 KV Cache Models with FP8 KV Cache Scales mohitsha/Llama-2-70b-chat-hf-FP8-KV Text Generation • Updated Jun 25 • 6 mohitsha/Llama-2-7b-chat-hf-FP8-KV Text Generation • Updated Jun 25 • 9 mohitsha/Llama-2-7b-chat-hf-FP8-KV-AMMO Text Generation • Updated Jun 25 • 17 mohitsha/Llama-2-70b-chat-hf-FP8-KV-AMMO Text Generation • Updated Jun 25 • 14