steerapi/Llama-2-7b-chat-hf-onnx-awq-w8-g0

Text Generation

Inference Endpoints

Llama-2-7b-chat-hf-onnx-awq-w8-g0 / onnx

1 contributor

History: 1 commit

Latest commit: Upload folder using huggingface_hub (83cc9a7, about 1 year ago)

File                                      Size       Storage  Last commit                          Updated
decoder_model_merged_quantized.onnx       12.1 MB    LFS      Upload folder using huggingface_hub  about 1 year ago
decoder_model_merged_quantized.onnx_data  6.74 GB    LFS      Upload folder using huggingface_hub  about 1 year ago
quantize_config.json                      992 Bytes  -        Upload folder using huggingface_hub  about 1 year ago