steerapi/TheBloke-Llama-2-7b-chat-fp16-w8-g128
Tags: Text Generation, Transformers, ONNX, llama, Inference Endpoints
Branch: main
Path: TheBloke-Llama-2-7b-chat-fp16-w8-g128/onnx
1 contributor. History: 5 commits.
Latest commit: d786ec5 by steerapi, "Upload folder using huggingface_hub", about 1 year ago
File | Scan | Size | Storage | Last commit | Updated
decoder_model_merged_quantized.onnx | Safe | 331 MB | LFS | Upload folder using huggingface_hub | about 1 year ago
decoder_model_merged_quantized.onnx_data | Safe | 6.74 GB | LFS | Upload folder using huggingface_hub | about 1 year ago
quantize_config.json | Safe | 992 Bytes | (not LFS) | Upload folder using huggingface_hub | about 1 year ago