mohitsha/Llama-2-7b-chat-hf-AMMO-TRT
Llama 2 (Llama-2-7b-chat-hf) model checkpoint with an FP8 KV cache, for TensorRT-LLM (TRTLM).
Generated using https://github.com/vllm-project/vllm/blob/main/examples/fp8/quantizer/quantize.py
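
For readers unfamiliar with FP8 KV caches, the following is a minimal sketch of the per-tensor scaling idea that such checkpoints generally rely on. It is not the code in quantize.py: the function names are hypothetical, the E4M3 format (largest finite value 448) is an assumption about the FP8 variant, and rounding to the actual FP8 grid is left out.

```python
# Illustrative only: per-tensor scaling for an FP8 (E4M3) KV cache.
# Assumption: E4M3 format, whose largest finite value is 448.
FP8_E4M3_MAX = 448.0


def kv_scale(values):
    """Return a scale so the largest magnitude maps onto FP8_E4M3_MAX."""
    amax = max(abs(v) for v in values)
    return amax / FP8_E4M3_MAX if amax > 0 else 1.0


def to_scaled(values, scale):
    """Divide by the scale and clamp to the FP8 range (FP8 rounding omitted)."""
    return [max(-FP8_E4M3_MAX, min(FP8_E4M3_MAX, v / scale)) for v in values]


def from_scaled(scaled, scale):
    """Recover approximate original values by multiplying the scale back in."""
    return [s * scale for s in scaled]


if __name__ == "__main__":
    keys = [0.5, -2.0, 1.0]  # hypothetical calibration values for one tensor
    scale = kv_scale(keys)
    print(scale, to_scaled(keys, scale))
```

In a real pipeline the scale would be estimated from calibration data and stored alongside the weights, so the runtime can rescale cached keys and values on read.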