mohitsha's picture
mohitsha HF staff
Update README.md
b11cd75 verified

LLama2 Model with FP8 KV Cache checkpoint for TRTLM

Generated using https://github.com/vllm-project/vllm/blob/main/examples/fp8/quantizer/quantize.py