Quantize DeepSeek-Coder-V2-Instruct to W8A8 (INT8)?

#2
by traphix - opened

Ampere GPUs do not support FP8, so are there any plans to quantize DeepSeek-Coder-V2-Instruct to W8A8 (INT8)?
