Because Ampere does not support fp8. So, any plan to quantize DeepSeek-Coder-V2-Instruct to W8A8(INT8)?
· Sign up or log in to comment