InternLMGPTQForCausalLM hasn't fused attention module yet, will skip inject fused attention. InternLMGPTQForCausalLM hasn't fused mlp module yet, will skip inject fused mlp.

#2
by 11011Free - opened

运行python脚本有这两个信息, 要如何处理
InternLMGPTQForCausalLM hasn't fused attention module yet, will skip inject fused attention.
InternLMGPTQForCausalLM hasn't fused mlp module yet, will skip inject fused mlp.

Sign up or log in to comment