运行python脚本有这两个信息, 要如何处理InternLMGPTQForCausalLM hasn't fused attention module yet, will skip inject fused attention.InternLMGPTQForCausalLM hasn't fused mlp module yet, will skip inject fused mlp.
· Sign up or log in to comment