Loading this model with vllm==0.6.3 raises the following error
#1 by wc-llm - opened
When I load this model with vllm==0.6.3, it raises the following error:
  File "/usr/local/lib/python3.10/dist-packages/vllm/model_executor/parameter.py", line 133, in load_qkv_weight
    assert param_data.shape == loaded_weight.shape
AssertionError
wc-llm changed discussion status to closed