The `self._attn` method of `QwenAttention` has an attention_mask bug
#10
opened by chencyudel
The `self._attn` method of `QwenAttention` needs a fix: the `attention_mask` argument is never added to the attention weights, so padded positions are not masked out before the softmax.
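For reference, a minimal sketch of what the fix looks like in a GPT-2-style `_attn`. This is not the actual Qwen code; the function is written standalone for illustration, and the causal-mask construction is an assumption following the common Hugging Face pattern:

```python
import math
import torch
import torch.nn.functional as F

def _attn(query, key, value, attention_mask=None):
    # query/key/value: (batch, num_heads, seq_len, head_dim)
    attn_weights = torch.matmul(query, key.transpose(-1, -2)) / math.sqrt(value.size(-1))

    # Causal mask: each query position may only attend to keys at or before it.
    q_len, k_len = query.size(-2), key.size(-2)
    causal_mask = torch.tril(
        torch.ones(q_len, k_len, dtype=torch.bool, device=query.device),
        diagonal=k_len - q_len,
    )
    attn_weights = attn_weights.masked_fill(~causal_mask, torch.finfo(attn_weights.dtype).min)

    # The missing step this issue points out: fold in the additive padding
    # attention_mask (0 at real tokens, a large negative value at padded
    # positions) before the softmax, so padded tokens get ~zero probability.
    if attention_mask is not None:
        attn_weights = attn_weights + attention_mask

    attn_weights = F.softmax(attn_weights, dim=-1)
    return torch.matmul(attn_weights, value), attn_weights
```

Without the `attention_mask` addition, left-padded sequences in a batch attend to their padding tokens, which is why the bug mainly surfaces in batch inference.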
Thanks for the feedback. We have updated the code (as part of the support for batch inference), which I think should fix this problem as well. Please pull the latest code and check whether it resolves the issue for you. Let me know if the problem persists.
jklj077 changed discussion status to closed