Can I use flash attention 2 with this model?
#15
by
anuragrawal
- opened
Hi,
I am currently comparing time improvements for openai/whisper-medium.en vs distil-whisper/distil-medium.en
As suggested in the model card for distil-whisper/distil-medium.en, I am using flash attention 2 to get the best results. I don't completely understand the concept behind flash attention 2. Do I need to use it with openai/whisper-medium.en for a fair comparison? If yes,
- Is it feasible to use flash attention 2 with openai/whisper-medium.en?
- How?
Thanks!
I have NVIDIA GeForce RTX 3060 GPU.