Upload README.md with huggingface_hub
README.md (CHANGED)
@@ -34,10 +34,15 @@ More details on model performance across various devices, can be found
 - Model size (WhisperDecoder): 531 MB
 
 
+
+
 | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
 | ---|---|---|---|---|---|---|---|
-| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite |
-| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 26.
+| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 610.635 ms | 8 - 417 MB | FP16 | GPU | [WhisperEncoder.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite)
+| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 26.644 ms | 16 - 20 MB | FP16 | NPU | [WhisperDecoder.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite)
+| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Model Library | 1969.063 ms | 1 - 216 MB | FP16 | NPU | [WhisperEncoder.so](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.so)
+| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Model Library | 24.731 ms | 118 - 191 MB | FP16 | NPU | [WhisperDecoder.so](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.so)
+
 
 
 ## Installation
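
The Target Model column above links to compiled assets hosted in this repository (the upload this commit documents). As a minimal sketch of fetching those assets locally with `huggingface_hub`, where the repo id and filenames are taken from the table links and everything else is illustrative:

```python
# Minimal sketch: download the compiled assets listed in the table above.
# repo_id and filenames come from the table's links; nothing else is implied.
from huggingface_hub import hf_hub_download

encoder_path = hf_hub_download(
    repo_id="qualcomm/Whisper-Small-En",
    filename="WhisperEncoder.tflite",
)
decoder_path = hf_hub_download(
    repo_id="qualcomm/Whisper-Small-En",
    filename="WhisperDecoder.tflite",
)
print(encoder_path, decoder_path)  # local cache paths of the downloaded files
```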
@@ -95,9 +100,28 @@ device. This script does the following:
 python -m qai_hub_models.models.whisper_small_en.export
 ```
 
+```
+Profile Job summary of WhisperEncoder
+--------------------------------------------------
+Device: Snapdragon X Elite CRD (11)
+Estimated Inference Time: 1682.16 ms
+Estimated Peak Memory Range: 0.92-0.92 MB
+Compute Units: NPU (1473) | Total (1473)
+
+Profile Job summary of WhisperDecoder
+--------------------------------------------------
+Device: Snapdragon X Elite CRD (11)
+Estimated Inference Time: 20.40 ms
+Estimated Peak Memory Range: 121.48-121.48 MB
+Compute Units: NPU (2255) | Total (2255)
+
+
+```
+
+
 ## How does this work?
 
-This [export script](https://
+This [export script](https://aihub.qualcomm.com/models/whisper_small_en/qai_hub_models/models/Whisper-Small-En/export.py)
 leverages [Qualcomm® AI Hub](https://aihub.qualcomm.com/) to optimize, validate, and deploy this model
 on-device. Let's go through each step below in detail:
 
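
The export command in this hunk compiles and profiles both Whisper components on Qualcomm® AI Hub, producing summaries like the ones added above. As a rough, hypothetical sketch of the flow it automates (compile a source model for a hosted device, profile it, download the compiled asset) using the public `qai_hub` client; the source model path, input name, and input shape below are placeholders, not values taken from the actual export script:

```python
# Hypothetical sketch of the compile -> profile -> download flow that the
# export command automates via Qualcomm AI Hub. The model file, input name,
# and input shape are placeholders, not taken from the actual export script.
import qai_hub as hub

device = hub.Device("Samsung Galaxy S23 Ultra")  # any AI Hub hosted device

# 1. Compile a source model (e.g. ONNX/TorchScript) into a device-ready asset.
compile_job = hub.submit_compile_job(
    model="whisper_encoder.onnx",          # placeholder source model
    device=device,
    input_specs={"audio": (1, 80, 3000)},  # placeholder input spec
)
target_model = compile_job.get_target_model()  # waits for compilation

# 2. Profile the compiled asset on a real device; this is what produces
#    numbers like the "Profile Job summary" blocks shown above.
profile_job = hub.submit_profile_job(model=target_model, device=device)

# 3. Download the compiled asset for local deployment.
target_model.download("WhisperEncoder.tflite")
```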
@@ -175,6 +199,7 @@ AI Hub. [Sign up for access](https://myaccount.qualcomm.com/signup).
 
 
 
+
 ## Deploying compiled model to Android
 
 
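
The body of the Android deployment section is not part of this diff. As a hedged aside, the compiled `.tflite` assets from the table above can be sanity-checked on a workstation with the TensorFlow Lite Python interpreter before being bundled into an Android app; the zero-filled input below is a placeholder, not a real mel spectrogram:

```python
# Hedged sanity check of a compiled asset with the TFLite Python interpreter.
# The zero tensor below is a placeholder input, not a real mel spectrogram.
import numpy as np
import tensorflow as tf

interpreter = tf.lite.Interpreter(model_path="WhisperEncoder.tflite")
interpreter.allocate_tensors()

inp = interpreter.get_input_details()[0]
out = interpreter.get_output_details()[0]

dummy = np.zeros(inp["shape"], dtype=inp["dtype"])  # whatever shape the model declares
interpreter.set_tensor(inp["index"], dummy)
interpreter.invoke()
print(interpreter.get_tensor(out["index"]).shape)  # encoder output shape
```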
@@ -196,7 +221,7 @@ Explore all available models on [Qualcomm® AI Hub](https://aihub.qualcomm.com/)
 ## License
 - The license for the original implementation of Whisper-Small-En can be found
 [here](https://github.com/openai/whisper/blob/main/LICENSE).
-- The license for the compiled assets for on-device deployment can be found [here](
+- The license for the compiled assets for on-device deployment can be found [here](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/Qualcomm+AI+Hub+Proprietary+License.pdf)
 
 ## References
 * [Robust Speech Recognition via Large-Scale Weak Supervision](https://cdn.openai.com/papers/whisper.pdf)