bhushans committed on
Commit ff64ffd
1 Parent(s): af238ac

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +48 -49
README.md CHANGED
@@ -38,49 +38,47 @@ More details on model performance across various devices, can be found

  | Model | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
  |---|---|---|---|---|---|---|---|---|
- | WhisperEncoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 704.009 ms | 45 - 439 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
- | WhisperEncoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 866.62 ms | 2 - 230 MB | FP16 | NPU | [Whisper-Small-En.so](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.so) |
- | WhisperEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 514.369 ms | 108 - 196 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
- | WhisperEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 615.589 ms | 0 - 839 MB | FP16 | NPU | [Whisper-Small-En.so](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.so) |
- | WhisperEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | ONNX | 808.662 ms | 145 - 4336 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.onnx) |
- | WhisperEncoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | TFLITE | 537.047 ms | 111 - 139 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
- | WhisperEncoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 494.644 ms | 0 - 908 MB | FP16 | NPU | Use Export Script |
- | WhisperEncoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | ONNX | 698.505 ms | 117 - 2773 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.onnx) |
- | WhisperEncoder | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 686.962 ms | 40 - 440 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
- | WhisperEncoder | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 693.295 ms | 1 - 2 MB | FP16 | NPU | Use Export Script |
- | WhisperEncoder | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 711.433 ms | 18 - 355 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
- | WhisperEncoder | SA8255 (Proxy) | SA8255P Proxy | QNN | 701.7 ms | 1 - 2 MB | FP16 | NPU | Use Export Script |
- | WhisperEncoder | SA8775 (Proxy) | SA8775P Proxy | TFLITE | 703.544 ms | 61 - 457 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
- | WhisperEncoder | SA8775 (Proxy) | SA8775P Proxy | QNN | 713.618 ms | 0 - 57 MB | FP16 | NPU | Use Export Script |
- | WhisperEncoder | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 712.376 ms | 33 - 421 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
- | WhisperEncoder | SA8650 (Proxy) | SA8650P Proxy | QNN | 724.601 ms | 1 - 30 MB | FP16 | NPU | Use Export Script |
- | WhisperEncoder | SA8295P ADP | SA8295P | TFLITE | 658.817 ms | 108 - 140 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
- | WhisperEncoder | SA8295P ADP | SA8295P | QNN | 727.239 ms | 3 - 8 MB | FP16 | NPU | Use Export Script |
- | WhisperEncoder | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 972.067 ms | 75 - 170 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
- | WhisperEncoder | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 526.155 ms | 0 - 0 MB | FP16 | NPU | Use Export Script |
- | WhisperEncoder | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 1356.64 ms | 449 - 449 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.onnx) |
- | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 25.328 ms | 16 - 19 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
- | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 12.008 ms | 61 - 130 MB | FP16 | NPU | [Whisper-Small-En.so](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.so) |
- | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | ONNX | 57.142 ms | 121 - 124 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.onnx) |
- | WhisperDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 19.228 ms | 32 - 1144 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
- | WhisperDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 9.452 ms | 59 - 158 MB | FP16 | NPU | [Whisper-Small-En.so](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.so) |
  | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | TFLITE | 16.628 ms | 16 - 263 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
- | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 7.482 ms | 50 - 183 MB | FP16 | NPU | Use Export Script |
- | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | ONNX | 39.601 ms | 111 - 883 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.onnx) |
- | WhisperDecoder | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 25.84 ms | 16 - 18 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
- | WhisperDecoder | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 12.335 ms | 57 - 58 MB | FP16 | NPU | Use Export Script |
- | WhisperDecoder | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 25.356 ms | 15 - 18 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
- | WhisperDecoder | SA8255 (Proxy) | SA8255P Proxy | QNN | 12.62 ms | 64 - 65 MB | FP16 | NPU | Use Export Script |
- | WhisperDecoder | SA8775 (Proxy) | SA8775P Proxy | TFLITE | 25.355 ms | 14 - 18 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
- | WhisperDecoder | SA8775 (Proxy) | SA8775P Proxy | QNN | 12.69 ms | 65 - 66 MB | FP16 | NPU | Use Export Script |
- | WhisperDecoder | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 24.58 ms | 15 - 18 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
- | WhisperDecoder | SA8650 (Proxy) | SA8650P Proxy | QNN | 12.971 ms | 61 - 62 MB | FP16 | NPU | Use Export Script |
- | WhisperDecoder | SA8295P ADP | SA8295P | TFLITE | 27.039 ms | 16 - 243 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
- | WhisperDecoder | SA8295P ADP | SA8295P | QNN | 14.311 ms | 57 - 62 MB | FP16 | NPU | Use Export Script |
- | WhisperDecoder | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 28.297 ms | 16 - 1104 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
- | WhisperDecoder | QCS8450 (Proxy) | QCS8450 Proxy | QNN | 15.403 ms | 57 - 156 MB | FP16 | NPU | Use Export Script |
- | WhisperDecoder | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 10.907 ms | 61 - 61 MB | FP16 | NPU | Use Export Script |
- | WhisperDecoder | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 49.551 ms | 232 - 232 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.onnx) |


 
@@ -145,8 +143,8 @@ Profiling Results
  WhisperEncoder
  Device : Samsung Galaxy S23 (13)
  Runtime : TFLITE
- Estimated inference time (ms) : 704.0
- Estimated peak memory usage (MB): [45, 439]
  Total # Ops : 911
  Compute Unit(s) : GPU (900 ops) CPU (11 ops)
 
@@ -154,8 +152,8 @@ Compute Unit(s) : GPU (900 ops) CPU (11 ops)
  WhisperDecoder
  Device : Samsung Galaxy S23 (13)
  Runtime : TFLITE
- Estimated inference time (ms) : 25.3
- Estimated peak memory usage (MB): [16, 19]
  Total # Ops : 2573
  Compute Unit(s) : NPU (2573 ops)
  ```
@@ -176,11 +174,12 @@ in memory using the `jit.trace` and then call the `submit_compile_job` API.
  import torch

  import qai_hub as hub
- from qai_hub_models.models.whisper_small_en import WhisperEncoder,WhisperDecoder

  # Load the model
- encoder_model = WhisperEncoder.from_pretrained()
- decoder_model = WhisperDecoder.from_pretrained()

  # Device
  device = hub.Device("Samsung Galaxy S23")
 

  | Model | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
  |---|---|---|---|---|---|---|---|---|
+ | WhisperEncoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 722.7 ms | 69 - 449 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
+ | WhisperEncoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 820.248 ms | 0 - 209 MB | FP16 | NPU | [Whisper-Small-En.so](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.so) |
+ | WhisperEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 518.95 ms | 111 - 201 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
+ | WhisperEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | ONNX | 778.511 ms | 113 - 3977 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.onnx) |
+ | WhisperEncoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 516.678 ms | 0 - 906 MB | FP16 | NPU | Use Export Script |
+ | WhisperEncoder | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 696.316 ms | 85 - 465 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
+ | WhisperEncoder | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 644.566 ms | 1 - 2 MB | FP16 | NPU | Use Export Script |
+ | WhisperEncoder | SA7255P ADP | SA7255P | TFLITE | 4426.504 ms | 108 - 142 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
+ | WhisperEncoder | SA7255P ADP | SA7255P | QNN | 3210.318 ms | 1 - 8 MB | FP16 | NPU | Use Export Script |
+ | WhisperEncoder | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 705.299 ms | 26 - 406 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
+ | WhisperEncoder | SA8255 (Proxy) | SA8255P Proxy | QNN | 638.347 ms | 1 - 2 MB | FP16 | NPU | Use Export Script |
+ | WhisperEncoder | SA8295P ADP | SA8295P | QNN | 700.683 ms | 3 - 9 MB | FP16 | NPU | Use Export Script |
+ | WhisperEncoder | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 709.764 ms | 78 - 445 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
+ | WhisperEncoder | SA8650 (Proxy) | SA8650P Proxy | QNN | 678.71 ms | 1 - 3 MB | FP16 | NPU | Use Export Script |
+ | WhisperEncoder | SA8775P ADP | SA8775P | TFLITE | 1293.65 ms | 108 - 140 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
+ | WhisperEncoder | SA8775P ADP | SA8775P | QNN | 603.983 ms | 1 - 6 MB | FP16 | NPU | Use Export Script |
+ | WhisperEncoder | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 969.375 ms | 110 - 205 MB | FP16 | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.tflite) |
+ | WhisperEncoder | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 504.049 ms | 0 - 0 MB | FP16 | NPU | Use Export Script |
+ | WhisperEncoder | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 1342.641 ms | 237 - 237 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperEncoder.onnx) |
+ | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 28.657 ms | 16 - 100 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
+ | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 11.929 ms | 61 - 141 MB | FP16 | NPU | [Whisper-Small-En.so](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.so) |
+ | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | ONNX | 58.778 ms | 120 - 123 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.onnx) |
+ | WhisperDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 23.885 ms | 16 - 148 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
+ | WhisperDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 9.45 ms | 446 - 552 MB | FP16 | NPU | [Whisper-Small-En.so](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.so) |
+ | WhisperDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | ONNX | 47.995 ms | 85 - 1135 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.onnx) |
  | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | TFLITE | 16.628 ms | 16 - 263 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
+ | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 8.06 ms | 53 - 188 MB | FP16 | NPU | Use Export Script |
+ | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | ONNX | 44.088 ms | 69 - 697 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.onnx) |
+ | WhisperDecoder | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 28.65 ms | 16 - 101 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
+ | WhisperDecoder | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 11.92 ms | 61 - 71 MB | FP16 | NPU | Use Export Script |
+ | WhisperDecoder | SA7255P ADP | SA7255P | QNN | 74.962 ms | 56 - 64 MB | FP16 | NPU | Use Export Script |
+ | WhisperDecoder | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 29.533 ms | 16 - 99 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
+ | WhisperDecoder | SA8255 (Proxy) | SA8255P Proxy | QNN | 12.125 ms | 57 - 62 MB | FP16 | NPU | Use Export Script |
+ | WhisperDecoder | SA8295P ADP | SA8295P | TFLITE | 30.807 ms | 16 - 162 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
+ | WhisperDecoder | SA8295P ADP | SA8295P | QNN | 14.596 ms | 57 - 62 MB | FP16 | NPU | Use Export Script |
+ | WhisperDecoder | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 29.43 ms | 16 - 99 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
+ | WhisperDecoder | SA8650 (Proxy) | SA8650P Proxy | QNN | 12.052 ms | 65 - 66 MB | FP16 | NPU | Use Export Script |
+ | WhisperDecoder | SA8775P ADP | SA8775P | TFLITE | 33.02 ms | 16 - 174 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
+ | WhisperDecoder | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 34.145 ms | 16 - 139 MB | FP16 | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.tflite) |
+ | WhisperDecoder | QCS8450 (Proxy) | QCS8450 Proxy | QNN | 15.967 ms | 57 - 173 MB | FP16 | NPU | Use Export Script |
+ | WhisperDecoder | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 52.917 ms | 232 - 232 MB | FP16 | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/WhisperDecoder.onnx) |


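The size of the latency changes in the table hunk is easier to judge as relative deltas. The following is a minimal sketch, not part of the commit: it copies the old and new inference times for the Samsung Galaxy S23 rows from the diff and prints the percentage change. The names `S23_LATENCY_MS` and `percent_change` are hypothetical helpers introduced only for this illustration.

```python
# Inference times in ms, copied from the Samsung Galaxy S23 rows of this diff.
# Keys are (model, runtime); values are (old_ms, new_ms).
S23_LATENCY_MS = {
    ("WhisperEncoder", "TFLITE"): (704.009, 722.7),
    ("WhisperEncoder", "QNN"): (866.62, 820.248),
    ("WhisperDecoder", "TFLITE"): (25.328, 28.657),
    ("WhisperDecoder", "QNN"): (12.008, 11.929),
    ("WhisperDecoder", "ONNX"): (57.142, 58.778),
}


def percent_change(old_ms: float, new_ms: float) -> float:
    """Relative change in percent; positive means the updated figure is slower."""
    return (new_ms - old_ms) / old_ms * 100.0


# Map each (model, runtime) pair to its relative latency change.
changes = {
    key: percent_change(old, new) for key, (old, new) in S23_LATENCY_MS.items()
}

for (model, runtime), delta in sorted(changes.items()):
    print(f"{model:15s} {runtime:7s} {delta:+.2f}%")
```

Under these figures the S23 TFLITE numbers got slower for both components, while the QNN numbers improved slightly; the same dictionary can be extended with other devices from the table.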
 
 
  WhisperEncoder
  Device : Samsung Galaxy S23 (13)
  Runtime : TFLITE
+ Estimated inference time (ms) : 722.7
+ Estimated peak memory usage (MB): [69, 449]
  Total # Ops : 911
  Compute Unit(s) : GPU (900 ops) CPU (11 ops)
 
 
  WhisperDecoder
  Device : Samsung Galaxy S23 (13)
  Runtime : TFLITE
+ Estimated inference time (ms) : 28.7
+ Estimated peak memory usage (MB): [16, 100]
  Total # Ops : 2573
  Compute Unit(s) : NPU (2573 ops)
  ```
 
  import torch

  import qai_hub as hub
+ from qai_hub_models.models.whisper_small_en import Model

  # Load the model
+ model = Model.from_pretrained()
+ encoder_model = model.encoder
+ decoder_model = model.decoder

  # Device
  device = hub.Device("Samsung Galaxy S23")
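
The updated snippet loads one `Model` and reads its `encoder` and `decoder` attributes instead of loading two classes separately. The access pattern can be sketched with stand-in classes. `StubEncoder`, `StubDecoder`, and `StubWhisperModel` below are hypothetical and do not reproduce the real `qai_hub_models` classes; they only show how a single load can yield both components.

```python
from dataclasses import dataclass


class StubEncoder:
    """Hypothetical stand-in for the encoder component."""

    name = "encoder"


class StubDecoder:
    """Hypothetical stand-in for the decoder component."""

    name = "decoder"


@dataclass
class StubWhisperModel:
    """Hypothetical container that owns both components."""

    encoder: StubEncoder
    decoder: StubDecoder

    @classmethod
    def from_pretrained(cls) -> "StubWhisperModel":
        # A real implementation would load weights here; this stub only
        # constructs the two components so the access pattern is visible.
        return cls(encoder=StubEncoder(), decoder=StubDecoder())


# Same shape as the added lines in the diff: one load, two attribute reads.
model = StubWhisperModel.from_pretrained()
encoder_model = model.encoder
decoder_model = model.decoder
print(encoder_model.name, decoder_model.name)
```

One plausible motivation for such a layout is that the two components always come from the same load call, though the commit itself does not state a reason.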