v0.34.0
Browse filesSee https://github.com/quic/ai-hub-models/releases/v0.34.0 for changelog.
README.md
CHANGED
@@ -24,6 +24,7 @@ More details on model performance across various devices, can be found
|
|
24 |
[here](https://aihub.qualcomm.com/models/huggingface_wavlm_base_plus).
|
25 |
|
26 |
|
|
|
27 |
### Model Details
|
28 |
|
29 |
- **Model Type:** Model_use_case.speech_recognition
|
@@ -35,31 +36,31 @@ More details on model performance across various devices, can be found
|
|
35 |
|
36 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
37 |
|---|---|---|---|---|---|---|---|---|
|
38 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE |
|
39 |
| HuggingFace-WavLM-Base-Plus | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 872.273 ms | 1 - 827 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
40 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE |
|
41 |
| HuggingFace-WavLM-Base-Plus | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 578.052 ms | 0 - 925 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
42 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE |
|
43 |
| HuggingFace-WavLM-Base-Plus | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 290.655 ms | 3 - 54 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
44 |
-
| HuggingFace-WavLM-Base-Plus | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 306.
|
45 |
| HuggingFace-WavLM-Base-Plus | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 324.338 ms | 1 - 826 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
46 |
-
| HuggingFace-WavLM-Base-Plus | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE |
|
47 |
| HuggingFace-WavLM-Base-Plus | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 872.273 ms | 1 - 827 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
48 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE |
|
49 |
| HuggingFace-WavLM-Base-Plus | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 292.19 ms | 0 - 51 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
50 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 580.
|
51 |
| HuggingFace-WavLM-Base-Plus | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 448.219 ms | 1 - 976 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
52 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE |
|
53 |
| HuggingFace-WavLM-Base-Plus | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 291.549 ms | 0 - 50 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
54 |
-
| HuggingFace-WavLM-Base-Plus | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 306.
|
55 |
| HuggingFace-WavLM-Base-Plus | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 324.338 ms | 1 - 826 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
56 |
-
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE |
|
57 |
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 293.176 ms | 0 - 50 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
58 |
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 475.477 ms | 1 - 51 MB | NPU | [HuggingFace-WavLM-Base-Plus.onnx](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.onnx) |
|
59 |
-
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE |
|
60 |
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 216.247 ms | 1 - 1015 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
61 |
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 348.261 ms | 1 - 961 MB | NPU | [HuggingFace-WavLM-Base-Plus.onnx](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.onnx) |
|
62 |
-
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE |
|
63 |
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 221.424 ms | 0 - 703 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
64 |
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 278.897 ms | 1 - 813 MB | NPU | [HuggingFace-WavLM-Base-Plus.onnx](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.onnx) |
|
65 |
| HuggingFace-WavLM-Base-Plus | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 313.971 ms | 285 - 285 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
@@ -121,17 +122,7 @@ device. This script does the following:
|
|
121 |
```bash
|
122 |
python -m qai_hub_models.models.huggingface_wavlm_base_plus.export
|
123 |
```
|
124 |
-
|
125 |
-
Profiling Results
|
126 |
-
------------------------------------------------------------
|
127 |
-
HuggingFace-WavLM-Base-Plus
|
128 |
-
Device : cs_8275 (ANDROID 14)
|
129 |
-
Runtime : TFLITE
|
130 |
-
Estimated inference time (ms) : 813.9
|
131 |
-
Estimated peak memory usage (MB): [0, 806]
|
132 |
-
Total # Ops : 873
|
133 |
-
Compute Unit(s) : npu (873 ops) gpu (0 ops) cpu (0 ops)
|
134 |
-
```
|
135 |
|
136 |
|
137 |
## How does this work?
|
|
|
24 |
[here](https://aihub.qualcomm.com/models/huggingface_wavlm_base_plus).
|
25 |
|
26 |
|
27 |
+
|
28 |
### Model Details
|
29 |
|
30 |
- **Model Type:** Model_use_case.speech_recognition
|
|
|
36 |
|
37 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
38 |
|---|---|---|---|---|---|---|---|---|
|
39 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 814.068 ms | 0 - 805 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
40 |
| HuggingFace-WavLM-Base-Plus | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 872.273 ms | 1 - 827 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
41 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 549.073 ms | 0 - 1192 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
42 |
| HuggingFace-WavLM-Base-Plus | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 578.052 ms | 0 - 925 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
43 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 257.914 ms | 0 - 53 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
44 |
| HuggingFace-WavLM-Base-Plus | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 290.655 ms | 3 - 54 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
45 |
+
| HuggingFace-WavLM-Base-Plus | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 306.675 ms | 0 - 804 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
46 |
| HuggingFace-WavLM-Base-Plus | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 324.338 ms | 1 - 826 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
47 |
+
| HuggingFace-WavLM-Base-Plus | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 814.068 ms | 0 - 805 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
48 |
| HuggingFace-WavLM-Base-Plus | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 872.273 ms | 1 - 827 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
49 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 258.884 ms | 0 - 53 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
50 |
| HuggingFace-WavLM-Base-Plus | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 292.19 ms | 0 - 51 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
51 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 580.917 ms | 0 - 1100 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
52 |
| HuggingFace-WavLM-Base-Plus | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 448.219 ms | 1 - 976 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
53 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 260.218 ms | 0 - 52 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
54 |
| HuggingFace-WavLM-Base-Plus | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 291.549 ms | 0 - 50 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
55 |
+
| HuggingFace-WavLM-Base-Plus | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 306.675 ms | 0 - 804 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
56 |
| HuggingFace-WavLM-Base-Plus | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 324.338 ms | 1 - 826 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
57 |
+
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 261.021 ms | 0 - 56 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
58 |
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 293.176 ms | 0 - 50 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
59 |
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 475.477 ms | 1 - 51 MB | NPU | [HuggingFace-WavLM-Base-Plus.onnx](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.onnx) |
|
60 |
+
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 182.929 ms | 0 - 848 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
61 |
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 216.247 ms | 1 - 1015 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
62 |
| HuggingFace-WavLM-Base-Plus | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 348.261 ms | 1 - 961 MB | NPU | [HuggingFace-WavLM-Base-Plus.onnx](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.onnx) |
|
63 |
+
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 170.569 ms | 0 - 800 MB | NPU | [HuggingFace-WavLM-Base-Plus.tflite](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.tflite) |
|
64 |
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 221.424 ms | 0 - 703 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
65 |
| HuggingFace-WavLM-Base-Plus | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 278.897 ms | 1 - 813 MB | NPU | [HuggingFace-WavLM-Base-Plus.onnx](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.onnx) |
|
66 |
| HuggingFace-WavLM-Base-Plus | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 313.971 ms | 285 - 285 MB | NPU | [HuggingFace-WavLM-Base-Plus.dlc](https://huggingface.co/qualcomm/HuggingFace-WavLM-Base-Plus/blob/main/HuggingFace-WavLM-Base-Plus.dlc) |
|
|
|
122 |
```bash
|
123 |
python -m qai_hub_models.models.huggingface_wavlm_base_plus.export
|
124 |
```
|
125 |
+
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
126 |
|
127 |
|
128 |
## How does this work?
|
precompiled/qualcomm-snapdragon-x-elite/HuggingFace-WavLM-Base-Plus.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 213848952
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:76bbd3dbc8fadf7ac999adee98db689e350acd0b51597dfca379f5b2db9d0079
|
3 |
size 213848952
|
precompiled/qualcomm-snapdragon-x-elite/HuggingFace-WavLM-Base-Plus.onnx.zip
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:66d9b691eb12fc6356dac23934c11fd750bd5a239cff8de756a39f9d01572b1d
|
3 |
+
size 180614190
|
precompiled/qualcomm-snapdragon-x-elite/sdk_versions.yml
ADDED
@@ -0,0 +1,5 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
sdk_versions:
|
2 |
+
qnn_context_binary:
|
3 |
+
qairt: 2.34.2.250528164111_119506
|
4 |
+
precompiled_qnn_onnx:
|
5 |
+
qairt: 2.33.2.250410134701_117956
|