Update README.md
Browse files
README.md
CHANGED
@@ -35,7 +35,8 @@ __Goals of elastic models:__
|
|
35 |
|
36 |
> It\'s important to note that specific quality degradation can vary. We aim for S models to retain high perceptual quality. The "Original" in tables refers to the non-compiled Hugging Face model, while "XL" is the compiled original. S, M, L are ANNA-quantized and compiled.
|
37 |
|
38 |
-
|
|
|
39 |
|
40 |
## Audio Examples
|
41 |
|
@@ -139,28 +140,27 @@ The `Original` column in latency benchmarks typically refers to the Hugging Face
|
|
139 |
|
140 |
Performance for generating audio (decoder stage, max_new_tokens = 256 (5 seconds audio)).
|
141 |
|
142 |
-
| GPU Type | S | M | L | XL (Compiled Original) | Original (HF, non-compiled) |
|
143 |
-
|----------|--------|--------|--------|------------------------|-----------------------------|
|
144 |
-
| H100 | 122.75 | 124.70 | 126.21 | 126.71 | 45.33 |
|
145 |
-
| L40S | 96.74 | 90.90 | 86.51 | 83.31 | 44.69 |
|
146 |
|
147 |
-
|
|
|
|
|
|
|
|
|
|
|
148 |
|
149 |
**Batch Size 16:**
|
150 |
-
| GPU Type | S Mode (TPS) | XL Mode (TPS) |
|
151 |
-
|----------|--------------|---------------|
|
152 |
-
| H100 | 94.21 | 97.96 |
|
153 |
-
| L40S | 69.66 | 63.19 |
|
154 |
|
155 |
-
|
156 |
-
|
157 |
-
|
158 |
-
|
|
159 |
-
| L40S | 54.81 | 51.34 |
|
160 |
|
161 |
-
|
162 |
|
163 |
-
|
|
|
|
|
|
|
164 |
|
165 |
|
166 |
## Links
|
|
|
35 |
|
36 |
> It\'s important to note that specific quality degradation can vary. We aim for S models to retain high perceptual quality. The "Original" in tables refers to the non-compiled Hugging Face model, while "XL" is the compiled original. S, M, L are ANNA-quantized and compiled.
|
37 |
|
38 |
+
|
39 |
+

|
40 |
|
41 |
## Audio Examples
|
42 |
|
|
|
140 |
|
141 |
Performance for generating audio (decoder stage, max_new_tokens = 256 (5 seconds audio)).
|
142 |
|
|
|
|
|
|
|
|
|
143 |
|
144 |
+
**Batch Size 1:**
|
145 |
+
|
146 |
+
| GPU Type | S | M | L | XL | Original |
|
147 |
+
|--------|---|---|---|----|----|
|
148 |
+
| H100 | 130.52 | 129.87 | 128.57 | 129.25 | 44.80 |
|
149 |
+
| L40S | 101.70 | 95.65 | 89.99 | 83.39 | 44.43 |
|
150 |
|
151 |
**Batch Size 16:**
|
|
|
|
|
|
|
|
|
152 |
|
153 |
+
| GPU Type | S | M | L | XL | Original |
|
154 |
+
|--------|---|---|---|----|----|
|
155 |
+
| H100 | 106.06 | 105.82 | 107.07 | 106.55 | 41.09 |
|
156 |
+
| L40S | 74.97 | 71.52 | 68.09 | 63.86 | 36.40 |
|
|
|
157 |
|
158 |
+
**Batch Size 32:**
|
159 |
|
160 |
+
| GPU Type | S | M | L | XL | Original |
|
161 |
+
|--------|---|---|---|----|----|
|
162 |
+
| H100 | 83.58 | 84.13 | 84.04 | 83.90 | 34.50 |
|
163 |
+
| L40S | 57.36 | 55.60 | 53.73 | 51.33 | 28.72 |
|
164 |
|
165 |
|
166 |
## Links
|