Update README.md
Browse files
README.md
CHANGED
|
@@ -212,7 +212,7 @@ print(f"Transcription: {transcription}")
|
|
| 212 |
```
|
| 213 |
|
| 214 |
__System requirements:__
|
| 215 |
-
* GPUs: NVIDIA GeForce H100, L40S
|
| 216 |
* CPU: AMD, Intel
|
| 217 |
* Python: 3.8-3.12 (check dependencies for specific versions)
|
| 218 |
|
|
@@ -223,6 +223,15 @@ pip install thestage
|
|
| 223 |
pip install 'thestage-elastic-models[nvidia]'
|
| 224 |
pip install flash-attn==2.7.3 --no-build-isolation
|
| 225 |
pip uninstall apex
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 226 |
```
|
| 227 |
|
| 228 |
Then go to [app.thestage.ai](https://app.thestage.ai), login and generate API token from your profile page. Set up API token as follows:
|
|
@@ -260,6 +269,8 @@ Performance for transcribing audio (tps):
|
|
| 260 |
|----------|---|----------|
|
| 261 |
| H100 | 223.47 | 82.84 |
|
| 262 |
| L40S | 210.67 | [TBD] |
|
|
|
|
|
|
|
| 263 |
|
| 264 |
## Links
|
| 265 |
|
|
|
|
| 212 |
```
|
| 213 |
|
| 214 |
__System requirements:__
|
| 215 |
+
* GPUs: NVIDIA GeForce 4090, NVIDIA GeForce 5090, H100, L40S
|
| 216 |
* CPU: AMD, Intel
|
| 217 |
* Python: 3.8-3.12 (check dependencies for specific versions)
|
| 218 |
|
|
|
|
| 223 |
pip install 'thestage-elastic-models[nvidia]'
|
| 224 |
pip install flash-attn==2.7.3 --no-build-isolation
|
| 225 |
pip uninstall apex
|
| 226 |
+
|
| 227 |
+
# or for blackwell support
|
| 228 |
+
pip install 'thestage-elastic-models[blackwell]'
|
| 229 |
+
pip install torch==2.7.0+cu128 torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
|
| 230 |
+
# please download the appropriate version of Wheels for your system from https://github.com/Zarrac/flashattention-blackwell-wheels-whl-ONLY-5090-5080-5070-5060-flash-attention-/releases/tag/FlashAttention
|
| 231 |
+
mv flash_attn-2.7.4.post1-rtx5090-torch2.7.0cu128cxx11abiTRUE-cp311-linux_x86_64.whl flash_attn-2.7.4.post1-0rtx5090torch270cu128cxx11abiTRUE-cp311-cp311-linux_x86_64.whl
|
| 232 |
+
pip install flash_attn-2.7.4.post1-0rtx5090torch270cu128cxx11abiTRUE-cp311-cp311-linux_x86_64.whl
|
| 233 |
+
pip install tensorrt==10.11.0.33
|
| 234 |
+
pip uninstall apex
|
| 235 |
```
|
| 236 |
|
| 237 |
Then go to [app.thestage.ai](https://app.thestage.ai), login and generate API token from your profile page. Set up API token as follows:
|
|
|
|
| 269 |
|----------|---|----------|
|
| 270 |
| H100 | 223.47 | 82.84 |
|
| 271 |
| L40S | 210.67 | [TBD] |
|
| 272 |
+
| GeForce RTX 4090 | 240 | 86.63 |
|
| 273 |
+
| GeForce RTX 5090 | [TBD] | [TBD] |
|
| 274 |
|
| 275 |
## Links
|
| 276 |
|