Compiled engines for running Whisper with TRT LLM for much faster inference.

baseten
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
1
models
715

baseten/whisper_trt_large_v3_20250604_NVIDIA_L4_0_13_0
Updated

baseten/DeepSeek-R1-0528-FP4
Updated
•
813

baseten/whisper_trt_large_v3_turbo_exp20250522_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated

baseten/whisper_trt_large_v3_turbo_exp20250522_NVIDIA_H100_80GB_HBM3_0_13_0
Updated

baseten/whisper_trt_large_v3_exp20250522_NVIDIA_H100_80GB_HBM3_0_13_0
Updated

baseten/whisper_trt_large_v3_exp20250522_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated

baseten/whisper_trt_large_v3_turbo_test_NVIDIA_L4_0_16_0
Updated

baseten/Qwen2.5-32B-Instruct-128k
Text Generation
•
Updated
•
13

baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.18.1-TP1
Updated
•
8

baseten/btest-Qwen-0.5B-NVIDIA-H100-80GB-HBM3-v0.18.1-TP1
Updated
•
3