Compiled engines for running Whisper with TRT LLM for much faster inference.
baseten
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
653

baseten/whisper_trt_large_v3_turbo_250730_NVIDIA_H100_80GB_HBM3_0_13_0
Updated

baseten/whisper_trt_large_v3_250729_NVIDIA_L4_0_13_0
Updated

baseten/whisper_trt_large_v3_250729_NVIDIA_H100_80GB_HBM3_0_13_0
Updated

baseten/whisper_trt_large_v3_250729_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated

baseten/whisper_trt_large_v3_turbo_20250723_NVIDIA_L4_0_13_0
Updated

baseten/whisper_trt_large_v3_turbo_20250723_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated

baseten/Kimi-K2-Instruct-FP4
581B
•
Updated
•
2k

baseten/whisper_trt_large_v3_turbo_test_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated

baseten/whisper_trt_large_v3_test_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated

baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-A10G-v0.20.0-TP2
Updated
•
3