Compiled engines for running Whisper with TensorRT-LLM (TRT-LLM) for much faster inference.

baseten (company, verified)
Collections: 1
Models: 671

baseten/v3-nextn-head0

baseten/r1-nextn-head0

baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-H100-80GB-HBM3-v0.16.0-TP1-lookahead

baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-H100-80GB-HBM3-v0.16.0-TP1

baseten/r1-nextn-heads

baseten/whisper_trt_large_v3_turbo_NVIDIA_A10G_0_13_0

baseten/whisper_trt_large_v3_NVIDIA_L4_0_13_0_20250210

baseten/whisper_trt_large_v3_test_decoder_NVIDIA_H100_80GB_HBM3_0_13_0

baseten/btest-Qwen2.5-Coder-7B-NVIDIA-H100-80GB-HBM3-v0.16.0-TP4-FP8

baseten/btest-Qwen2.5-Coder-7B-NVIDIA-H100-80GB-HBM3-v0.16.0-TP4