Compiled engines for running Whisper with TRT LLM for much faster inference.

baseten
company
Verified
AI & ML interests
None defined yet.
Recent Activity
View all activity
Collections
1
models
686

baseten/btest-Qwen2.5-7B-Instruct-NVIDIA-H100-80GB-HBM3-v0.17.0-TP1
Updated

baseten/btest-engine-builder-tllm-llama-1b
Text Generation
•
Updated
•
4

baseten/whisper_trt_large_v3_turbo_test20250307_NVIDIA_L4_0_13_0
Updated

baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.17.0-TP2
Updated
•
13

baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.17.0-TP1
Updated
•
31

baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-H100-80GB-HBM3-v0.17.0-TP2
Updated
•
12

baseten/btest-TinyLlama-1.1B-Chat-v1.0-NVIDIA-H100-80GB-HBM3-v0.17.0-TP1
Updated
•
27

baseten/btest-Llama-3.1-70B-Instruct-NVIDIA-H100-80GB-HBM3-v0.17.0-TP4
Updated
•
9

baseten/btest-Llama-3.1-8B-Instruct-NVIDIA-H100-80GB-HBM3-v0.17.0-TP4
Updated
•
7

baseten/whisper_trt_large_v3_turbo_test20250306_NVIDIA_H100_80GB_HBM3_MIG_3g_40gb_0_13_0
Updated