Running Inference Benchmarking Results Phi-4 (8000 Tokens) 📊 Generate detailed latency metrics for model benchmarks
Sleeping Inference Benchmarking Results Phi-4 (200 Tokens) 📊 Visualize benchmarking results for models
textgeflecht/Qwen2.5-Coder-32B-Instruct-FP8-dynamic Text Generation • 33B • Updated 22 days ago • 152