Show inference time for both models

#2

Added an extra output that shows the inference time for both models. I removed the @GPU decorator from the `detect` function because it was causing a ~400ms hit to the first model's inference time (presumably due to ZeroGPU initialization, which the second model then benefits from). With the decorator removed, both models take the same hit, which makes for a more apples-to-apples comparison.
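
For reference, a minimal sketch of the timing idea, not the Space's actual code: `model_a` and `model_b` are hypothetical stand-ins for the two loaded models, and only the `detect` name comes from the PR. Note there is intentionally no GPU decorator on `detect`, so any warm-up cost hits both models equally.

```python
import time

# Hypothetical stand-ins for the two detection models being compared;
# in the real Space these would be the already-loaded model objects.
def model_a(image):
    return f"model_a result for {image}"

def model_b(image):
    return f"model_b result for {image}"

# No GPU decorator here, so ZeroGPU warm-up is shared by both models.
def detect(image):
    """Run both models on the same input and report each one's inference time."""
    outputs, timings = {}, {}
    for name, model in (("model_a", model_a), ("model_b", model_b)):
        start = time.perf_counter()
        outputs[name] = model(image)
        timings[name] = time.perf_counter() - start
    timing_report = "\n".join(f"{name}: {t * 1000:.1f} ms" for name, t in timings.items())
    return outputs["model_a"], outputs["model_b"], timing_report

if __name__ == "__main__":
    _, _, report = detect("example.jpg")
    print(report)
```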

Here's what it looks like:

Screenshot 2025-07-03 at 3.58.31 AM.png

Thanks! This is super useful!

sergiopaniego changed pull request status to merged
