Cannot scale inference endpoint to zero MANUALLY

#21
by ArunSharma93 - opened
  1. My deployed inference endpoint did not scale to zero after 15 minutes of no activity, so I have been charged for 1,000+ un-used minutes while I slept. I have trippled checked and my setup is correct

  2. When trying to scale the inference endpoint down to zero MANUALLY. I am un-able to do so - see attached screen shot

The inference endpoints are not production ready.

Screenshot 2025-03-11 at 09.49.38.png

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment