optimum-internal-testing/optimum-neuron-cache-ci
Updated
@appleCorePotatoes that is correct, we have suspended TPU usage on inference endpoints for now, we do not have this option anymore, but you can find other solutions to deploy, with different price/performance alternatives.