optimum/neuron-export · Using neuron cache instead of creating a pull-request in the target repo

Our experience with the neuron SDK showed us that a model exported to Neuron for a specific version might not work as expected for the next SDK versions.
This is why storing exported models in dedicated repositories is not the most efficient way to share them: it is better to just store the cached compilation artifacts in the aws-neuron/optimum-neuron_compile-cache directory instead. This can be achieved transparently after having exported a model by calling the optimum neuron cache synchronization helper, either programatically or through the CLI.