Upload architectureV3.py

#3
by win10 - opened

Fix RuntimeError on h100: Trying to backward through the graph a second time (or directly access saved tensors after they have already been freed). Saved intermediate values ​​of the graph are freed when you call .backward() or autograd.grad(). Specify retain_graph=True if you need to backward through the graph a second time or if you need to access saved tensors after calling backward.

@moelanoby You can take a look at it, I hope you won't find it disrespectful.

If it does fix it on an H100 GPU then sure I won't find it disrespectful

moelanoby changed pull request status to merged

Sign up or log in to comment