[SR-9665] [AD] Memory leaks in AD-synthesized code #52109
Labels
bug
swift for tensorflow
Additional Detail from JIRA
md5: 52bd64279af763b4c0920c828b77f588
Issue Description:
I believe that at least the checkpoints and some of the adjoint intermediates aren't being freed.
For instance, this snippet hits a GPU OOM (on a card with 8GiB VRAM) after 313 steps: