optimizer / build /torch27-cxx11-cu128-x86_64-linux
5.67 MB
iamwyldecat's picture
fix(muon): delete intermediate tensors immediately to lower peak mem usage
bdd2678