optimizer / build /torch26-cxx11-cu118-x86_64-linux
5.38 MB
iamwyldecat's picture
fix(muon): delete intermediate tensors immediately to lower peak mem usage
bdd2678