optimizer / build /torch27-cxx11-cu128-x86_64-linux
1.9 MB
iamwyldecat's picture
refactor(muon): change argument adam_wd to weight_decay and handle params' type
02ac540