training problem！！！！！

#31

by yuduan - opened 5 days ago

5 days ago

Hello, I have a question. I followed the Selforcing Plus setup in section 2.2 (High Noise) and ran it with backward simulation. The timestep is set between 900–1000, shift = 5, denoising_step_list = [990, 960, 930, 900], minstep = 900, maxstep = 990, dfake = 5. I’m training on 16 GPUs. The DMD loss decreased from 0.8 to 0.3, while the critic loss went from 0.5 up to as high as 3, and is now oscillating around 1.5 after about 600 steps. Is this normal?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment