You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Module has an accGradInput method, which accumulates grad into a tensor. So we need to zero that tensor at the beginning of each iteration.
This is not necessary if we only accumulate once in each iteration. We can provide a mode flag. If the user set that flag, the accumulating gradient will become overwrite the tensor, which can save the zeroGrad time in the training.
Activity
update doc (intel#1524)