-
Notifications
You must be signed in to change notification settings - Fork 146
Open
Description
This may be a concise question. Perhaps because of my shortage of knowledge, what I found in some representative papers (like the original diffusion policy code) is that authors always predict the original noise (epsilon), not the original sample in their models. Can I ask the reason why you choose the sample, not the epsilon, to predict? Is there any advantage to predict the sample over the epsilon and do some previous works in academics show that prediction on samples truly improve the performance? Thanks for your reply. Really appreciate to your work
Metadata
Metadata
Assignees
Labels
No labels