When running code on binary classification, does it mean that we only need to modify the num_classes = 2 instead of 10 as default? When I tried it, I found a lot of perturbed samples as NaN value. Have you ever found this before?