Hi, there is a broadcasting issue when using a batch size > 1, at this line:
https://github.com/XLabs-AI/x-flux/blob/main/train_flux_lora_deepspeed.py#L254
The shape of t should be (batch_size, 1, 1) so that it broadcasts correctly when multiplied with x_0 and x_t.
You can try and fix it this way:
https://github.com/aravindhv10/x-flux/blob/aravind_prodigy_dataset/train_flux_lora_deepspeed.py#L253-L256
Perhaps there is a better way, but the above method worked for me.
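
For reference, here is a minimal standalone sketch of the shape problem and one possible fix. It is not copied from the training script: the tensor sizes, the names x_1 and t_b, and the interpolation formula are assumptions chosen only to illustrate the broadcasting behaviour.

```python
import torch

# Illustrative sizes only (assumption, not taken from the training script).
batch_size, seq_len, dim = 2, 4, 8

x_1 = torch.randn(batch_size, seq_len, dim)   # stand-in for the packed latents
x_0 = torch.randn_like(x_1)                   # stand-in for the noise
t = torch.sigmoid(torch.randn(batch_size))    # shape: (batch_size,)

# A 1-D t is aligned against the LAST dimension of x_1 (dim), not the batch
# dimension, so with batch_size=2 and dim=8 the shapes are incompatible.
try:
    _ = (1 - t) * x_1 + t * x_0
    broadcast_failed = False
except RuntimeError:
    broadcast_failed = True

# Reshaping t to (batch_size, 1, 1) gives one scalar per sample that
# broadcasts over the sequence and feature dimensions.
t_b = t.view(-1, 1, 1)
x_t = (1 - t_b) * x_1 + t_b * x_0
```

With a batch size of 1, t has shape (1,) and broadcasts against anything, which would explain why the problem only appears for larger batches. Note also that if batch_size happened to equal the last dimension, the unreshaped version would run without error but mix values across the wrong axis.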