Broadcasting issue when using training batch size > 1

Hi, there is an issue with broadcasting when using batch size > 1 with the line:
https://github.com/XLabs-AI/x-flux/blob/main/train_flux_lora_deepspeed.py#L254
the shape of t should be (batch_size, 1, 1) to be broadcasted and multiplied with x_0 and x_t

You can try and fix it this way:

https://github.com/aravindhv10/x-flux/blob/aravind_prodigy_dataset/train_flux_lora_deepspeed.py#L253-L256

or perhaps there might be a better way? The above method worked for me.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Broadcasting issue when using training batch size > 1 #102

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Broadcasting issue when using training batch size > 1 #102

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions