Question about training data mixture. （训练数据混合问题）

Does each mini-batch include both multimodal understanding and image generation data? In [this code](https://github.com/AILab-CVC/SEED-X/blob/65292a65ea3aa9542b719836e4c6bd5e31426172/src/models/mllm/modeling_llama_xformer.py#L731), will the LLM loss turn NaN due to the absence of multimodal understanding data in a mini-batch?
What to do when a mini-batch only contains single modality data?

 当单卡上的数据全为一种模态时（比如只有image generation 模态的数据），llm loss会变成nan，请问这个情况怎么处理? 
这种不平衡的mini-batch data是否会影响模型的效果？

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Question about training data mixture. （训练数据混合问题） #30

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Question about training data mixture. （训练数据混合问题） #30

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions