Setting Train/Validation Split for an MoE Model #1048

kjrstory · 2025-08-06T08:10:08Z

I have a question about setting up the train and validation splits for the "MoE Gating Network for External Aerodynamics". Is there a recommended guideline? Specifically, should I reuse the same train/validation sets that were used to train DoMINO, FigConvNet, and X-MeshGraphNet (XAeroNet)?

In my current pipelines, VTP files are generated only for the validation cases of those three models. For MoE training, I would need to generate VTPs for the training cases as well. Does this approach make sense, or is there a better practice you recommend?

Answered by Dibyajyoti-Chakraborty


Dibyajyoti-Chakraborty (Maintainer) · 2025-08-13T18:55:16Z

You do not need to reuse the train/val split that was used to train the individual experts. In some corner cases, reusing it may even be a bad idea: if an expert performs very well on its training cases but not on validation cases, the MoE may learn to put a larger weight on that particular expert. If the experts consistently show similar skill on training and validation cases, the choice does not matter.
So, feel free to use your own mix of training and validation cases, as long as the two sets are independent and the training set does not influence the validation set in any way. I suggest checking each individual model's performance on your train/val split; it should be consistent across the two sets. For example, if expert 1 is on average about 10% better than expert 2 on the train set, the gap should be similar on the val set too.
That said, it would be interesting to see what happens when this consistency is not present. Hope this answers your question.
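
A minimal sketch of the check described above, assuming you already have one scalar error per case for each expert (lower is better). The function names, the 5% tolerance, and the toy error values are all hypothetical and only for illustration; they do not come from the MoE example or from any library.

```python
import random
from statistics import mean


def split_cases(case_ids, val_fraction=0.2, seed=0):
    """Split case IDs into disjoint train and validation lists."""
    ids = sorted(set(case_ids))          # de-duplicate so a case cannot land in both sets
    random.Random(seed).shuffle(ids)     # seeded shuffle for a reproducible split
    n_val = max(1, round(len(ids) * val_fraction))
    val_ids, train_ids = ids[:n_val], ids[n_val:]
    assert not set(train_ids) & set(val_ids)  # the two sets must be independent
    return train_ids, val_ids


def relative_gap(errors_a, errors_b, case_ids):
    """Relative advantage of expert A over expert B on the given cases.

    Returns (mean_b - mean_a) / mean_b, so 0.10 means A has about 10% lower error.
    """
    mean_a = mean(errors_a[c] for c in case_ids)
    mean_b = mean(errors_b[c] for c in case_ids)
    return (mean_b - mean_a) / mean_b


def is_consistent(errors_a, errors_b, train_ids, val_ids, tol=0.05):
    """Compare the relative gap on train and val; tol is an assumed threshold."""
    gap_train = relative_gap(errors_a, errors_b, train_ids)
    gap_val = relative_gap(errors_a, errors_b, val_ids)
    return abs(gap_train - gap_val) <= tol, gap_train, gap_val


if __name__ == "__main__":
    cases = [f"run_{i}" for i in range(10)]      # hypothetical case names
    train_ids, val_ids = split_cases(cases)
    # Toy per-case errors, made up for illustration only.
    expert_1 = {c: 0.9 for c in cases}
    expert_2 = {c: 1.0 for c in cases}
    ok, gap_train, gap_val = is_consistent(expert_1, expert_2, train_ids, val_ids)
    print(f"train gap={gap_train:.2f}, val gap={gap_val:.2f}, consistent={ok}")
```

If the check fails for some pair of experts, that is the corner case mentioned above: the gating network may end up favoring an expert whose advantage exists only on the training cases.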
