### Version

0.2.0

### On which installation method(s) does this occur?

Docker

### Describe the issue

This can mostly be worked around by setting the number of samples in the datapipe (for example [here](https://github.com/NVIDIA/modulus-launch/blob/main/examples/weather/fcn_afno/train_era5.py#L119)) to a value that is divisible by the number of processes/GPUs.

A long-term fix would be to handle automatically the failure cases where the number of samples is not exactly divisible by the number of GPUs.

### Minimum reproducible example

_No response_

### Relevant log output

_No response_

### Environment details

_No response_
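
The workaround described above could be sketched roughly as follows. This is only an illustration, not code from the repository: `round_down_to_multiple` is a hypothetical helper, and the sample count and GPU count are made-up values. It assumes that dropping the leftover samples is acceptable.

```python
def round_down_to_multiple(num_samples: int, world_size: int) -> int:
    """Return the largest count <= num_samples that splits evenly across ranks.

    Hypothetical helper for illustration; not part of the library.
    """
    if world_size < 1:
        raise ValueError("world_size must be at least 1")
    if num_samples < world_size:
        raise ValueError("fewer samples than ranks; each rank needs at least one")
    # Integer division drops the remainder that cannot be shared evenly.
    return (num_samples // world_size) * world_size


# Made-up example values: 1001 samples spread over 8 GPUs.
usable_samples = round_down_to_multiple(1001, 8)
samples_per_rank = usable_samples // 8
print(usable_samples, samples_per_rank)
```

The result would then be passed as the number of samples when constructing the datapipe. Rounding down discards at most `world_size - 1` samples per epoch; padding up to the next multiple by repeating samples is an alternative if no data should be skipped.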