
Conversation

@Dibyajyoti-Chakraborty
Collaborator

PhysicsNeMo Pull Request

Description

Added an option to use multiple workers per GPU in generate_data.py for diffusion_FWI. The script can now run multiple worker processes (default 8) per GPU for faster data generation.
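
For context, a minimal sketch (assumed names, not the exact diff) of how a per-GPU worker count can be wired into a multiprocessing pool; process_file, file_list, and the hard-coded output path and source frequency below are placeholders for the script's own values:

import argparse
from multiprocessing import Pool

import torch


def process_file(task: tuple[str, str, int, float]) -> None:
    filepath, output_path, gpu_id, source_frequency = task
    device = torch.device(f"cuda:{gpu_id}")
    # placeholder: load the model and data onto `device` and run generation
    ...


if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--n_workers",
        type=int,
        default=8,
        help="Number of worker processes per GPU",
    )
    cli = parser.parse_args()

    num_gpus = torch.cuda.device_count()
    file_list: list[str] = []  # placeholder: input files to process

    tasks = [
        (filepath, "outputs/", i % num_gpus, 15.0)
        for i, filepath in enumerate(file_list)
    ]
    # the pool size scales with both the GPU count and the workers per GPU
    with Pool(processes=num_gpus * cli.n_workers) as pool:
        pool.map(process_file, tasks)

A hypothetical invocation would look like `python generate_data.py --n_workers 8` (any other flags the script requires are omitted here).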

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.
  • The CHANGELOG.md is up to date with these changes.
  • An issue is linked to this pull request.

Dependencies

Review Process

All PRs are reviewed by the PhysicsNeMo team before merging.

Depending on which files are changed, GitHub may automatically assign a maintainer for review.

We are also testing AI-based code review tools (e.g., Greptile), which may add automated comments with a confidence score.
This score reflects the AI's assessment of merge readiness and is not a qualitative judgment of your work, nor an indication that the PR will be accepted or rejected.

AI-generated feedback should be reviewed critically for usefulness.
You are not required to respond to every AI comment, but they are intended to help both authors and reviewers.
Please react to Greptile comments with 👍 or 👎 to provide feedback on their accuracy.

@greptile-apps
Contributor

greptile-apps bot commented Nov 13, 2025

Greptile Overview

Greptile Summary

This PR adds a --n_workers command-line argument to the diffusion_FWI data generation script, allowing multiple worker processes per GPU (default 8) for improved parallel processing throughput. The change modifies the multiprocessing pool to use num_gpus * n_workers total processes while distributing work across GPUs using modulo-based assignment. While this could improve performance by better utilizing GPU resources, there are critical resource management concerns with multiple workers potentially overwhelming individual GPUs.
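
As a toy illustration (assumed values) of the distribution described above: with 2 GPUs and the default of 8 workers per GPU, the pool holds 16 processes and consecutive files alternate between the two devices:

num_gpus, n_workers = 2, 8
file_list = [f"shot_{i}.npy" for i in range(6)]  # placeholder file names
print([(f, i % num_gpus) for i, f in enumerate(file_list)])
# [('shot_0.npy', 0), ('shot_1.npy', 1), ('shot_2.npy', 0), ...]
print(num_gpus * n_workers)  # 16 processes in the pool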

PR Description Notes:

  • The checklist indicates tests, documentation, changelog, and linked issues are not addressed
  • No new dependencies are introduced

Important Files Changed

Filename: examples/geophysics/diffusion_fwi/data/generate_data.py
Score: 2/5
Overview: Added configurable workers per GPU, but introduces potential GPU memory conflicts and has formatting issues

Critical Issues Identified:

  1. GPU Memory Management Risk: Multiple workers assigned to the same GPU through modulo operation (i % num_gpus) could cause CUDA out-of-memory errors since there's no coordination or memory limiting between workers on the same device.

  2. Missing Resource Constraints: No mechanism to prevent GPU oversubscription or manage memory allocation when multiple workers target the same GPU simultaneously.

  3. Code Quality Issues:

    • Inconsistent formatting in the logging statement with unnecessary line continuation and extra spaces
    • The argument help text has a minor grammatical issue ("Num" should be "Number")

  4. Missing Documentation: The PR checklist shows documentation and changelog updates are incomplete, which is important for user-facing CLI changes.

Recommendation: Before merging, implement proper GPU memory management (e.g., memory limits per worker, GPU locks, or torch multiprocessing) and address the formatting issues. Consider adding validation to ensure n_workers doesn't exceed reasonable limits for available GPU memory.
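
One possible shape for such a safeguard (a sketch under assumptions: the 0.9 headroom factor, the 16-worker cap, and the helper names are not part of the PR), using torch.cuda.set_per_process_memory_fraction, which each worker would call before loading the model:

import torch


def cap_worker_memory(gpu_id: int, n_workers: int, headroom: float = 0.9) -> None:
    # limit this worker's share of the caching allocator so that n_workers
    # processes on the same device cannot jointly exceed ~90% of its VRAM
    torch.cuda.set_per_process_memory_fraction(headroom / n_workers, device=gpu_id)


def validate_n_workers(n_workers: int, max_workers: int = 16) -> int:
    # guard against oversubscribing a GPU before the pool is created
    if not 1 <= n_workers <= max_workers:
        raise ValueError(f"--n_workers must be between 1 and {max_workers}")
    return n_workers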

Confidence Score: 2/5 - While the performance optimization intent is valid, the implementation has significant resource management risks that could cause runtime failures.


@greptile-apps greptile-apps bot left a comment


1 file reviewed, 2 comments


Comment on lines 417 to 419
args: list[tuple[str, str, int, int]] = [
(filepath, output_path, i % num_gpus, user_source_frequency)
for i, filepath in enumerate(file_list)
Contributor


logic: Multiple workers will be assigned to the same GPU (via modulo), but each worker loads models/data onto the same GPU device without coordination. This could cause CUDA out-of-memory errors. Have you tested this with multiple workers per GPU to ensure GPU memory usage doesn't exceed available VRAM?

Collaborator Author


It has been tested. The default n_workers value was chosen based on those tests.

String formatting in logging

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>