Conversation

@vignesh1507 (Contributor)

I fixed a runtime error and cleaned up minor issues in `pretrain_retro.py`:

What I changed

- Added missing import: `from importlib import import_module`. Reason: `import_module(...)` is called in `core_model_provider` when `args.spec` is set; without this import the code throws a `NameError` at runtime (see the sketch after this list).
- Removed unused import: `MegatronTokenizer` was imported but never used; removing it avoids noise and lint warnings.
- Improved log message: changed `print_rank_0('building GPT model ...')` to `print_rank_0('building Retro model ...')` to accurately reflect the model being constructed.
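
To make the failure mode concrete, here is a minimal, self-contained sketch of the bug class (the helper name and spec format are placeholders, not the actual `pretrain_retro.py` code; there the call sits in `core_model_provider` behind `args.spec`):

```python
from importlib import import_module  # the one-line fix this PR adds

def load_spec(spec_path):
    # Without the import above, this call raises
    #   NameError: name 'import_module' is not defined
    # the first time the spec-loading path is taken.
    return import_module(spec_path)

print(load_spec("json"))  # <module 'json' from '...'>
```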
Why

- The missing import is a definite runtime bug that breaks the experimental spec-loading path (`args.spec`).
- Removing unused imports keeps the codebase clean and avoids linter complaints.
- Updating the log message reduces confusion when reading logs: this script builds a Retro model, not a GPT model.
Testing / How I validated

- Static review to confirm the `import_module` call site and the missing import (a sketch of one way to script this check follows the list).
- Confirmed `MegatronTokenizer` is not referenced anywhere else in the file.
- Confirmed the corrected log message matches the behavior of `core_model_provider`.
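
For anyone who wants to repeat the static check, here is a small sketch using the standard-library `ast` module (run from the directory containing the file; the filename is the one under review, everything else is generic):

```python
# Sketch of the static validation: parse pretrain_retro.py and check that
# import_module is imported and that MegatronTokenizer is never referenced.
import ast

tree = ast.parse(open("pretrain_retro.py").read())

# Names actually referenced in the body (import statements create aliases,
# not ast.Name nodes, so unused imports do not show up here).
used = {node.id for node in ast.walk(tree) if isinstance(node, ast.Name)}

# Names bound by import / from-import statements.
imported = set()
for node in ast.walk(tree):
    if isinstance(node, (ast.Import, ast.ImportFrom)):
        imported.update(alias.asname or alias.name.split(".")[0]
                        for alias in node.names)

print("import_module imported:", "import_module" in imported)
print("import_module used:    ", "import_module" in used)
print("MegatronTokenizer used:", "MegatronTokenizer" in used)
```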
Risk / Backward compatibility

- Low risk: the change only adds an import, removes an unused import, and updates a log string. No behavior or functionality changes to the model logic.
- If other modules relied on `MegatronTokenizer` being importable from this module (unlikely), they should import it explicitly from `megatron.core.tokenizers`, as shown below.
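
For completeness, the direct import any such dependent module would need (module path taken from the note above; I have not verified it against the current tree):

```python
# Replaces reliance on the transitive import through pretrain_retro.
from megatron.core.tokenizers import MegatronTokenizer
```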

@copy-pr-bot (bot) commented Nov 2, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@yashaswikarnati (Contributor)

/ok to test d019406

@github-actions (bot) commented Nov 2, 2025

Thank you for your contribution!

NVIDIA Megatron-LM is currently transitioning to development on Github. We will aim to review your PR after we complete our transition and stabilize our Github development process.

Thank you for your understanding.
