Add Nemotron nano v2 vl #1136

cuichenx · 2025-10-29T16:39:43Z

NVIDIA Nemotron Nano v2 VL is an open 12B multimodal reasoning model for document intelligence and video understanding. It enables AI assistants to extract, interpret, and act on information across text, images, tables, and videos. This makes the model valuable for agents focused on data analysis, document processing and visual understanding in applications like generating reports, curating videos, and dense captioning for media asset management, and retrieval-augmented search.

NeMo Megatron Bridge supports finetuning this model (including LoRA finetuning) on single-image, multi-image, and video datasets. The finetuned model can be converted back to the 🤗 Hugging Face format for downstream evaluation.

The model is currently available in the nvcr.io/nvidia/nemo:25.09.nemotron_nano_v2_vl container. This is the PR to the main branch.

Documentation: https://docs.nvidia.com/nemo/megatron-bridge/latest/models/vlm/nemotron-nano-v2-vl.html
Notable differences compared to the code in the nvcr.io/nvidia/nemo:25.09.nemotron_nano_v2_vl container:

The forward step is renamed to llava_step instead of nemotron_nano_v2_vl_step
The vlm inference script is moved to a standalone script hf_to_megatron_generate_nemotron_vlm.py‎ to distinguish the two different types of models, and the argument --use_llava_model is removed (hard coded into the new script)

Requires this megatron branch: NVIDIA/Megatron-LM#2115

Signed-off-by: yaoyu-33 <[email protected]>

…motron-nano-v2-vl

# Conflicts: # src/megatron/bridge/training/config.py

Signed-off-by: yaoyu-33 <[email protected]>

model Signed-off-by: yaoyu-33 <[email protected]>

Signed-off-by: yaoyu-33 <[email protected]>

Nemotron Nano V2 VL bridge and provider See merge request chcui/Megatron-Bridge!1

Signed-off-by: yaoyu-33 <[email protected]>

HF export See merge request chcui/Megatron-Bridge!2

Signed-off-by: yaoyu-33 <[email protected]>

Signed-off-by: yaoyu-33 <[email protected]> Signed-off-by: Chen Cui <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: Li Ding <[email protected]> Signed-off-by: NeMo Bot <[email protected]>

pablo-garay · 2025-11-17T20:26:37Z

Code LGTM from CICD/tests perspective

liding-nv · 2025-11-18T02:47:12Z

examples/conversion/hf_megatron_roundtrip_multi_gpu.py

        default=None,
        help="Path to load the model in Megatron checkpoint format. If provided, model will not start from HF checkpoint.",
    )
+    parser.add_argument("--not-strict", action="store_true", help="Perform loose validation during weight export")


looks like args.not_strict was not passed to main function... @cuichenx maybe fix this in another pr since this one has been merged

Signed-off-by: yaoyu-33 <[email protected]> Signed-off-by: Chen Cui <[email protected]> Co-authored-by: yaoyu-33 <[email protected]> Co-authored-by: Li Ding <[email protected]>

cuichenx and others added 30 commits September 17, 2025 22:01

add wip code

e63ed61

update utils for transformers config in hydra

7858117

Signed-off-by: yaoyu-33 <[email protected]>

temp save

457bace

Signed-off-by: yaoyu-33 <[email protected]>

pipeclean conversion (forward wip)

22233a2

Merge branch 'refs/heads/main' into qwen-25vl-training

6937da4

vlm generate script updates for nemotron vl

c67f734

Merge remote-tracking branch 'refs/remotes/origin/main' into chcui/ne…

fcca45c

…motron-nano-v2-vl

fix after merging with main

790cd8d

clean up

3a9ab4f

fix forward pass

e0fc7d1

add /no_think sys prompt

44faee0

Merge branch 'refs/heads/main' into qwen-25vl-training

8a51440

# Conflicts: # src/megatron/bridge/training/config.py

lint

3bc6ba5

Signed-off-by: yaoyu-33 <[email protected]>

revert qwen-vl changes in gpt

8061e0f

Signed-off-by: yaoyu-33 <[email protected]>

revert qwen-vl changes in gpt #2

df4755a

Signed-off-by: yaoyu-33 <[email protected]>

Add mock dataset provider for qwen25 vl

975efd2

Signed-off-by: yaoyu-33 <[email protected]>

add qwen25 vl dataset support from auto

be708c2

model Signed-off-by: yaoyu-33 <[email protected]>

lint

6822d34

Signed-off-by: yaoyu-33 <[email protected]>

enable multi image and video inputs

ec9c7cd

update _attn_implementation

bc8c605

Signed-off-by: yaoyu-33 <[email protected]>

update comments

689f491

Signed-off-by: yaoyu-33 <[email protected]>

Merge branch 'chcui/nemotron-nano-v2-vl' into 'dev/nemotron-nano-v2-vl'

cf2c769

Nemotron Nano V2 VL bridge and provider See merge request chcui/Megatron-Bridge!1

add preloaded dataset provider

4f0e90f

Signed-off-by: yaoyu-33 <[email protected]>

enable hf export (need to manually copy over modeling files)

4959ea5

expose strict

98caa7a

update _processor to a private attr

2af0c2e

Signed-off-by: yaoyu-33 <[email protected]>

Merge branch 'chcui/hf_export' into 'dev/nemotron-nano-v2-vl'

4a3ef3b

HF export See merge request chcui/Megatron-Bridge!2

Merge branch 'refs/heads/main' into chcui/nano-v2-vl-training

7f3818e

update qwen training utils

ccf6abe

Signed-off-by: yaoyu-33 <[email protected]>

training bug fix

94c6192

Signed-off-by: yaoyu-33 <[email protected]>

copy-pr-bot bot temporarily deployed to nemo-ci November 16, 2025 21:07 Inactive

cuichenx enabled auto-merge (squash) November 17, 2025 01:19

cuichenx requested review from ananthsub, liding-nv, suiyoubi, yaoyu-33 and yfw November 17, 2025 17:37

suiyoubi approved these changes Nov 17, 2025

View reviewed changes

liding-nv approved these changes Nov 17, 2025

View reviewed changes

pablo-garay approved these changes Nov 17, 2025

View reviewed changes

cuichenx merged commit 04c9f05 into main Nov 17, 2025
43 checks passed

cuichenx deleted the chcui/nemotron-nano-v2-vl branch November 17, 2025 20:26

liding-nv reviewed Nov 18, 2025

View reviewed changes

yfw restored the chcui/nemotron-nano-v2-vl branch November 19, 2025 01:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Nemotron nano v2 vl #1136

Add Nemotron nano v2 vl #1136

Uh oh!

cuichenx commented Oct 29, 2025 •

edited

Loading

Uh oh!

Uh oh!

pablo-garay commented Nov 17, 2025

Uh oh!

liding-nv Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

Add Nemotron nano v2 vl #1136

Add Nemotron nano v2 vl #1136

Uh oh!

Conversation

cuichenx commented Oct 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

pablo-garay commented Nov 17, 2025

Uh oh!

liding-nv Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

cuichenx commented Oct 29, 2025 •

edited

Loading