[bugfix] fix glm5.2 indexer_type 'shared' by Jintao-Huang · Pull Request #134 · modelscope/mcore-bridge

Jintao-Huang · 2026-06-23T12:40:03Z

No description provided.

gemini-code-assist

Code Review

This pull request updates the GPT bridge to conditionally handle the indexer state dict update when using the 'dsa' experimental attention variant by checking and reducing the indexer's presence across the pipeline parallel group. Feedback suggests using defensive programming with getattr to avoid potential AttributeError exceptions when accessing core_attention and indexer on mg_attn.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

Jintao-Huang · 2026-06-23T15:53:30Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces support for GLM 5.2 with a shared DSA (Dense Sparse Attention) indexer by adding the glm_moe_dsa model implementation. This includes custom attention, GPT model, transformer block, and loader classes to manage shared indexer states across pipeline parallel stages, as well as bridge updates for handling indexer states. The feedback suggests defensive retrieval of indexer_types to prevent potential AttributeErrors and specifying the dimension in squeeze() on the attention mask to avoid unintended squeezing of the batch dimension when the batch size is 1.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

gemini-code-assist · 2026-06-23T15:55:41Z

+
+    def __init__(self, config, submodules, layer_number, *args, **kwargs):
+        super().__init__(config, submodules, layer_number, *args, **kwargs)
+        indexer_types = config.hf_config.indexer_types


To prevent potential AttributeError if config.hf_config is None, it is safer to use getattr to retrieve indexer_types defensively.

Suggested change

indexer_types = config.hf_config.indexer_types

indexer_types = getattr(getattr(config, 'hf_config', None), 'indexer_types', None)

gemini-code-assist · 2026-06-23T15:55:41Z

+        else:
+            b = query.size(1)
+            assert attention_mask.shape == (b, 1, sq, skv)
+            mask = attention_mask.squeeze()


Using attention_mask.squeeze() without specifying a dimension is dangerous because if the batch size b is 1, it will squeeze the batch dimension as well, leading to inconsistent tensor shapes (e.g., (sq, skv) instead of (1, sq, skv)). Specifying the dimension to squeeze (dimension 1, which is the singleton dimension for attention heads/groups) ensures consistent shapes regardless of the batch size.

Suggested change

mask = attention_mask.squeeze()

mask = attention_mask.squeeze(1)

gemini-code-assist · 2026-06-23T15:55:41Z

+    def get_transformer_layer_spec(self, vp_stage: Optional[int] = None):
+        transformer_layer_spec = super().get_transformer_layer_spec(vp_stage)
+
+        indexer_types = self.config.hf_config.indexer_types


To prevent potential AttributeError if self.config.hf_config is None, it is safer to use getattr to retrieve indexer_types defensively.

Suggested change

indexer_types = self.config.hf_config.indexer_types

indexer_types = getattr(getattr(self.config, 'hf_config', None), 'indexer_types', None)

update

cfc634b

gemini-code-assist Bot reviewed Jun 23, 2026

View reviewed changes

Comment thread src/mcore_bridge/bridge/gpt_bridge.py

Jintao-Huang added 3 commits June 23, 2026 20:44

fix glm5.2 indexer shared

b13eb6f

update

600ee06

update

e618f29

Jintao-Huang changed the title ~~[bugfix] fix glm5 indexer shared~~ [bugfix] fix glm5.2 indexer_type 'shared' Jun 23, 2026

hjh0119 approved these changes Jun 23, 2026

View reviewed changes

fix

c40515f

gemini-code-assist Bot reviewed Jun 23, 2026

View reviewed changes

Jintao-Huang merged commit 86312fc into modelscope:main Jun 23, 2026
1 check passed

Jintao-Huang mentioned this pull request Jun 23, 2026

sft lora 训练glm 5.2时出现KeyError modelscope/ms-swift#9617

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bugfix] fix glm5.2 indexer_type 'shared'#134

[bugfix] fix glm5.2 indexer_type 'shared'#134
Jintao-Huang merged 5 commits into
modelscope:mainfrom
Jintao-Huang:fix_glm_5_indexer_shared

Jintao-Huang commented Jun 23, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Jintao-Huang commented Jun 23, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Jun 23, 2026

Uh oh!

gemini-code-assist Bot Jun 23, 2026

Uh oh!

gemini-code-assist Bot Jun 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	indexer_types = config.hf_config.indexer_types
	indexer_types = getattr(getattr(config, 'hf_config', None), 'indexer_types', None)

	mask = attention_mask.squeeze()
	mask = attention_mask.squeeze(1)

	indexer_types = self.config.hf_config.indexer_types
	indexer_types = getattr(getattr(self.config, 'hf_config', None), 'indexer_types', None)

Conversation

Jintao-Huang commented Jun 23, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Jintao-Huang commented Jun 23, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Jun 23, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants