fix qwen3-30-a3b lcb-code score #4142

yao-fengchen · 2025-11-20T11:08:25Z

No description provided.

Copilot

Pull request overview

This PR fixes accuracy issues in the dlinfer backend by improving numerical precision in rotary embeddings, updating tensor type conversions, and upgrading dependencies to stable releases.

Refactored rotary embedding inverse frequency calculation to use native float types and eliminate unnecessary type conversions
Updated Ascend NPU backend tensor operations to use explicit int32 conversions for improved compatibility
Added support for grouped MoE routing with n_groups parameter
Upgraded CANN and torch-npu dependencies from release candidates to stable versions
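
The rotary embedding item above can be illustrated with a minimal sketch. It uses plain Python rather than the backend's tensor code, and the helper name `compute_inv_freq` and the default `base` of 10000.0 are assumptions for illustration, not the PR's actual implementation. It only shows the standard RoPE inverse-frequency formula evaluated entirely in floating point, with no integer intermediate that is later cast.

```python
from typing import List


def compute_inv_freq(dim: int, base: float = 10000.0) -> List[float]:
    """Return RoPE inverse frequencies 1 / base**(i / dim) for i = 0, 2, 4, ...

    Hypothetical helper: the exponents are built directly as floats, so the
    computation never goes through an integer array that is cast afterwards.
    """
    if dim <= 0 or dim % 2 != 0:
        raise ValueError("dim must be a positive even integer")
    base = float(base)  # accept an int base, but compute in floating point
    return [1.0 / (base ** (float(i) / dim)) for i in range(0, dim, 2)]


# One frequency per pair of rotary dimensions.
inv_freq = compute_inv_freq(dim=8)
```

In a real backend the same formula would be evaluated with the framework's tensor operations on the target device; the choice of dtype and device there is outside what this sketch covers.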

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| lmdeploy/pytorch/backends/dlinfer/rotary_embedding.py | Improved numerical precision by changing base parameter to float and refactoring inv_freq calculation to avoid intermediate int64 conversions |
| lmdeploy/pytorch/backends/dlinfer/moe.py | Added n_groups parameter support to align with base SoftmaxTopKBuilder interface |
| lmdeploy/pytorch/backends/dlinfer/ascend/op_backend.py | Updated tensor type conversions to explicitly use int32 for Ascend NPU compatibility |
| docker/Dockerfile_ascend_a3 | Upgraded CANN from 8.3.rc1.alpha002 to 8.3.rc1 and torch-npu from 2.8.0rc1 to 2.8.0 |
| docker/Dockerfile_ascend_a2_300i | Upgraded CANN from 8.3.rc1.alpha002 to 8.3.rc1 and torch-npu from 2.8.0rc1 to 2.8.0 |
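
The moe.py change is about keeping a backend builder's signature in step with the base interface. The sketch below is a rough illustration under stated assumptions: `ToyImpl`, `BaseBuilder`, and `BackendBuilder` are hypothetical stand-ins, not lmdeploy classes, and the default `n_groups=-1` meaning "no grouping" is an assumption. It shows why a backend builder that lacks `n_groups` would fail when a caller passes it, and how accepting and forwarding the parameter avoids that.

```python
class ToyImpl:
    """Hypothetical stand-in for a softmax top-k op; it only stores its config."""

    def __init__(self, top_k: int, dim: int = -1, n_groups: int = -1):
        self.top_k = top_k
        self.dim = dim
        self.n_groups = n_groups  # assumed convention: -1 means no grouping


class BaseBuilder:
    """Hypothetical base interface that callers program against."""

    @staticmethod
    def build(top_k: int, dim: int = -1, n_groups: int = -1) -> ToyImpl:
        raise NotImplementedError


class BackendBuilder(BaseBuilder):
    """Backend builder whose signature matches the base, including n_groups."""

    @staticmethod
    def build(top_k: int, dim: int = -1, n_groups: int = -1) -> ToyImpl:
        # Forward every argument so grouped routing settings are not dropped.
        return ToyImpl(top_k, dim, n_groups)


# A caller written against the base interface can pass n_groups safely.
impl = BackendBuilder.build(top_k=8, dim=-1, n_groups=4)
```

If `BackendBuilder.build` omitted `n_groups`, the call on the last line would raise a `TypeError` for an unexpected keyword argument.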

yao-fengchen added 3 commits November 24, 2025 03:24

calculate inv_freq on device

d0bf5f6

adapt for dlinfer attn

82c0711

update code

9ece6a8

yao-fengchen force-pushed the fix_accuracy branch from 17c386e to 9ece6a8 Compare November 24, 2025 03:25

yao-fengchen added 2 commits November 24, 2025 03:33

fix dlinfer moe para err

7e831e3

update cann version

223d145

jinminxi104 requested a review from Copilot November 24, 2025 07:16

Copilot started reviewing on behalf of jinminxi104 November 24, 2025 07:20

Copilot finished reviewing on behalf of jinminxi104 November 24, 2025 07:22

Copilot AI reviewed Nov 24, 2025

lmdeploy/pytorch/backends/dlinfer/ascend/op_backend.py Outdated

lmdeploy/pytorch/backends/dlinfer/rotary_embedding.py Outdated

update code

fc709b7

jinminxi104 approved these changes Nov 26, 2025

jinminxi104 changed the title ~~fix accuracy~~ fix qwen3-30-a3b lcb-code score Nov 26, 2025

jinminxi104 marked this pull request as ready for review November 26, 2025 02:48

jinminxi104 requested review from grimoire and lvhan028 November 26, 2025 02:58

grimoire approved these changes Nov 26, 2025

lvhan028 added the Bug:P1 label Nov 26, 2025

lvhan028 merged commit 21c22f0 into InternLM:main Nov 26, 2025
14 of 15 checks passed

yao-fengchen deleted the fix_accuracy branch November 28, 2025 08:48
