Why is the minimum of weight quantization one more than that of activation and bias quantization? #11521

codereba · 2025-06-10T09:48:15Z

While reading the code of get_8a8w_qnn_ptq_config, I noticed that the minimum value for weight quantization is one more than the minimum for activation and bias quantization. Please refer to:
https://github.com/pytorch/executorch/blob/main/backends/qualcomm/quantizer/qconfig.py#L88

weight_quantization_spec = QuantizationSpec(
    dtype=torch.int8,
    quant_min=torch.iinfo(torch.int8).min + 1,
    quant_max=torch.iinfo(torch.int8).max,
    qscheme=torch.per_tensor_symmetric,
    ch_axis=0,
    observer_or_fake_quant_ctr=MinMaxObserver.with_args(**extra_args),
)

I think the difference between the minimum value for weight quantization and the minimum for activation and bias quantization should not be a hard-coded constant of 1.

Also, there should first be a check that the minimum value is lower than the maximum value.

I am just trying to help make the code as good as possible. Could you please explain this choice?

Thanks for the great work on ExecuTorch.

cccclai · 2025-06-16T23:26:36Z

Collaborator

@shewu-quic @chunit-quic @haowhsu-quic @qiurc would you mind helping answer this question? I'm guessing it's due to a kernel restriction, but I would like to confirm.

haowhsu-quic · 2025-06-17T00:37:01Z

Hi @codereba, thank you for reading through the code. For parameter quantization, HTP requires symmetric encoding. Since the [min, max] range of int8 is [-128, 127], we have to add 1 to the minimum manually to get a symmetric range, i.e. [-127, 127], so that the zero point maps exactly to 0.
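
To make the range argument concrete, here is a minimal pure-Python sketch of per-tensor symmetric quantization with the zero point fixed at 0. It is only an illustration: the helper names symmetric_range and quantize_symmetric are hypothetical, and this is not the code used by ExecuTorch or by the HTP kernels. It also includes the kind of min/max sanity check asked about in the question.

```python
# Illustrative sketch only. The helper names are hypothetical and are not
# part of ExecuTorch or the Qualcomm backend.

INT8_MIN, INT8_MAX = -128, 127  # full two's-complement int8 range


def symmetric_range(qmin, qmax):
    """Narrow [qmin, qmax] to the largest range that is symmetric around 0."""
    if qmin >= qmax:
        raise ValueError("qmin must be lower than qmax")
    bound = min(-qmin, qmax)
    return -bound, bound


def quantize_symmetric(values, qmin, qmax):
    """Quantize floats with zero point 0, so that real 0.0 maps to integer 0."""
    if qmin != -qmax:
        raise ValueError("a symmetric encoding needs qmin == -qmax")
    max_abs = max(abs(v) for v in values)
    # One scale covers both signs because the integer range is symmetric.
    scale = max_abs / qmax if max_abs > 0 else 1.0
    quantized = [max(qmin, min(qmax, round(v / scale))) for v in values]
    return quantized, scale


qmin, qmax = symmetric_range(INT8_MIN, INT8_MAX)  # [-127, 127]
weights = [-0.5, 0.0, 0.25, 0.5]
quantized, scale = quantize_symmetric(weights, qmin, qmax)
print(qmin, qmax, quantized, scale)
```

With the full [-128, 127] range, -128 would have no positive counterpart, so the encoding could not be symmetric around a zero point of 0. Dropping that single value is why the offset is exactly 1 for int8.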

1 reply

codereba Jun 17, 2025
Author

I understand now.
Great, thank you.
