Fix: Guard against None num_query_tokens in Blip2Processor (to avoid TypeError) #42311

Flakes342 · 2025-11-20T20:46:39Z

What does this PR do?

Fixes #42203

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Fixed a bug where Blip2Processor assumes num_query_tokens is an int and does `max_length - self.num_query_tokens` which raises TypeError when `num_query_tokens` is None.

This change:

Treats num_query_tokens as 0 when it is None (with a warning).
Adds a minimal unit test to prevent regression.

Who can review?

@Rocketknight1

Rocketknight1 · 2025-11-21T16:59:16Z

cc @zucchini-nlp! I think it's fine without the warning message, but deferring to her on what behaviour we want from processors 😅

zucchini-nlp

@Flakes342 thanks for the PR!

To support older models from the hub which weren't updated, I think we should update the default value of num_query_tokens.

The modeling code assumes that input_ids have special image placeholders and with self.num_query_tokens=None the placeholders will not be added in the input text. Therefore the model will not see an input image

Let's add the default value of 32 which is the only value used in all official checkpoints. Ping me for another review when ready :)

HuggingFaceDocBuilderDev · 2025-11-24T09:17:20Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…ust-loss into bug/nqt

Flakes342 · 2025-11-24T23:08:25Z

@zucchini-nlp thr pr is ready for your review

github-actions · 2025-11-26T00:48:55Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: blip_2

zucchini-nlp · 2025-11-26T08:17:18Z

run-slow: blip_2

zucchini-nlp · 2025-11-26T08:17:57Z

src/transformers/models/blip_2/processing_blip_2.py

            Number of tokens used by the Qformer as queries, should be same as in model's config.
    """

    def __init__(self, image_processor, tokenizer, num_query_tokens=None, **kwargs):


Sorry if I wasn't clear, I meant the default value in the signature here

github-actions · 2025-11-26T08:18:32Z

This comment contains run-slow, running the specified jobs:

models: ["models/blip_2"]
quantizations: []

github-actions · 2025-11-26T08:38:46Z

CI Results

Workflow Run ⚙️

Model CI Report

❌ Failed tests

blip_2:
tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_itm
tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_itm_fp16

Flakes342 added 3 commits November 21, 2025 02:10

nqt fixed

dbb8e5c

Merge branch 'huggingface:main' into bug/nqt

a8975b5

Merge branch 'main' into bug/nqt

8f10994

zucchini-nlp reviewed Nov 24, 2025

View reviewed changes

Flakes342 added 4 commits November 25, 2025 03:40

Defaults to 32 and prevents negative truncation values

4db134f

Defaults to 32 and prevents negative truncation values

adabecd

Merge branch 'bug/nqt' of https://github.com/Flakes342/transformers-c…

4904cef

…ust-loss into bug/nqt

Merge branch 'main' into bug/nqt

13e0b82

Merge branch 'main' into bug/nqt

fcba610

zucchini-nlp reviewed Nov 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix: Guard against None num_query_tokens in Blip2Processor (to avoid TypeError) #42311

Fix: Guard against None num_query_tokens in Blip2Processor (to avoid TypeError) #42311

Uh oh!

Flakes342 commented Nov 20, 2025

Uh oh!

Rocketknight1 commented Nov 21, 2025

Uh oh!

zucchini-nlp left a comment •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Nov 24, 2025

Uh oh!

Flakes342 commented Nov 24, 2025

Uh oh!

github-actions bot commented Nov 26, 2025

Uh oh!

zucchini-nlp commented Nov 26, 2025

Uh oh!

zucchini-nlp Nov 26, 2025

Uh oh!

github-actions bot commented Nov 26, 2025

Uh oh!

github-actions bot commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Fix: Guard against None num_query_tokens in Blip2Processor (to avoid TypeError) #42311

Are you sure you want to change the base?

Fix: Guard against None num_query_tokens in Blip2Processor (to avoid TypeError) #42311

Uh oh!

Conversation

Flakes342 commented Nov 20, 2025

What does this PR do?

Before submitting

Fixed a bug where Blip2Processor assumes num_query_tokens is an int and does max_length - self.num_query_tokens which raises TypeError when num_query_tokens is None.

Who can review?

Uh oh!

Rocketknight1 commented Nov 21, 2025

Uh oh!

zucchini-nlp left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Nov 24, 2025

Uh oh!

Flakes342 commented Nov 24, 2025

Uh oh!

github-actions bot commented Nov 26, 2025

Uh oh!

zucchini-nlp commented Nov 26, 2025

Uh oh!

zucchini-nlp Nov 26, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Nov 26, 2025

Uh oh!

github-actions bot commented Nov 26, 2025

CI Results

Model CI Report

❌ Failed tests

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Fixed a bug where Blip2Processor assumes num_query_tokens is an int and does `max_length - self.num_query_tokens` which raises TypeError when `num_query_tokens` is None.

zucchini-nlp left a comment •

edited

Loading