Add Youtu-LLM model #43166

LuJunru · 2026-01-08T10:02:59Z

What does this PR do?

This PR adds the implementation for the released Youtu-LLM model. The model has the following features:

Type: Autoregressive Causal Language Models with Dense MLA
Release versions: Base and Instruct

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@ArthurZucker @Cyrilvallez

LuJunru · 2026-01-08T11:01:59Z

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=43166&sha=5dab39

Hi @ArthurZucker @Cyrilvallez

May I ask if it is possible to concentrate the test only on Youtu-LLM (the new model)? The summary here seems report errors raised by other models.

junru

…ition_embedding in DiT (huggingface#43068) * qwen2_5_omni: make max_mel_frames an inference-time knob * not fail with raising ValueError, instead make it continue to run by choosing a target_duration that's capped and aligned * added unit tests for Token2WavShape shape mismatch Signed-off-by: Dong Wang <[email protected]> * make fixup * remove unit test which takes too much GPU memory Signed-off-by: Dong Wang <[email protected]> * reduce gpu memory usage from the unit test * addressed comments Signed-off-by: Dong Wang <[email protected]> --------- Signed-off-by: Dong Wang <[email protected]>

LuJunru · 2026-01-09T01:51:26Z

Hi @ArthurZucker @Cyrilvallez

It seems Youtu-LLM-related codes have passed the auto review. The remaining check fails on other models.

github-actions · 2026-01-09T22:40:04Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto, youtu_llm

github-actions · 2026-01-09T22:48:36Z

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=43166&sha=fab87c

add Youtu-LLM model

5dab39b

LuJunru mentioned this pull request Jan 8, 2026

Add Youtu-LLM model #43165

Closed

5 tasks

Merge branch 'main' into add-youtu-llm

7a87762

LuJunru and others added 7 commits January 8, 2026 19:14

add testing indicators in model test

22eaca4

upgrade code quality according to latest main branch

e2b6dcb

Merge branch 'main' into add-youtu-llm

1a8ca39

correct unnecessary tokenizer annotation

014919a

resolve conflicts

7be5052

Merge branch 'main' into add-youtu-llm

ac69b0c

LuJunru added 3 commits January 9, 2026 22:55

Merge branch 'main' into add-youtu-llm

23ca813

Merge branch 'main' into add-youtu-llm

b226ed1

Merge branch 'main' into add-youtu-llm

fab87c3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Youtu-LLM model #43166

Add Youtu-LLM model #43166

LuJunru commented Jan 8, 2026

Uh oh!

LuJunru commented Jan 8, 2026

Uh oh!

LuJunru commented Jan 9, 2026

Uh oh!

github-actions bot commented Jan 9, 2026

Uh oh!

github-actions bot commented Jan 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add Youtu-LLM model #43166

Are you sure you want to change the base?

Add Youtu-LLM model #43166

Conversation

LuJunru commented Jan 8, 2026

What does this PR do?

Before submitting

Who can review?

Uh oh!

LuJunru commented Jan 8, 2026

Uh oh!

LuJunru commented Jan 9, 2026

Uh oh!

github-actions bot commented Jan 9, 2026

Uh oh!

github-actions bot commented Jan 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants