[ci] Refactor CI files to use new CI pipeline generator #246

khluu · 2025-12-09T00:38:03Z

Change structure & format of CI files to use new vLLM project Buildkite pipeline generator https://github.com/vllm-project/ci-infra/tree/main/buildkite/pipeline_generator

Signed-off-by: Kevin H. Luu <[email protected]>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2025-12-09T00:40:52Z

.buildkite/jobs/tests.yaml

+- label: "Diffusion Model Test"
+  timeout_in_minutes: 15
+  commands:
+    - pytest -s -v tests/single_stage/test_diffusion_model.py


GPU tests no longer run in built container

The GPU test steps are now plain command invocations without the docker or Kubernetes plugins that previously ran them inside public.ecr.aws/q9t5s3a7/vllm-ci-test-repo:$BUILDKITE_COMMIT with HF cache mounts. With this change they execute directly on the host even though the build job still builds and pushes the container, so any GPU agent lacking the full Python environment or pre-seeded HuggingFace cache (common in these pipelines) will fail as soon as pytest starts because the required dependencies/models are missing.

Useful? React with 👍 / 👎.

Signed-off-by: Kevin H. Luu <[email protected]>

congw729 · 2025-12-09T02:40:51Z

.buildkite/jobs/tests.yaml

+    - pytest -s -v tests/multi_stages/
+
+- label: "Omni Model Test with H100"
+  timeout_in_minutes: 20


We used to set the timeout to 15 minutes. @ywang96 Do you agree we set 20 minutes for testing on H100?

The timeout is already 20 minutes on main branch https://github.com/vllm-project/vllm-omni/blob/main/.buildkite/pipeline.yml#L55

The timeout is already 20 minutes on main branch https://github.com/vllm-project/vllm-omni/blob/main/.buildkite/pipeline.yml#L55

Oops, my mistake! Thanks for catching that.

congw729 · 2025-12-09T02:41:45Z

.buildkite/jobs/tests.yaml

+  gpu: h100
+  num_gpus: 2
+  commands:
+    - export VLLM_WORKER_MULTIPROC_METHOD=spawn


Do we also need to set the logging level here, align with the Omni Model Test?

congw729 · 2025-12-09T06:39:12Z

.buildkite/.pipeline_gen_v2

Is this empty file mandatory for the Buildkite test?

Ya it's used as an indicator whether a branch has the new refactored changes or not, to route CI bootstrap step to use the correct pipeline generator. The new pipeline generator wouldn't work with the old yaml file, and vice versa.

Ya it's used as an indicator whether a branch has the new refactored changes or not, to route CI bootstrap step to use the correct pipeline generator. The new pipeline generator wouldn't work with the old yaml file, and vice versa.

Thanks for the elaboration, very clear.

ZJY0516 · 2025-12-09T08:43:49Z

.buildkite/jobs/tests.yaml

+  no_plugin: true
+
+- label: "Diffusion Model Test"
+  timeout_in_minutes: 15


Is the timeout applied per label or per command?

It's per job/label

ZJY0516 · 2025-12-11T07:36:57Z

Do we have any plan to merge this? @congw729 @khluu @ywang96

congw729 · 2025-12-11T09:40:24Z

Do we have any plan to merge this? @congw729 @khluu @ywang96

It looks food for me.

hsliuustc0106

lgtm

khluu · 2025-12-14T08:23:27Z

I plan to merge this once we migrate vllm-project/vllm over to the new CI pipeline generator, which is right after vllm v0.13.0 release (Dec 17), so that we can have a consistent CI file structure across vllm, vllm-omni, and other ecosystem projects within vllm-project.

khluu added 6 commits December 8, 2025 15:20

push

6b7982a

Signed-off-by: Kevin H. Luu <[email protected]>

push

07282be

Signed-off-by: Kevin H. Luu <[email protected]>

push

c0ca921

Signed-off-by: Kevin H. Luu <[email protected]>

push

6940d7c

Signed-off-by: Kevin H. Luu <[email protected]>

push

84c0a26

Signed-off-by: Kevin H. Luu <[email protected]>

push

a8b5f0b

Signed-off-by: Kevin H. Luu <[email protected]>

khluu requested a review from hsliuustc0106 as a code owner December 9, 2025 00:38

push

c9647d4

Signed-off-by: Kevin H. Luu <[email protected]>

chatgpt-codex-connector bot reviewed Dec 9, 2025

View reviewed changes

khluu added 2 commits December 8, 2025 16:41

push

560ecc0

Signed-off-by: Kevin H. Luu <[email protected]>

push

61e2c3e

Signed-off-by: Kevin H. Luu <[email protected]>

khluu changed the title ~~[DNM][ci] Use new CI pipeline generator~~ [ci] Refactor CI files to use new CI pipeline generator Dec 9, 2025

This comment was marked as outdated.

Sign in to view

congw729 reviewed Dec 9, 2025

View reviewed changes

ZJY0516 reviewed Dec 9, 2025

View reviewed changes

david6666666 mentioned this pull request Dec 9, 2025

[Roadmap]: preparing for 1230 release #165

Open

59 tasks

hsliuustc0106 approved these changes Dec 11, 2025

View reviewed changes

[ci] Refactor CI files to use new CI pipeline generator #246

Are you sure you want to change the base?

[ci] Refactor CI files to use new CI pipeline generator #246

Uh oh!

Conversation

khluu commented Dec 9, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

This comment was marked as outdated.

Uh oh!

congw729 Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

khluu Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

congw729 Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

congw729 Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

congw729 Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

khluu Dec 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

congw729 Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ZJY0516 Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

khluu Dec 9, 2025

Choose a reason for hiding this comment

Uh oh!

ZJY0516 commented Dec 11, 2025

Uh oh!

congw729 commented Dec 11, 2025

Uh oh!

hsliuustc0106 left a comment

Choose a reason for hiding this comment

Uh oh!

khluu commented Dec 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

congw729 Dec 9, 2025 •

edited

Loading

khluu Dec 9, 2025 •

edited

Loading

congw729 Dec 10, 2025 •

edited

Loading