[WIP][CI][Accuracy] Add HunyuanImage3 pixel accuracy test and nightly CI by BLANKETusers · Pull Request #3657 · vllm-project/vllm-omni

BLANKETusers · 2026-05-16T09:28:01Z

Summary

- Add `assert_images_pixel_close` helper for full-image pixel-level comparison with mean/p99 absolute channel difference metrics and detailed diagnostics
- Add `test_hunyuan_image3_pixel_accuracy`, which generates images via offline `end2end.py` and compares the output against a pre-saved baseline image
- Add a nightly CI step (4× H100) in the Diffusion X2I group to gate pixel accuracy regressions
- Rename `diffusers_image` → `baseline_image` across the accuracy helper APIs (`assert_similarity`, `assert_image_sequence_similarity`)
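For readers who want a concrete picture of the mean/p99 check, here is a minimal sketch. It is not the helper from this PR: the function names, the 0..1 scaling of 8-bit inputs, and the default tolerances (taken from the Pixel Metrics table in this description) are assumptions for illustration.

```python
import numpy as np


def pixel_diff_metrics(candidate, baseline):
    """Return mean and p99 absolute channel difference on a 0..1 scale.

    Assumes both inputs are 8-bit images (anything np.asarray accepts,
    including PIL images) with identical shapes.
    """
    a = np.asarray(candidate, dtype=np.float32) / 255.0
    b = np.asarray(baseline, dtype=np.float32) / 255.0
    if a.shape != b.shape:
        raise ValueError(f"shape mismatch: {a.shape} vs {b.shape}")
    diff = np.abs(a - b)  # per-channel absolute difference
    return {
        "mean_abs_diff": float(diff.mean()),
        "p99_abs_diff": float(np.percentile(diff, 99)),
    }


def assert_pixels_close(candidate, baseline, mean_tol=0.02, p99_tol=0.10):
    """Hypothetical stand-in for the PR's helper; fails with the metrics attached."""
    metrics = pixel_diff_metrics(candidate, baseline)
    assert metrics["mean_abs_diff"] <= mean_tol, metrics
    assert metrics["p99_abs_diff"] <= p99_tol, metrics
    return metrics
```

Checking both a mean and a high percentile catches two different failure modes: a small global drift raises the mean, while a localized artifact shows up in p99 even when the mean stays low.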

Files changed

File	Change
`.buildkite/test-nightly.yml`	+38 lines: new CI step `vllm-omni · HunyuanImage3 · Accuracy Test`
`tests/assets/hunyuan/hunyuan_baseline.png`	Baseline reference image (1024×1024)
`tests/e2e/accuracy/helpers.py`	+68 lines: `assert_images_pixel_close`; rename params
`tests/e2e/accuracy/test_hunyuan_image3_pixel_accuracy.py`	+142 lines: new test

Test plan

HUNYUAN_IMAGE3_DEPLOY_CONFIG=../hunyuan_image3_dit_copy.yaml pytest -s -v tests/e2e/accuracy/test_hunyuan_image3_pixel_accuracy.py --run-level
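The command above overrides the deploy config through `HUNYUAN_IMAGE3_DEPLOY_CONFIG`. How the test reads that variable is not shown on this page, so the following is only a sketch of one plausible resolution order (environment override first, then a default path); `resolve_deploy_config` is a hypothetical name.

```python
import os
from pathlib import Path

ENV_VAR = "HUNYUAN_IMAGE3_DEPLOY_CONFIG"


def resolve_deploy_config(default, env=None):
    """Return the deploy config path, preferring the environment override.

    `env` defaults to os.environ; passing a dict makes the function easy to test.
    Relative paths are resolved against the current working directory.
    """
    env = os.environ if env is None else env
    override = env.get(ENV_VAR)
    path = Path(override).expanduser() if override else Path(default)
    return path.resolve()
```

With this shape, a relative override such as `../hunyuan_image3_dit_copy.yaml` depends on the directory pytest is launched from, which is worth keeping in mind when the same command runs in CI.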

Test Result

1 passed, 18 warnings in 101.49s (0:01:41)

Pixel Metrics

tencent/HunyuanImage-3.0-Instruct — (offline vs baseline)

Metric	Threshold	Status
mean_abs_diff	≤ 0.02	✓
p99_abs_diff	≤ 0.10	✓
p50	—	—
p90	—	—
p95	—	—
p99	—	—
p99.9	—	—

Mismatch ratios (pixel_ratio / channel_ratio)

Threshold (1/255)	0	1	2	4	8	16	32	64	128
pixel_ratio	0.0	0.0	0.0	0.0	0.0	0.0	0.0	0.0	0.0
channel_ratio	0.0	0.0	0.0	0.0	0.0	0.0	0.0	0.0	0.0
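The table does not define the two ratios, so the sketch below is one reading, stated as an assumption: `channel_ratio` is the fraction of channel values whose absolute difference exceeds the threshold, and `pixel_ratio` is the fraction of pixels where at least one channel does. Under that reading, a ratio of 0.0 at threshold 0 means the two images are bit-identical. The function name is hypothetical.

```python
import numpy as np

# Thresholds in units of 1/255, matching the table header.
THRESHOLDS = (0, 1, 2, 4, 8, 16, 32, 64, 128)


def mismatch_ratios(candidate, baseline, thresholds=THRESHOLDS):
    """Return {threshold: {"pixel_ratio": ..., "channel_ratio": ...}}.

    Assumes 8-bit H x W x C inputs of identical shape.
    """
    a = np.asarray(candidate, dtype=np.int16)  # int16 avoids uint8 wraparound
    b = np.asarray(baseline, dtype=np.int16)
    if a.shape != b.shape or a.ndim != 3:
        raise ValueError(f"expected matching H x W x C arrays, got {a.shape} and {b.shape}")
    diff = np.abs(a - b)
    ratios = {}
    for t in thresholds:
        exceeds = diff > t  # strictly greater than the threshold
        ratios[t] = {
            "pixel_ratio": float(exceeds.any(axis=-1).mean()),
            "channel_ratio": float(exceeds.mean()),
        }
    return ratios
```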

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: ce7719d521


chatgpt-codex-connector · 2026-05-16T09:30:02Z

    model_name: str,
    vllm_image: Image.Image,
-    diffusers_image: Image.Image,
+    baseline_image: Image.Image,


Preserve existing helper keyword names

When the existing accuracy tests call assert_similarity(..., diffusers_image=...) or assert_image_sequence_similarity(..., diffusers_images=...) (for example tests/e2e/accuracy/test_qwen_image.py:114 and test_qwen_image_layered.py:142), this signature change makes pytest fail immediately with TypeError: got an unexpected keyword argument before any image comparison runs. Please keep backward-compatible keyword aliases or update all existing call sites in the same change.
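One way to keep the old keyword working during a transition is a deprecated alias. The sketch below is an assumption-laden illustration, not the repository's code: only the three parameter names come from the diff above, the keyword-only signature is a guess, and `_compare` is a hypothetical stand-in for the real comparison.

```python
import warnings


def _compare(model_name, vllm_image, baseline_image):
    # Hypothetical placeholder for the real similarity check.
    return (model_name, vllm_image, baseline_image)


def assert_similarity(*, model_name, vllm_image, baseline_image=None, diffusers_image=None):
    """Accept the new `baseline_image` keyword and the old `diffusers_image` alias."""
    if diffusers_image is not None:
        if baseline_image is not None:
            raise TypeError("pass only one of baseline_image or diffusers_image")
        warnings.warn(
            "diffusers_image is deprecated; use baseline_image",
            DeprecationWarning,
            stacklevel=2,
        )
        baseline_image = diffusers_image
    if baseline_image is None:
        raise TypeError("missing required argument: baseline_image")
    return _compare(model_name, vllm_image, baseline_image)
```

The alternative the review mentions, updating every call site in the same change, avoids carrying the alias at all and may be simpler if the helpers are only used inside the test suite.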


yenuo26 · 2026-05-19T09:07:43Z

+
+_REPO_ROOT = Path(__file__).resolve().parent.parent.parent.parent
+BASELINE_PATH = _REPO_ROOT / "tests" / "assets" / "hunyuan" / "hunyuan_baseline.png"
+_DEFAULT_DEPLOY_CONFIG = _REPO_ROOT / "vllm_omni" / "deploy" / "hunyuan_image3.yaml"


Maybe you can use `get_deploy_config_path` from `stage_config.py` here.

yenuo26 · 2026-05-19T09:08:17Z

+        "--stage-init-timeout", "300",
+        "--init-timeout", "900",
+    ]
+    with OmniServer(model, server_args, use_omni=True) as omni_server:


Maybe you can use the `omni_server` fixtures here.

yenuo26 · 2026-05-19T09:09:45Z

+    )
+
+    # online vs baseline_image
+    # assert_images_pixel_close(


Is this redundant code?

congw729 · 2026-05-21T06:34:46Z

Does this PR need to be closed?

BLANKETusers · 2026-05-21T12:29:30Z

New PR: #3790

BLANKETusers requested review from congw729 and yenuo26 as code owners May 16, 2026 09:28

chatgpt-codex-connector Bot reviewed May 16, 2026


Gaohan123 added this to the v0.22.0 milestone May 18, 2026

BLANKETusers force-pushed the main branch from ce7719d to c1d186f Compare May 19, 2026 07:09

BLANKETusers requested review from Gaohan123, Isotr0py, RuixiangMa, SamitHuang, ZJY0516, ZeldaHuang, david6666666, gcanlin, hsliuustc0106, linyueqian, princepride, tzhouam, wtomin, yuanheng-zhao and ywang96 as code owners May 19, 2026 07:09

BLANKETusers force-pushed the main branch 2 times, most recently from 751540a to fbcf39d Compare May 19, 2026 08:45

yenuo26 reviewed May 19, 2026


BLANKETusers closed this May 21, 2026

BLANKETusers force-pushed the main branch from 8de37d5 to 083b5e3 Compare May 21, 2026 02:17

BLANKETusers wants to merge 0 commits into vllm-project:main from BLANKETusers:main
