[Frontend/Model] Support Optional Prompt Upscale by alex-jw-brooks · Pull Request #3783 · vllm-project/vllm-omni

alex-jw-brooks · 2026-05-21T00:24:34Z

Purpose

Exposes a request parameter to turn prompt upscaling on or off, and unifies the behavior for the following diffusion models:

Flux2Dev
LongCat
Ernie Image

In cases where the model's prompt upscaler is external (of the above, only Ernie Image), the download of the extra files is gated on whether the upscaler is actually going to be used. The upscaler also isn't loaded until you make a request that uses it, since it takes a nontrivial amount of extra VRAM. For models like this, you need to pass an additional opt-in flag, enable_external_prompt_upscaler (or the corresponding CLI arg), which is set to False by default. If you make a request with prompt upscale on while the flag is disabled, and the model has an external upscaler component, you will get a warning and the upscale step will be skipped:

WARNING 05-21 00:09:43 [pipeline_ernie_image.py:198] Requested prompt upscaling on a model with an external prompt upscaler, but enable_external_prompt_upscaler is not set in the server config; prompt upscaling will be skipped
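
To make the gating behavior above concrete, here is a minimal sketch of the pattern (opt-in flag, lazy load on first use, warn and skip when disabled). It is not the code in this PR: `UpscalerGate`, `loader`, and `maybe_upscale` are hypothetical names for illustration, and only `enable_external_prompt_upscaler` comes from the description.

```python
import logging
from typing import Callable, Optional

logger = logging.getLogger("prompt_upscale_sketch")

# An upscaler is modeled as a plain function from prompt to rewritten prompt.
Upscaler = Callable[[str], str]


class UpscalerGate:
    """Hypothetical helper illustrating the gate; not the vllm-omni implementation."""

    def __init__(self, enable_external_prompt_upscaler: bool,
                 loader: Callable[[], Upscaler]) -> None:
        # Server-level opt-in flag (described above as False by default).
        self.enable_external_prompt_upscaler = enable_external_prompt_upscaler
        # Hypothetical factory that would load the external upscaler weights.
        self._loader = loader
        self._upscaler: Optional[Upscaler] = None

    def maybe_upscale(self, prompt: str, requested: bool) -> str:
        # Requests that do not ask for upscaling never touch the upscaler.
        if not requested:
            return prompt
        # Upscaling requested but not enabled on the server: warn and skip.
        if not self.enable_external_prompt_upscaler:
            logger.warning(
                "Prompt upscaling requested, but enable_external_prompt_upscaler "
                "is not set; prompt upscaling will be skipped"
            )
            return prompt
        # Lazy load: pay the extra memory cost only on the first real use.
        if self._upscaler is None:
            self._upscaler = self._loader()
        return self._upscaler(prompt)
```

With the flag off, `maybe_upscale("a cat", requested=True)` returns the prompt unchanged and never calls `loader`; with the flag on, `loader` runs once, on the first request that asks for upscaling.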

It's also worth noting that we could technically expose sampling params for the upscaling / rewrite step, but I opted not to do that here, since I thought it would be best to minimize the number of new params / flags in this PR. Open to discussion / exploring this in potential follow-ups though.

Test Plan

Validated for all 3 models that, with the same seed, outputs with upscale enabled do not match the raw generation on each of the following paths:

Offline path
Chat completions endpoint
Image generation endpoint
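
The check in this test plan can be sketched as follows. This is an illustrative harness only, not the tests added in the PR: `generate` is a hypothetical callable standing in for any of the three paths, and `fake_generate` is a stand-in that hashes its inputs instead of running a model.

```python
import hashlib
from typing import Callable

# Hypothetical signature: (prompt, seed, upscale_enabled) -> output bytes.
GenerateFn = Callable[[str, int, bool], bytes]


def upscale_changes_output(generate: GenerateFn, prompt: str, seed: int) -> bool:
    """Return True if enabling upscale changes the output for a fixed seed."""
    raw = generate(prompt, seed, False)
    # The comparison only means something if the baseline is reproducible.
    if generate(prompt, seed, False) != raw:
        raise RuntimeError("baseline generation is not deterministic for this seed")
    upscaled = generate(prompt, seed, True)
    return upscaled != raw


def fake_generate(prompt: str, seed: int, upscale: bool) -> bytes:
    # Stand-in for a model: the "upscaler" rewrites the prompt, and the
    # output depends only on the seed and the effective prompt.
    effective = (prompt + ", highly detailed") if upscale else prompt
    return hashlib.sha256(f"{seed}:{effective}".encode()).digest()
```

Checking determinism of the baseline first avoids a false pass where outputs differ simply because the seed is not being honored.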

Test Result

Upscale results differ (due to prompt manipulation) on all paths for each of the models. Added additional tests for Ernie since the other changes were very straightforward. Also verified when running the server online that GPU usage stayed low (~24 Gi) until I made a request with the upscale param, which increased it (to ~30 Gi), and that we get a warning instead of blowing up the memory if the flag is disabled. For Flux2/LongCat, we don't need a separate gate since the component is always loaded anyway, so prompt upscaling still works for them without issues.

@RuixiangMa @retowyss could you please take a look?

Signed-off-by: Alex Brooks <albrooks@redhat.com>

chatgpt-codex-connector · 2026-05-21T00:24:39Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.
Credits must be used to enable repository wide code reviews.


retowyss · 2026-05-21T12:40:30Z

Works wonderfully for Ernie-Image-Turbo. Thanks!

alex-jw-brooks · 2026-05-21T21:15:55Z

Great! Thanks for trying it out @retowyss 🙏

alex-jw-brooks added 10 commits May 20, 2026 20:21

flux / longcat pe

f8e4c2d

Signed-off-by: Alex Brooks <albrooks@redhat.com>

remove global prompt upscale flag, add to req extras

80dac25

Signed-off-by: Alex Brooks <albrooks@redhat.com>

optional prompt unhance for ernie image

7bf082e

Signed-off-by: Alex Brooks <albrooks@redhat.com>

fix online kwarg fwd

b4ec862

Signed-off-by: Alex Brooks <albrooks@redhat.com>

add prompt upscale flag to docs

05f76b9

Signed-off-by: Alex Brooks <albrooks@redhat.com>

add config gate on external scaler

d775b6a

Signed-off-by: Alex Brooks <albrooks@redhat.com>

docs

fe123a4

Signed-off-by: Alex Brooks <albrooks@redhat.com>

expose flag and add to default eng

4f2d744

Signed-off-by: Alex Brooks <albrooks@redhat.com>

clarify log

b3cdb08

Signed-off-by: Alex Brooks <albrooks@redhat.com>

gate download for ernie files based on upscaler flag

6e289aa

Signed-off-by: Alex Brooks <albrooks@redhat.com>

alex-jw-brooks requested review from Gaohan123, Isotr0py, RuixiangMa, SamitHuang, ZJY0516, david6666666, hsliuustc0106, princepride, tzhouam, wtomin and yenuo26 as code owners May 21, 2026 00:24

update recipe

76286f7

Signed-off-by: Alex Brooks <albrooks@redhat.com>

alex-jw-brooks wants to merge 11 commits into vllm-project:main from alex-jw-brooks:prompt_upscale
