Adds NVIDIA PixelDiT and PiD support by jtreminio · Pull Request #1393 · mcmonkeyprojects/SwarmUI

jtreminio · 2026-05-26T17:29:03Z

Not included: docs updates.

PixelDiT is an image model. Not that great.

The interesting part of this PR is the PiD, a 4x-locked upscaler that now replaces the refiner stage's upscaler. The upscale happens after the refiner's SwarmKSampler node.

PixelDiT workflow:

PiD upscale workflow:

jtreminio · 2026-05-27T23:40:01Z

1839001-A high-quality, cinematic portrait featu

mcmonkey4eva · 2026-05-27T23:48:16Z

+# PixelDiT
+
+- NVIDIA's [PixelDiT](<https://huggingface.co/Comfy-Org/PixelDiT>) is supported in SwarmUI!
+    - Or the smaller FP8 version: [Comfy-Org/PixelDiT - mxfp8](<https://huggingface.co/Comfy-Org/PixelDiT/resolve/main/diffusion_models/pixeldit_1300m_1024px_mxfp8.safetensors>)


mcmonkey4eva · 2026-05-27T23:54:13Z

+                if (doUpscale && upscaleMethod.StartsWith("pidmodel-"))
+                {
+                    string pidModelName = upscaleMethod.After("pidmodel-");
+                    T2IModel pidModel = Program.MainSDModels.GetModel(pidModelName);


Check t2iprompthandling for "lora": there's a special-case pattern for how indirectly specified models are read, which accommodates both whitelisting/blacklisting of models and user-typing issues (e.g. including or omitting the .safetensors extension).
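To illustrate the kind of tolerant lookup this comment describes, here is a minimal sketch. It is not SwarmUI's implementation: the class `ModelNameMatch` and its methods are hypothetical names, and the allowed list is a plain string collection standing in for whatever filtered model list the real code uses.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class ModelNameMatch
{
    private const string Extension = ".safetensors";

    // Hypothetical helper: lowercases, unifies path separators,
    // and drops a trailing ".safetensors" if the user typed one.
    private static string Normalize(string name)
    {
        string n = name.Trim().Replace('\\', '/').ToLowerInvariant();
        return n.EndsWith(Extension) ? n.Substring(0, n.Length - Extension.Length) : n;
    }

    // Returns the entry of 'allowed' that best matches what the user typed,
    // or null if nothing matches. Only names in 'allowed' can ever be returned,
    // so a whitelist/blacklist applied beforehand is respected.
    public static string FindBest(string typed, IEnumerable<string> allowed)
    {
        string want = Normalize(typed);
        List<string> list = allowed.ToList();
        // First preference: full path match after normalization.
        string exact = list.FirstOrDefault(m => Normalize(m) == want);
        if (exact != null)
        {
            return exact;
        }
        // Fallback: match on the file name alone, ignoring folders.
        return list.FirstOrDefault(m => Normalize(m).Split('/').Last() == want);
    }
}
```

With this sketch, `FindBest("pid/model", ...)` and `FindBest("PiD/Model.safetensors", ...)` resolve to the same entry, and a name that is not in the allowed list yields null instead of loading an arbitrary file.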

mcmonkey4eva · 2026-05-27T23:57:43Z

+                    string pidSampled = g.CreateKSampler(g.CurrentModel.Path, [pidCond, 0], pidNeg, [pidEmptyLatent, 0], pidCfg, pidSteps, 0, 10000,
+                        g.UserInput.Get(T2IParamTypes.Seed) + 2, false, true, defsampler: "lcm", defscheduler: "simple", explicitSampler: pidSampler, explicitScheduler: pidScheduler, sectionId: T2IParamInput.SectionID_PixelDecoder);
+                    g.CurrentMedia = g.CurrentMedia.WithPath([pidSampled, 0], WGNodeData.DT_LATENT_IMAGE, pidModel.ModelClass?.CompatClass);
+                    g.CurrentMedia.Width = pidWidth;


For the Refiner Upscale, the target size is user-specified, so honor it by doing a post-rescale in pixel space; see how ImageUpscaleWithModel does it above.
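The arithmetic behind that request can be sketched as follows. This assumes, per the PR description, that PiD always upscales by exactly 4x; `PidRescalePlan` and its members are hypothetical names for illustration and do not appear in the PR.

```csharp
using System;

public static class PidRescalePlan
{
    // Assumption taken from the PR description: PiD always upscales by exactly 4x.
    public const int PidFactor = 4;

    // Hypothetical helper: returns the PiD output size, whether a pixel-space
    // post-rescale is needed, and the per-axis scale that rescale would apply.
    public static (int PidWidth, int PidHeight, bool NeedsRescale, double ScaleX, double ScaleY) Plan(
        int sourceWidth, int sourceHeight, int targetWidth, int targetHeight)
    {
        int pidWidth = sourceWidth * PidFactor;
        int pidHeight = sourceHeight * PidFactor;
        bool needsRescale = pidWidth != targetWidth || pidHeight != targetHeight;
        return (pidWidth, pidHeight, needsRescale,
            (double)targetWidth / pidWidth, (double)targetHeight / pidHeight);
    }
}
```

For example, a 1024x1024 image with a user-requested 1536x1536 target comes out of PiD at 4096x4096, so the post-rescale would shrink it by a factor of 0.375 on each axis. Only when the user asks for exactly 4x is no rescale needed.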

mcmonkey4eva · 2026-05-27T23:58:33Z

        bool isHiDreamO1Lora(JObject h) => hasLoraKey(h, "final_layer2.linear") && hasLoraKey(h, "language_model.layers.0.self_attn.q_proj");
        bool isChroma(JObject h) => h.ContainsKey("distilled_guidance_layer.in_proj.bias") && h.ContainsKey("double_blocks.0.img_attn.proj.bias");
        bool isChromaRadiance(JObject h) => h.ContainsKey("nerf_image_embedder.embedder.0.bias");
+        bool isPiD(JObject h) => h.ContainsKey("net.lq_proj.latent_proj.0.weight");


could you pick another key or two each just to narrow it? The list is getting long enough that we're getting occasional surprise overlaps.

Added net.pixel_blocks.0.attn.q_norm.weight for isPiD() and core.pixel_blocks.0.attn.q_norm.weight for isPixelDiT(). I figure keys with pixel_ in them aren't very common (yet). Clearing metadata is clean.
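A rough sketch of the narrowed checks, requiring every listed key to be present. The real code tests a `JObject` header with `ContainsKey`; a plain `ISet<string>` of header key names is used here as a stand-in so the example is self-contained. `ArchDetect` and `HasAll` are hypothetical names, and the PixelDiT check lists only the one key named in this thread.

```csharp
using System.Collections.Generic;
using System.Linq;

public static class ArchDetect
{
    // Hypothetical helper: true only if every required key is in the header.
    private static bool HasAll(ISet<string> headerKeys, params string[] required)
        => required.All(headerKeys.Contains);

    // Original PiD key plus the extra key added in response to review.
    public static bool IsPiD(ISet<string> headerKeys) => HasAll(headerKeys,
        "net.lq_proj.latent_proj.0.weight",
        "net.pixel_blocks.0.attn.q_norm.weight");

    // Only the newly added key is known from this thread; the actual check
    // in the PR may require additional keys.
    public static bool IsPixelDiT(ISet<string> headerKeys) => HasAll(headerKeys,
        "core.pixel_blocks.0.attn.q_norm.weight");
}
```

Requiring two keys means a model that happens to share one tensor name with PiD is no longer misdetected, which is the overlap problem the review comment raises.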

* as base model - for when a user uploads an image I guess
* as refiner model - if base model isn't a compatible vae user, load the vae and add a vae decode/encode pair
* refiner upscale model - base -> pid -> downscale or upscale with lanczos (if needed) -> refiner swarmksampler
* after the refiner swarmksampler; if refiner model isn't a compatible vae user, load the vae and add a vae decode/encode pair

mcmonkey4eva · 2026-06-04T22:18:37Z

+    public (WGNodeData, string) CreatePidCompatLatent(T2IModel pidModel, WGNodeData media, WGNodeData decodeVae)
+    {
+        string mediaFamily = media.IsLatentData ? media.Compat?.VaeFamily : null;
+        string family = PidFamilyOfModel(pidModel) ?? mediaFamily ?? "flux1";


if flux1 and flux2 both are fine, iirc flux2 is a much better latent format
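If that suggestion were adopted, the fallback chain from the diff above might look like the sketch below. `PidFamilyFallback.Resolve` is a hypothetical stand-in, not the PR's code; whether the merged version defaults to flux2 is not stated in this thread.

```csharp
public static class PidFamilyFallback
{
    // Hypothetical stand-in for the chain in CreatePidCompatLatent:
    // prefer the family implied by the PiD model, then the family of the
    // incoming latent, and only then a default ("flux2" per the review suggestion).
    public static string Resolve(string modelFamily, string mediaFamily, string fallback = "flux2")
        => modelFamily ?? mediaFamily ?? fallback;
}
```

The default only matters when neither the PiD model nor the incoming media identifies a VAE family, for example when the input is a pixel image.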

jtreminio added 4 commits May 26, 2026 12:28

Adds NVIDIA PixelDiT and PiD support

33bac0c

Merge branch 'master' into pixeldit-pid-support

320de43

PiD replaces Refiner upscaler, not refiner stage itself

3d3a933

Cleanup

05836eb

jtreminio marked this pull request as ready for review May 27, 2026 23:29

Docs for PixelDiT

ab0665d

mcmonkey4eva reviewed May 27, 2026

doc fix

f10bb98

mcmonkey4eva reviewed May 27, 2026

Add <pixeldecoder> hint

145fec3

mcmonkey4eva reviewed May 27, 2026

jtreminio added 3 commits May 27, 2026 19:15

Use T2IParamTypes.GetBestModelInList()

f42cf7a

Add more (pixel) keys

4502cb4

add more why not

eb099fc

jtreminio marked this pull request as draft May 29, 2026 12:37

jtreminio marked this pull request as ready for review June 4, 2026 17:46

Merge branch 'master' into pr/1393

a6715d8

mcmonkey4eva reviewed Jun 4, 2026

Comment thread src/BuiltinExtensions/ComfyUIBackend/WorkflowGenerator.cs

mcmonkey4eva reviewed Jun 4, 2026

minor doc

5a796a3

mcmonkey4eva force-pushed the pixeldit-pid-support branch from c40732f to 5a796a3 Compare June 5, 2026 01:02

mcmonkey4eva added 2 commits June 4, 2026 18:21

docs

dccc351

base sampler input wrong pattern

a774977

mcmonkey4eva merged commit be8ed96 into mcmonkeyprojects:master Jun 5, 2026
3 checks passed

Adds NVIDIA PixelDiT and PiD support #1393
mcmonkey4eva merged 15 commits into mcmonkeyprojects:master from jtreminio:pixeldit-pid-support
