Feature add DyPE support (experimental) #941

stduhpf · 2025-11-05T18:51:07Z

https://github.com/guyyariv/DyPE/tree/master

Flux only for now, I don't have enough VRAM to test it at very high resolutions (seems to be working at 1536x1536 resolution, but it should be tested at up to 4096x4096 to be completely sure it's working as intended)

Use env vraiables to enable it:

FLUX_ROPE = DY_YARN, DY_NTK, YARN, or NTK (any other value will use standard RoPE)
FLUX_DYPE_BASE_RESOLUTION (defaults to 1024 which should be best for base Flux, ~~maybe use 512 for Chroma? (untested yet)~~ 768 seems to work for Chroma, for some reason 512 didn't perform well at all in my testing, maybe use 1024 too)

Example:

.\build\bin\sd.exe --diffusion-model ..\ComfyUI\models\unet\Flux\dev\flux1-dev-Q3_k.gguf --t5xxl ..\ComfyUI\models\clip\t5\t5xxl_q8_0.gguf --clip_l ..\ComfyUI\models\clip\clip_l\clip_l.safetensors --vae ..\ComfyUI\models\vae\flux\ae.safetensors -p "a lovely cat holding a sign says 'Flux cpp'" --cfg-scale 1 --sampling-method euler -W 1536 -H 1536 --vae-tiling --vae-tile-size 64

(base resolution 1024)

default RoPE	DY_YARN	DY_NTK	YARN	NTK

stduhpf · 2025-11-06T22:01:02Z

Test with Flux Schnell q3_k at 1792x1792 (biggest I could acheive using shared video memory without crashing), 6 steps

default	DY_YARN

(doesn't look very lovely, but it makes a better use of the available image area I guess?)

fszontagh · 2025-11-06T23:46:12Z

Test with Flux Schnell q3_k at 1792x1792 (biggest I could acheive using shared video memory without crashing), 6 steps

How much video memory do you have?

stduhpf · 2025-11-07T00:03:18Z

How much video memory do you have?

16GB VRAM + 16GB shared memory. But I no longer think the crash is related to OOM issues. Rocm backend just crashes at high resolution (#948). ~~Maybe I should try again with Vulkan~~ Of course Vulkan won't work either, because buffer size limit

Green-Sky · 2025-11-07T07:55:15Z

Just wanted you to try a smaller model, just to add a little more headroom, ~~but my ggufs keep crashing on loading, even without the last commit 🙈~~ I am stupid, I just forgot --clip-on-cpu .
eg: https://huggingface.co/Green-Sky/flux.1-lite-8B-GGUF/blob/main/lora-experiments/hyper-flux.1-lite-8B-8step-q5_k.gguf

stduhpf · 2025-11-07T09:32:11Z

@Green-Sky I get the same crash at over 1792x1792, even with a tiny model like https://huggingface.co/Green-Sky/flux-mini-GGUF/blob/main/flux-mini-q4_k.gguf

Green-Sky · 2025-11-07T13:40:03Z

I can generate 2048x1024 without problems(?), but when I try 3072x1024 it runs but always returns a black image (???).

Here is 2048x1024 without any rope manipulation:

and here with YARN (defaults):

stduhpf · 2025-11-07T13:47:19Z

Wait, I forgot about --diffusion-fa. I can run 2048x2048 just fine by enabling it.

Well "fine", but still slow, with about 7GB vram to spare still

4096x4096 yields stable-diffusion.cpp/ggml/src/ggml-cuda/cpy.cu:258: GGML_ASSERT(ggml_nbytes(src0) <= INT_MAX) failed

stduhpf · 2025-11-07T14:24:36Z

Shnell q4_k 6steps, 2048x2048, with fa enabled

default	DY_YARN

Neither are looking good, but the one with dype at least has a cat in it?

stduhpf · 2025-11-07T14:48:37Z

Not sure if it's because of flash attention or if there's a bug somewhere, But I can't get any good results at high resolution, either with or without dype. Since I'm using previews, I can see that the first step generally looks okay-ish, but it gets darker and less detailed at every subsequent step.

High resolution results look the same kind of broken as when using CFG with Flux (blurry, overly contrasted and so on), But i double checked that CFG is not enabled.

leejet · 2025-11-07T15:03:38Z

I’m not sure whether generating ultra-high-resolution images would cause problems for models that weren’t specifically trained for that purpose — for example, internal NaNs. Previously, I tried using relatively large images as context inputs, which ended up producing black images, while using lower-resolution inputs worked fine.

stduhpf · 2025-11-07T15:50:04Z

It looks like it's inducing some (slight) distorsions on non-square resolutions, maybe the "base resolution" should be made aware of the targeted aspect ratio...

stduhpf · 2025-11-07T19:10:55Z

Not sure if it's because of flash attention or if there's a bug somewhere, But I can't get any good results at high resolution, either with or without dype. Since I'm using previews, I can see that the first step generally looks okay-ish, but it gets darker and less detailed at every subsequent step.

High resolution results look the same kind of broken as when using CFG with Flux (blurry, overly contrasted and so on), But i double checked that CFG is not enabled.

This doesn't seem to happen with Vulkan btw, ROCm only.
@Green-Sky Which backend did you use for your 2048x1024 gens?

Green-Sky · 2025-11-07T19:38:31Z

This doesn't seem to happen with Vulkan btw, ROCm only. @Green-Sky Which backend did you use for your 2048x1024 gens?

CUDA on my rtx 2070 (8gig). The model I linked looked fine, and I think slightly better without rope scaling at that resolution.
edit: though it is a single example, but I think the vertical stripes are stronger with rope scaling (YaRN).

stduhpf · 2025-11-07T19:51:40Z

CUDA on my rtx 2070 (8gig)

Ok, so it seem like it's a ROCm specific issue. For context, generating at 2048x1024 with Flux on my ROCm build spits out somethig like this, regardless of the RoPE modifications:

Vulkan works, but it's very slow, here's a preview of the 2048x1536 that it's currently generating at 158.96s/it with dy-yarn enabled:

I got a driver timeout when I tried 2048x2048 on Vulkan.

stduhpf · 2025-11-09T00:32:32Z

It will now by default pick a base resolution with a similar aspect ratio and a pixel count close to FLUX_DYPE_BASE_RESOLUTION ² pixels, unless you specify the exact desired dimensions of the base resolution with FLUX_DYPE_BASE_RESOLUTION={Width}x{Height}

This seems to fix the distorsion issue completely, here's an example with a 1024x2048 using dy_yarn:

No dype	Dype 1024x1024 (old)	dype 724x1448 (1024 auto)	bonus: 1448x724 (oops)

stduhpf added 2 commits November 5, 2025 19:47

Flux dype

1d27a27

Working dype + NTK

5e6c77e

base_resolution with desired aspect ratio

795ad67

Feature add DyPE support (experimental) #941

Are you sure you want to change the base?

Feature add DyPE support (experimental) #941

Uh oh!

Conversation

stduhpf commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented Nov 6, 2025

Uh oh!

fszontagh commented Nov 6, 2025

Uh oh!

stduhpf commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Green-Sky commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented Nov 7, 2025

Uh oh!

Green-Sky commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented Nov 7, 2025

Uh oh!

stduhpf commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

leejet commented Nov 7, 2025

Uh oh!

stduhpf commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented Nov 7, 2025

Uh oh!

Green-Sky commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stduhpf commented Nov 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

stduhpf commented Nov 5, 2025 •

edited

Loading

stduhpf commented Nov 7, 2025 •

edited

Loading

Green-Sky commented Nov 7, 2025 •

edited

Loading

Green-Sky commented Nov 7, 2025 •

edited

Loading

stduhpf commented Nov 7, 2025 •

edited

Loading

stduhpf commented Nov 7, 2025 •

edited

Loading

stduhpf commented Nov 7, 2025 •

edited

Loading

Green-Sky commented Nov 7, 2025 •

edited

Loading

stduhpf commented Nov 7, 2025 •

edited

Loading

stduhpf commented Nov 9, 2025 •

edited

Loading