Questions on applying to other T2I models like Qwen-Image

I'm very impressed with the excellent performance of the DC-Gen project.
I have a few questions about applying this methodology to other Text-to-Image models.


1. Are there any official plans to support other T2I (or I2I) models besides FLUX, such as Qwen-Image?

2. (DC-AE Retraining) Qwen-Image uses a different VAE than FLUX. In this case, is it correct that we would need to train a new DC-AE from scratch to fit Qwen-Image's VAE?

3. (Application Guide) I would like to try applying this methodology to Qwen-Image myself. Could you provide a brief, high-level guide on which scripts to run and in what order, based on the process used for the FLUX model?


Thank you in advance for your response!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Questions on applying to other T2I models like Qwen-Image #5

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Questions on applying to other T2I models like Qwen-Image #5

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions