I'm very impressed with the excellent performance of the DC-Gen project.
I have a few questions about applying this methodology to other Text-to-Image models.
-
Are there any official plans to support other T2I (or I2I) models besides FLUX, such as Qwen-Image?
-
(DC-AE Retraining) Qwen-Image uses a different VAE than FLUX. In this case, is it correct that we would need to train a new DC-AE from scratch to fit Qwen-Image's VAE?
-
(Application Guide) I would like to try applying this methodology to Qwen-Image myself. Could you provide a brief, high-level guide on which scripts to run and in what order, based on the process used for the FLUX model?
Thank you in advance for your response!