Question about patch embedder training

Hi, thanks for your great jobs. I have some questions in stage 1 patch embedder training. 
1. I want to know whether the latent need to add noise like in diffusion forward process to calculate the mse loss in stage1 before sending patch embedder. 
2. If I want to train an I2V task, do I need to concatenate all the conditions in channels before feeding them into the patch embedder for training? 
3. And what is the value of mse loss after convergence in your training setting? 
 
Can you share more information? Thanks~

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Question about patch embedder training #11

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Question about patch embedder training #11

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions