[go: up one dir, main page]

03 Sep 24

Merge two embedding sequences regardless of modality, e.g., image with text in Stable Diffusion U-Net with encoder-decoder attention.

by isaac 1 year ago