
Fix Value dimension in ImageCrossAttention #188

Merged · 1 commit into main on Jan 17, 2024

Conversation

hugojarkoff (Contributor)

This is a minor issue, but while reading the codebase I noticed that the ImageCrossAttention uses the wrong in_features dimension in the second fl.Linear layer:

class ImageCrossAttention(fl.Chain):
    def __init__(self, text_cross_attention: fl.Attention, scale: float = 1.0) -> None:
            ...      
                    # This first Linear corresponds to the keys K' 
                    fl.Linear(
                      ...
                    ),
                    ...
                    # This second Linear corresponds to the values V'
                    fl.Linear(
                        in_features=text_cross_attention.key_embedding_dim,
                        out_features=text_cross_attention.inner_dim,
                        bias=text_cross_attention.use_bias,
                        device=text_cross_attention.device,
                        dtype=text_cross_attention.dtype,
                    ),
                ),
            ...

Should (IIUC) be changed to:

class ImageCrossAttention(fl.Chain):
    def __init__(self, text_cross_attention: fl.Attention, scale: float = 1.0) -> None:
            ...      
                    # This first Linear corresponds to the keys K' 
                    fl.Linear(
                      ...
                    ),
                    ...
                    # This second Linear corresponds to the values V'
                    fl.Linear(
                        in_features=text_cross_attention.value_embedding_dim,
                        out_features=text_cross_attention.inner_dim,
                        bias=text_cross_attention.use_bias,
                        device=text_cross_attention.device,
                        dtype=text_cross_attention.dtype,
                    ),
                ),
            ...

In practice, and IINM, this shouldn't change anything in the context of Image Cross-Attention, since the key and value embedding dims are the same there.
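
For context, here is a minimal sketch in plain PyTorch (not the refiners fl API; all dimensions below are made up) of why the value projection's in_features must match the value embedding dim, and why the old code still ran: in image cross-attention the keys and values are projected from the same image embedding, so the two dims coincide.

import torch
import torch.nn as nn

# Hypothetical dimensions, mirroring the parameters referenced in the PR.
inner_dim = 8            # heads * head_dim of the attention layer
key_embedding_dim = 4    # dim of the embedding the keys are projected from
value_embedding_dim = 4  # same source as the keys in image cross-attention

to_k = nn.Linear(key_embedding_dim, inner_dim, bias=False)
to_v = nn.Linear(value_embedding_dim, inner_dim, bias=False)  # fixed: value dim

# Keys and values both come from the image embedding, so using
# key_embedding_dim for to_v happened to give the same shape.
image_embedding = torch.randn(1, 3, value_embedding_dim)
k = to_k(image_embedding)
v = to_v(image_embedding)  # would only fail at runtime if the two dims differed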

@hugojarkoff hugojarkoff requested a review from deltheil January 17, 2024 14:57
@limiteinductive limiteinductive self-requested a review January 17, 2024 15:40
@limiteinductive limiteinductive merged commit a6a9c8b into main Jan 17, 2024
1 check passed
@limiteinductive limiteinductive deleted the pr/fix-mistake-image-cross-attention branch January 17, 2024 15:46