Add ParametricAttention.v2 #913
Conversation
This layer is an extension of the existing `ParametricAttention` layer, adding support for transformations (such as a non-linear layer) of the key representation. This brings the model closer to the paper that proposed it (Yang et al., 2016) and gave slightly better results in experiments.
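For reference, a minimal usage sketch. This assumes the new layer is exposed as `ParametricAttention_v2` in `thinc.api` and takes the key transformation via a `key_transform` argument; those names are assumptions for illustration, not confirmed by this PR.

```python
# Hypothetical usage sketch: assumes ParametricAttention_v2 is exposed in
# thinc.api and accepts an optional key_transform sub-layer.
from thinc.api import Gelu, ParametricAttention_v2, chain, list2ragged, reduce_sum

# Non-linear transformation of the keys, as in Yang et al. (2016).
attention = ParametricAttention_v2(key_transform=Gelu())

# Typical pooling pipeline over variable-length sequences: lists of
# float arrays are joined into a Ragged, attended over, then summed.
model = chain(list2ragged(), attention, reduce_sum())
```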
Looks good! I only had a few concerns about robustness & tests.
Co-authored-by: Adriane Boyd <[email protected]>
Definitely much cleaner implementation with the `noop` layer. Looks good to merge!
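A sketch of the design choice mentioned above; this is an assumption about the internals rather than text from the diff. Falling back to thinc's `noop()` pass-through layer when no key transform is given keeps the forward pass free of `None` checks.

```python
# Illustrative sketch only: resolve_key_transform is a hypothetical helper,
# not part of the PR. thinc.api.noop is thinc's real pass-through layer.
from typing import Optional
from thinc.api import Model, noop

def resolve_key_transform(key_transform: Optional[Model] = None) -> Model:
    # With a noop() fallback, the attention layer can always apply the
    # transform to its keys without branching on None.
    return key_transform if key_transform is not None else noop()
```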
Description

This layer is an extension of the existing `ParametricAttention` layer, adding support for transformations (such as a non-linear layer) of the key representation. This brings the model closer to the paper that proposed it (Yang et al., 2016) and gave slightly better results in experiments.

Types of change
Feature
Checklist