hayleyson/attention_viz

Simple code to visualize the attention values of a Transformer-based language model.

Note

The main idea for handling attention values comes from the codebase of the ACL-IJCNLP paper LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer. According to the paper, the penultimate (second-to-last) layer worked best (e.g., the 11th layer for RoBERTa-base).
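As a minimal sketch of that idea (not the repository's exact code), the penultimate layer's attentions can be pulled from a Hugging Face model like this; the input sentence is a made-up placeholder, and the model name is taken from the Sample results section below:

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Sketch only: model from the "Sample results" section below;
# the input sentence is a placeholder.
model_name = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(
    model_name, output_attentions=True
)

inputs = tokenizer("a gorgeous, witty, seductive movie.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# outputs.attentions is a tuple with one tensor per layer, each of shape
# (batch, num_heads, seq_len, seq_len); index -2 is the penultimate layer.
penultimate = outputs.attentions[-2]

# Average over heads to get a single (seq_len, seq_len) map for plotting.
attn_map = penultimate[0].mean(dim=0)
```

DistilBERT has 6 layers, so index -2 picks layer 5 here; for RoBERTa-base (12 layers) it would pick layer 11, matching the note above.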

Install Packages

  1. conda create -n <name> --file requirements.txt
    • If you want a GPU-enabled PyTorch build (a quick sanity check is sketched after this list):
      • conda activate <name>
      • conda install pytorch torchvision torchaudio pytorch-cuda=11.7 -c pytorch -c nvidia (check the PyTorch installation page, https://pytorch.org/get-started/locally/, for the command matching your CUDA version)
  2. Alternatively, check whether the packages in requirements.txt are already installed in your environment.
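After installing the GPU build, a quick sanity check (standard PyTorch calls, nothing specific to this repository):

```python
import torch

# True only if the installed torch build is CUDA-enabled and a GPU is visible.
print(torch.cuda.is_available())
# CUDA version the build was compiled against; None for CPU-only builds.
print(torch.version.cuda)
```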

Run

  1. conda activate <name>
  2. python viz_attention.py (a rough sketch of what such a script does is shown below)
  3. Alternatively, open demo.ipynb and run it as a demo.
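viz_attention.py is the actual entry point; as a hedged, self-contained sketch of what such a visualization typically looks like (a matplotlib heatmap of the head-averaged penultimate-layer attentions; the sentence is a placeholder and none of this is guaranteed to match the script):

```python
import matplotlib.pyplot as plt
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Sketch only: model from the "Sample results" section; placeholder sentence.
model_name = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(
    model_name, output_attentions=True
)

inputs = tokenizer("a gorgeous, witty, seductive movie.", return_tensors="pt")
with torch.no_grad():
    # Penultimate layer, first batch item, averaged over attention heads.
    attn = model(**inputs).attentions[-2][0].mean(dim=0)

# Label both axes with the wordpiece tokens.
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
fig, ax = plt.subplots(figsize=(6, 6))
ax.imshow(attn.numpy(), cmap="viridis")
ax.set_xticks(range(len(tokens)))
ax.set_yticks(range(len(tokens)))
ax.set_xticklabels(tokens, rotation=90)
ax.set_yticklabels(tokens)
ax.set_xlabel("Key tokens")
ax.set_ylabel("Query tokens")
plt.tight_layout()
plt.savefig("attention_heatmap.png")
```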

Sample results

  • Data: SST-2 test set
  • Model: distilbert-base-uncased-finetuned-sst-2-english
