Guided-Attention

Diffusion Model that allows for annotations (bounding boxes, crosshairs, keywords) to guide latent based on cross attention layers. This model does not use any fine tuning (all modification is done at inference) and is fully compatible with existing models such as Stable Diffusion.

Example

Demo:

webUI_demo_trim.mp4

Setup

This code was tested using Python 3.10, torch 1.13.1.

Install Pytorch then,

git clone https://github.com/jackBonadies/Guided-Attention.git
cd Guided-Attention/
pip install -r environment/requirements.txt

To generate an image one can use:

python run.py --meta_prompt "a [robot:.6,.3,.4,.55] and a [blue vase:.2,.3,.4,.55]" --seeds [28] --half_precision True

To launch the web based gui one can use:

python run.py --interactive True --half_precision True

Note: Half precision is not mandatory, but it is recommended for most users. Guided Attention keeps track of gradients of UNet and therefore, even with float16, takes around 9.5GB vram.

Acknowledgements

This work builds on code from Attend and Excite and Prompt to Prompt

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
.github/workflows		.github/workflows
environment		environment
notebooks		notebooks
resource		resource
utils		utils
LICENSE		LICENSE
README.md		README.md
config.py		config.py
gui.py		gui.py
pipeline_guided_attention.py		pipeline_guided_attention.py
run.py		run.py
web_ui.html		web_ui.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Guided-Attention

Example

Setup

Acknowledgements

About

Releases

Packages

Contributors 2

Languages

License

jackBonadies/Guided-Attention

Folders and files

Latest commit

History

Repository files navigation

Guided-Attention

Example

Setup

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages