Skip to content

badly coded gui for a quick streamlined workflow to produce 512x512 images suitable to train Stable Diffusion

License

Notifications You must be signed in to change notification settings

Trainraider/training-image-processor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Training Image Processor

This project speeds up the process of manually preparing images for training Stable Diffusion embeddings.

It provides a streamlined UI for selecting a square portion of a picture and then scales that section to 512x512 for training, and then outputs it and loads the next image, all in a single click!

The code has been thrown together quickly. It's disorganized and unprincipled. I made the minimum viable project to speed up my own workflow I was interested in. Pull requests are welcome.

example of UI

Installation and Launching

Windows

  • Install python 3.x from python.org.
  • Download this repository as a zip file and extract
  • Enter the extracted folder with your file browser
  • Double click setup (setup.bat) - This only needs done once
  • Double click launch (launch.bat)

Linux - and other Unix-likes/BSDs/MacOS

Ya'll probably know what to do already...

  • Make sure python is installed however is standard for your distribution.
  • Git clone the repo or download the zip and extract
  • Enter the project folder in a bash terminal
  • run make
  • activate the venv with source ./venv/bin/activate or alternative script if not using bash...
  • run the script python src/training_image_processor.py

Usage

  • First open a directory with pictures
  • Use the buttons at the top to rotate/flip immages as needed
  • Resize selection square with mouse wheel
  • Click the image to process it and load the next image. The 512x512 image is placed in a new 'outputs' folder and the original goes in a new 'originals' folder, inside the open directory

Licensing

The contents under the assets folder are from the adwaita-icons project and are released under the license LGPL-v3.0. The rest of the project is GPL-v2 only.

About

badly coded gui for a quick streamlined workflow to produce 512x512 images suitable to train Stable Diffusion

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages