This project is based on Kaggle's competition: https://www.kaggle.com/c/denoising-dirty-documents The challenge is to removed different types of synthetic noises from scanned texts.
Small windows (e.g ) of the scanned texts are passed through an autoencoder-like neural network. Detailed architecture can be found in code and project report.