My attempt at Kaggle's Denoising Dirty Documents competition.

petrpan26/DenoisingDirtyDocuments

 
 

Denoising Dirty Documents

Introduction

This project is based on Kaggle's Denoising Dirty Documents competition (https://www.kaggle.com/c/denoising-dirty-documents). The challenge is to remove different types of synthetic noise from scanned text.

Approach

Small windows of the scanned text are passed through an autoencoder-like neural network. The detailed architecture can be found in the code and the project report.
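The window-based idea can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names (`extract_windows`, `denoise_image`, `threshold_window`), the window size, and the stride are all assumptions made for the example, and `threshold_window` is only a stand-in for the trained autoencoder-like network. The sketch cuts a grayscale image into overlapping square windows, denoises each window independently, and averages the overlapping predictions back into a full image.

```python
from typing import Callable, Iterator, List, Tuple

# Grayscale image as rows of pixel intensities in [0, 1] (assumed format).
Image = List[List[float]]


def extract_windows(image: Image, size: int, stride: int) -> Iterator[Tuple[int, int, Image]]:
    """Yield (row, col, window) for every size x size window that fits."""
    height, width = len(image), len(image[0])
    for r in range(0, height - size + 1, stride):
        for c in range(0, width - size + 1, stride):
            window = [row[c:c + size] for row in image[r:r + size]]
            yield r, c, window


def denoise_image(image: Image, size: int, stride: int,
                  denoise_window: Callable[[Image], Image]) -> Image:
    """Denoise each window and average the overlapping predictions."""
    height, width = len(image), len(image[0])
    total = [[0.0] * width for _ in range(height)]
    count = [[0] * width for _ in range(height)]
    for r, c, window in extract_windows(image, size, stride):
        cleaned = denoise_window(window)
        for i in range(size):
            for j in range(size):
                total[r + i][c + j] += cleaned[i][j]
                count[r + i][c + j] += 1
    # Pixels not covered by any window keep their original value.
    return [
        [total[y][x] / count[y][x] if count[y][x] else image[y][x]
         for x in range(width)]
        for y in range(height)
    ]


def threshold_window(window: Image) -> Image:
    """Hypothetical stand-in for the trained network: push light pixels to white."""
    return [[1.0 if p > 0.5 else p for p in row] for row in window]
```

In a real pipeline, the `denoise_window` argument would wrap a forward pass of the trained network; for instance, `denoise_image(image, 2, 1, threshold_window)` runs the sketch with the placeholder above. Averaging overlapping windows is one common way to avoid visible seams at window borders, though the project may reassemble its output differently.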

Some demos (from the competition's test files)

Before:

[Image: Before]

After:

[Image: After]

Before:

[Image: Before]

After:

[Image: After]
