Joint Inference for Neural Network Depth and Dropout Regularization

Dropout regularization methods prune a neural network's pre-determined backbone structure to avoid overfitting. However, a deep model still tends to be poorly calibrated, with high confidence on incorrect predictions. We propose a unified Bayesian model selection method that jointly infers the most plausible network depth warranted by the data and performs dropout regularization at the same time. In particular, to infer network depth we define a beta process over the number of hidden layers, which allows the depth to go to infinity. Layer-wise activation probabilities induced by the beta process modulate neuron activation via binary vectors of a conjugate Bernoulli process. Experiments across domains show that by adapting network depth and dropout regularization to data, our method achieves superior performance compared to state-of-the-art methods, with well-calibrated uncertainty estimates. In continual learning, our method enables neural networks to dynamically expand their depths and neuron activations beyond their initial structures to accommodate incrementally available data, and alleviates catastrophic forgetting.
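To make the generative idea concrete, here is a minimal sketch and not the repository's implementation. It assumes a truncated stick-breaking construction of the beta process, in which the activation probability of layer k is a running product of Beta draws, and it samples one binary mask per layer from the matching Bernoulli process. The function names, the truncation level `max_layers`, and the hyperparameter values are hypothetical and chosen only for illustration.

```python
import random


def sample_layer_probs(alpha, beta, max_layers, rng):
    """Assumed stick-breaking draw: pi_k = prod_{j<=k} nu_j, nu_j ~ Beta(alpha, beta)."""
    probs, stick = [], 1.0
    for _ in range(max_layers):
        stick *= rng.betavariate(alpha, beta)  # each factor lies in [0, 1]
        probs.append(stick)                    # so pi_k never increases with depth
    return probs


def sample_masks(probs, width, rng):
    """Bernoulli process: one binary activation per neuron, per layer."""
    return [[1 if rng.random() < p else 0 for _ in range(width)] for p in probs]


def effective_depth(masks):
    """Index of the deepest layer that keeps at least one active neuron."""
    depth = 0
    for k, mask in enumerate(masks, start=1):
        if any(mask):
            depth = k
    return depth


rng = random.Random(0)  # fixed seed so the sketch is repeatable
probs = sample_layer_probs(alpha=5.0, beta=1.0, max_layers=10, rng=rng)
masks = sample_masks(probs, width=8, rng=rng)
print("layer activation probabilities:", [round(p, 3) for p in probs])
print("effective depth:", effective_depth(masks))
```

Because the activation probabilities shrink with depth, deeper layers tend to have fewer active neurons. A layer whose mask is all zeros contributes nothing, which is one way to see how depth and dropout can be inferred together.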

The basic codebase is implemented in Python 3.7.6 and is provided in the experiments folder. The package versions used for development are as follows:

1. torch        1.5.0
2. torchvision  0.4.2
3. numpy        1.18.1
4. pandas       1.0.1
5. matplotlib   3.1.3
6. seaborn      0.10.0
7. tqdm         4.42.1
8. argparse     1.1
9. texttable    1.6.2

Install all requirements and the Depth_and_Dropout package using the following commands:

pip install -r requirements.txt
pip install -e .
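After installation, it can help to confirm that the environment matches the versions listed above. The following is a small optional sketch, not part of the repository: `PINNED` and `check_versions` are hypothetical names, and `importlib.metadata` requires Python 3.8 or newer (on Python 3.7 the `importlib_metadata` backport would be needed instead).

```python
from importlib import metadata  # standard library from Python 3.8 onward

# Versions copied from the list above. argparse ships with the standard
# library, so it is left out of this check.
PINNED = {
    "torch": "1.5.0",
    "torchvision": "0.4.2",
    "numpy": "1.18.1",
    "pandas": "1.0.1",
    "matplotlib": "3.1.3",
    "seaborn": "0.10.0",
    "tqdm": "4.42.1",
    "texttable": "1.6.2",
}


def check_versions(pinned, get_version=metadata.version):
    """Return {name: (wanted, found, matches)}; found is None if not installed."""
    report = {}
    for name, wanted in pinned.items():
        try:
            found = get_version(name)
        except metadata.PackageNotFoundError:
            found = None
        # Exact string comparison: a local build tag such as "+cu101"
        # would be reported as a mismatch.
        report[name] = (wanted, found, found == wanted)
    return report


if __name__ == "__main__":
    for name, (wanted, found, ok) in check_versions(PINNED).items():
        status = "ok" if ok else "differs"
        print(f"{name:12s} wanted {wanted:8s} found {found} ({status})")
```

A mismatch does not necessarily mean the code will fail; the list only records the versions used during development.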

Change to the experiments directory:

cd experiments

Synthetic experiments

Train the model on the synthetic experiments:

python synthetic_experiments.py

Image experiments

Train the model for image classification on the MNIST dataset:

python image_experiments.py

Citation

If you find this code useful, please consider citing our paper:

Kishan K C, Rui Li, Mahdi Gilany (2021). Joint Inference for Neural Network Depth and Dropout Regularization. Advances in Neural Information Processing Systems.

@inproceedings{kc2021DepthandDropout,
 title={Joint Inference for Neural Network Depth and Dropout Regularization},
 author={K C, Kishan and Li, Rui and Gilany, Mahdi},
 booktitle = {Advances in Neural Information Processing Systems},
 year = {2021}
}
