Exploring the efficiency of the combined application of connection pruning and source data preprocessing when training a multilayer perceptron
DOI: https://doi.org/10.15587/1729-4061.2020.200819

Keywords: multilayer perceptron, neural network, pruning, regularization, learning curve, weight coefficients

Abstract
A conventional scheme for operating neural networks has, until recently, been to fix the architecture of a neural network and then train it. However, recent research in this field has revealed that neural networks set up and configured in this way exhibit considerable redundancy. An additional step was therefore introduced: eliminating this redundancy by pruning connections in the neural network's architecture. Among the many approaches to eliminating redundancy, the most promising is the combined application of several methods, when their cumulative effect exceeds the sum of the effects of employing each of them separately. We have performed an experimental study of the effectiveness of combining iterative pruning with pre-processing (pre-distortion) of input data for the task of recognizing handwritten digits with a multilayer perceptron. It is shown that input data pre-processing regularizes the training procedure, thereby preventing overfitting. The combined application of iterative pruning and input pre-distortion yielded a smaller recognition error for handwritten digits, 1.22 %, than pruning alone (which decreased the error from 1.89 % to 1.81 %) or pre-distortion alone (which decreased the error from 1.89 % to 1.52 %). In addition, regularization by pre-distortion makes it possible to obtain a monotonically increasing number of pruned connections while keeping the error at 1.45 %. Learning curves obtained for the same task but starting from different initial conditions take different values both during training and at its end, which indicates the multi-extremal character of the quality function, that is, of the recognition accuracy. The practical implication of the study is our proposal to train a neural network several times and select the best result.
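As a concrete illustration of the combined scheme described above, the sketch below trains a small MLP on MNIST with random affine pre-distortions of the input images and then alternates fine-tuning with magnitude-based connection pruning. This is a minimal sketch, not the authors' exact setup: the layer sizes, distortion parameters, pruning fractions, and epoch counts are illustrative assumptions, and PyTorch's l1_unstructured pruning stands in for the iterative pruning procedure studied in the paper.

```python
# Minimal sketch (assumed hyperparameters, not the authors' configuration):
# input pre-distortions + iterative magnitude pruning for an MNIST MLP.
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

# Pre-distortions: small random affine perturbations of each training
# image act as a regularizer, in the spirit of Simard et al. (2003).
train_tf = transforms.Compose([
    transforms.RandomAffine(degrees=10, translate=(0.08, 0.08), scale=(0.9, 1.1)),
    transforms.ToTensor(),
])
test_tf = transforms.ToTensor()

train_ds = datasets.MNIST("data", train=True, download=True, transform=train_tf)
test_ds = datasets.MNIST("data", train=False, download=True, transform=test_tf)
train_dl = DataLoader(train_ds, batch_size=128, shuffle=True)
test_dl = DataLoader(test_ds, batch_size=1000)

# A small multilayer perceptron; the 784-300-100-10 sizes are illustrative.
model = nn.Sequential(
    nn.Flatten(),
    nn.Linear(784, 300), nn.ReLU(),
    nn.Linear(300, 100), nn.ReLU(),
    nn.Linear(100, 10),
)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

def train_epochs(n):
    model.train()
    for _ in range(n):
        for x, y in train_dl:
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()

def test_error():
    model.eval()
    wrong = 0
    with torch.no_grad():
        for x, y in test_dl:
            wrong += (model(x).argmax(1) != y).sum().item()
    return 100.0 * wrong / len(test_ds)

linear_layers = [m for m in model if isinstance(m, nn.Linear)]

# Iterative pruning: alternate short retraining phases with removal of the
# smallest-magnitude weights; pruned connections stay masked at zero.
train_epochs(5)
for step in range(5):
    for layer in linear_layers:
        # Prune 20 % of the still-unpruned weights in each layer.
        prune.l1_unstructured(layer, name="weight", amount=0.2)
    train_epochs(2)  # fine-tune the surviving connections
    print(f"pruning step {step + 1}: test error {test_error():.2f} %")
```

Because the quality function (recognition accuracy) is multi-extremal, the whole procedure would in practice be repeated from several random initializations (e.g., different torch.manual_seed values), keeping the network with the lowest test error, as the abstract proposes.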
License
Copyright (c) 2020 Oleg Galchonkov, Alexander Nevrev, Maria Glava, Mykola Babych
This work is licensed under a Creative Commons Attribution 4.0 International License.