Research of image compression algorithms using neural networks

Authors

DOI:

https://doi.org/10.31498/2225-6733.49.1.2024.321212

Keywords:

autoencoder, image compression, neural networks, JPEG, compression algorithms

Abstract

The article presents the results of the study of image compression algorithms based on neural networks. Classical compression methods, such as JPEG, PNG, GIF, TIFF, are analyzed, and the advantages of neural network methods, in particular the use of an autoencoder, a variational autoencoder, and generative adversarial networks, are highlighted. It is concluded that the main advantages of neural network methods are the preservation of a high level of textures and details at low bitrates, as well as the ability to work with high-quality images, although this requires significant computing resources. A comparative analysis of classical compression algorithms, such as JPEG, with new approaches based on neural networks is carried out using the example of an autoencoder, and the prospects of neural networks in solving the problem of data compression are assessed. The main emphasis is placed on the analysis of the quality of image restoration and the level of compression using different neural network settings. A mathematical model is presented that describes the principle of operation of an autoencoder and shows how a neural network encodes and restores images using latent space. To achieve the best reconstruction quality, a hybrid loss function was used, which consists of three components: perceptual loss based on VGG16, SSIM loss, and MSE loss. A modular software system was developed using the Python programming language to conduct experiments. The software includes a graphical interface, a compression module for performing image encoding and decoding operations using an autoencoder model, and a quality assessment module for calculating the main quality metrics (PSNR and SSIM). It was found that traditional image compression methods demonstrate high efficiency, but are more prone to generating artifacts, especially at high compression levels, than neural network methods. As a result of the research, it was found that the autoencoder model can encode and decode images with minimal loss of quality, on a par with JPEG, but is inferior to classical algorithms in speed (1.6 seconds per image versus 0.02 for JPEG) and compression ratio (the model provides a reduction in file size by 11–18%). It is concluded that without reducing the need for computational resources, neural network compression methods will not be able to replace classical methods

Author Biographies

I. Marchenko, State Higher Education Institution «Priazovskyi state technical university», Dnipro

PhD (Engineering), associate professor

O. Balalaieva, State Higher Education Institution «Priazovskyi state technical university», Dnipro

PhD (Engineering), associate professor

H. Korotenko, Dnipro University of Technology, Dnipro

Dsc (Engineering), associate professor 

M. Tarazanov , State Higher Education Institution «Priazovskyi state technical university», Dnipro

Master's student

References

Li X., Ji S. Neural Image Compression and Explanation. IEEE Access. 2020. Vol. 8. Pp. 214605-214615. DOI: https://doi.org/10.1109/ACCESS.2020.3041416.

Ballé J., Laparra V., Simoncelli E.P. End-to-end Optimized Image Compression. ICLR 2017 : 5th International Conference on Learning Representations, Toulon, France, 24-26 April 2017. Pp. 1-27. DOI: https://doi.org/10.48550/arXiv.1611.01704.

Autoencoders and their applications in machine learning: a survey / K. Berahmand Fet al. Artificial Intelligence Review. 2024. Vol. 57(2). Pp. 1-52. DOI: https://doi.org/10.1007/s10462-023-10662-6.

Kingma D.P., Welling M. An introduction to variational autoencoders. Foundations and trends in machine learning. 2019. Vol. 12(4). Pp. 307-392. DOI: https://doi.org/10.48550/arXiv.1312.6114.

Generative Adversarial Nets / I. Goodfellow et al. Proceedings of the International Conference on Neural Information Processing Systems, Montreal, Canada, 8-13 December 2014. Vol. 3(11). Pp. 2672-2680. DOI: https://doi.org/10.1145/2969033.2969125.

High-Fidelity Generative Image Compression / Mentzer F., Toderici G., Tschannen M., Agustsson E. NeurIPS 2020 : Proceedings of the 34th Conference on Neural Information Processing Systems, Vancouver, Canada, 6-12 December 2020. Pp. 11913-11924. DOI: https://doi.org/10.5555/3495724.3496723.

Bank D., Koenigstein N., Giryes R. Autoencoders. Machine Learning for Data Science Handbook / ed. by Rokach L., Maimon O., Shmueli E. Springer, Cham, 2023. Pp. 353-374. DOI: https://doi.org/10.1007/978-3-031-24628-9_16.

Image quality assessment: from error visibility to structural similarity / Wang Zh., Bovik A. C. , Sheikh H. R., Simoncelli E. P. IEEE Transactions on Image Processing. 2004. Vol.13. No. 4. Pp. 600-612. DOI: https://doi.org/10.1109/TIP.2003.819861.

Published

2024-12-26

How to Cite

Marchenko, I. ., Balalaieva, O. ., Korotenko, H. ., & Tarazanov , M. . (2024). Research of image compression algorithms using neural networks. Reporter of the Priazovskyi State Technical University. Section: Technical Sciences, 1(49), 85–99. https://doi.org/10.31498/2225-6733.49.1.2024.321212