Research of image compression algorithms using neural networks
DOI:
https://doi.org/10.31498/2225-6733.49.1.2024.321212Keywords:
autoencoder, image compression, neural networks, JPEG, compression algorithmsAbstract
The article presents the results of the study of image compression algorithms based on neural networks. Classical compression methods, such as JPEG, PNG, GIF, TIFF, are analyzed, and the advantages of neural network methods, in particular the use of an autoencoder, a variational autoencoder, and generative adversarial networks, are highlighted. It is concluded that the main advantages of neural network methods are the preservation of a high level of textures and details at low bitrates, as well as the ability to work with high-quality images, although this requires significant computing resources. A comparative analysis of classical compression algorithms, such as JPEG, with new approaches based on neural networks is carried out using the example of an autoencoder, and the prospects of neural networks in solving the problem of data compression are assessed. The main emphasis is placed on the analysis of the quality of image restoration and the level of compression using different neural network settings. A mathematical model is presented that describes the principle of operation of an autoencoder and shows how a neural network encodes and restores images using latent space. To achieve the best reconstruction quality, a hybrid loss function was used, which consists of three components: perceptual loss based on VGG16, SSIM loss, and MSE loss. A modular software system was developed using the Python programming language to conduct experiments. The software includes a graphical interface, a compression module for performing image encoding and decoding operations using an autoencoder model, and a quality assessment module for calculating the main quality metrics (PSNR and SSIM). It was found that traditional image compression methods demonstrate high efficiency, but are more prone to generating artifacts, especially at high compression levels, than neural network methods. As a result of the research, it was found that the autoencoder model can encode and decode images with minimal loss of quality, on a par with JPEG, but is inferior to classical algorithms in speed (1.6 seconds per image versus 0.02 for JPEG) and compression ratio (the model provides a reduction in file size by 11–18%). It is concluded that without reducing the need for computational resources, neural network compression methods will not be able to replace classical methods
References
Li X., Ji S. Neural Image Compression and Explanation. IEEE Access. 2020. Vol. 8. Pp. 214605-214615. DOI: https://doi.org/10.1109/ACCESS.2020.3041416.
Ballé J., Laparra V., Simoncelli E.P. End-to-end Optimized Image Compression. ICLR 2017 : 5th International Conference on Learning Representations, Toulon, France, 24-26 April 2017. Pp. 1-27. DOI: https://doi.org/10.48550/arXiv.1611.01704.
Autoencoders and their applications in machine learning: a survey / K. Berahmand Fet al. Artificial Intelligence Review. 2024. Vol. 57(2). Pp. 1-52. DOI: https://doi.org/10.1007/s10462-023-10662-6.
Kingma D.P., Welling M. An introduction to variational autoencoders. Foundations and trends in machine learning. 2019. Vol. 12(4). Pp. 307-392. DOI: https://doi.org/10.48550/arXiv.1312.6114.
Generative Adversarial Nets / I. Goodfellow et al. Proceedings of the International Conference on Neural Information Processing Systems, Montreal, Canada, 8-13 December 2014. Vol. 3(11). Pp. 2672-2680. DOI: https://doi.org/10.1145/2969033.2969125.
High-Fidelity Generative Image Compression / Mentzer F., Toderici G., Tschannen M., Agustsson E. NeurIPS 2020 : Proceedings of the 34th Conference on Neural Information Processing Systems, Vancouver, Canada, 6-12 December 2020. Pp. 11913-11924. DOI: https://doi.org/10.5555/3495724.3496723.
Bank D., Koenigstein N., Giryes R. Autoencoders. Machine Learning for Data Science Handbook / ed. by Rokach L., Maimon O., Shmueli E. Springer, Cham, 2023. Pp. 353-374. DOI: https://doi.org/10.1007/978-3-031-24628-9_16.
Image quality assessment: from error visibility to structural similarity / Wang Zh., Bovik A. C. , Sheikh H. R., Simoncelli E. P. IEEE Transactions on Image Processing. 2004. Vol.13. No. 4. Pp. 600-612. DOI: https://doi.org/10.1109/TIP.2003.819861.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
The journal «Reporter of the Priazovskyi State Technical University. Section: Technical sciences» is published under the CC BY license (Attribution License).
This license allows for the distribution, editing, modification, and use of the work as a basis for derivative works, even for commercial purposes, provided that proper attribution is given. It is the most flexible of all available licenses and is recommended for maximum dissemination and use of non-restricted materials.
Authors who publish in this journal agree to the following terms:
1. Authors retain the copyright of their work and grant the journal the right of first publication under the terms of the Creative Commons Attribution License (CC BY). This license allows others to freely distribute the published work, provided that proper attribution is given to the original authors and the first publication of the work in this journal is acknowledged.
2. Authors are allowed to enter into separate, additional agreements for non-exclusive distribution of the work in the same form as published in this journal (e.g., depositing it in an institutional repository or including it in a monograph), provided that a reference to the first publication in this journal is maintained.







