Text image compression based on statistical analysis and classification of the vertical line elements
DOI:
https://doi.org/10.15587/1729-4061.2014.26298Keywords:
text image compression, vertical line elements, statistical analysis, classificationAbstract
A new original method for text data image compression is presented. Vertical line elements rather than connecting symbols of the text image are used as the main processing element. The given probability model quite accurately describes possible distortions of the vertical line elements, caused by printing and scanning noise. Based on the accepted probability model and statistical analysis methods, minimum most plausible set of undistorted elements in the entire set of the investigated vertical line elements is found. For each vertical element, the probability that this element is a distortion of an element of the set of undistorted line elements is found. Classification of the vertical line elements of the text image is based on a probabilistic assessment of the classified elements belonging to a single center. The end result of the proposed method is forming a dictionary of connecting symbols of the text image, where each class is represented by its most probable image and an allocation map of connecting symbols on the plane of the studied image. The proposed text image processing method has allowed to obtain a relatively high compression ratio with good quality of the reconstructed image. Comparison with the currently best special text image compression algorithm - JB2, within the format DjVu, has shown that the proposed algorithm has the advantage in data compression ratio of about 37% in processing a text page image with a resolution of 300 dpi.References
- Technical Papers from AT&T Labs. Available at: http://djvuzone.org/techpapers/index.html.
- DjVu.org. Available at: http://www.djvu.org/
- Haffner, P., Bottou, L., Howard, P. G., LeCun, Y. (1999). DjVu: Analyzing and Compressing Scanned Documents for Internet Distribution. Fifth International Conference on Document Analysis and Recognition (ICDAR'99), 625. doi:10.1109/icdar.1999.791865
- JBIG2.com : An Introduction to JBIG2. Available at: http://jbig2.com/index.html
- Ayvazian, S. A., Bukhshtaber, V .M., Eniukov, I. S. (1989). Prikladnaja statistika: Klassifikacija i snizhenie razmernosti [Applied Statistics: Classification and reduction of dimensionality]. Moscow: Finansy i statistika, 1989, 607.
- Ivanov, V. G., Lomonosov, J. V., Lubarskiy, M. G. (2009). Compression of Images on the Busis of Automatic and Indistinct Classification of Fragments. Journal of Automation and information sciences, 41 (1), 27–39. doi:10.1615/jautomatinfscien.v41.i1.40
- Shlezinger, M. I. (1983). Matematicheskie sredstva obrabotki izobrazhenij [The mathematical methods of image processing]. Kiev: Naukova dumka, 200.
- Gonsales, R. C., Woods, R. E. (2005). Digital Image Processing. Moscow, Tekhnosfera, 1072.
- Zemskov, V. N., Kim, I. S. (2003). Szhatie izobrazhenij na osnove avtomaticheskoj klassifikacii [Image compression based on the automatic classification]. Izvestija vuzov. Jelektronika [Proceedings of the universities. Electronics], 2, 50-56.
- Mallat, S. (1989). Multiresolution Approximation and Wavelet Orthonormal Bases L2(R). Trans of the American Mathematical Society, 315 (1), 68–87. doi:10.1090/s0002-9947-1989-1008470-5
- Gupta, M. R., Stroilov, A. (2005). Segmenting for wavelet compression. Data Compression Conference (DCC’05), 462. doi:10.1109/dcc.2005.80
- Montiel, E., Aquado, A. S., Nixon, M. S. (2005). Texture classification via conditional histograms. Pattern Recognition Letters, 26 (11), 1740–1751. doi:10.1016/j.patrec.2005.02.004
- Lakhani, G. (2008). Improving Image Decomposition Method of the 3-MRC Coding of Scanned Compound Document Images. Sixth Indian Conference on Computer Vision, Graphics & Image Processing, 289–296. doi:10.1109/icvgip.2008.94
- Ding, W., Liu, D., He, Y., Wu, F. (2006). Block-based Fast Compression for Compound Images. IEEE International Conference on Multimedia and Expo, 809–812. doi:10.1109/icme.2006.262624
- Malvar, H. S. (2001). Fast Adaptive Encoder for Bi-Level Images. Data Compression Conference (DCC '01), 253. doi:10.1109/dcc.2001.917156
- Imura, H., Tanaka, Y. (2009). Compression and String Matching Method for Printed Document Images. 10th International Conference on Document Analysis and Recognition, 291–295. doi:10.1109/icdar.2009.182
- Ivanov, V. G., Lubarskiy, M. G., Lomonosov, J. V. (2010). Compression of Text Image Based on Characters and Their Classification. Journal of Automation and information sciences, 42 (11), 46–57. doi:10.1615/jautomatinfscien.v42.i11.50
- Ivanov, V. G., Lubarskiy, M. G., Lomonosov, J. V. (2011). Text Image Compression Based on the Formation and Classification of Vertical Elements of a Row in the Graphical Dictionary of Symbol Data. Journal of Automation and information sciences, 43(10), 29–41. doi:10.1615/jautomatinfscien.v43.i10.40
- Kanatnikov, A. N., Krishhenko, A .P. Funkcii neskol'kih peremennyh [Function of several variables]. Available at: http://mathmod.bmstu.ru/Docs/Eduwork/la_fnp/FNP-14.pdf
- Nekotorye materialy iz lekcij po analizu. Teorema o nejavnoj funkcii. [Some material from the lectures on analysis. The implicit function theorem]. Available at: http://new.math.msu.su/Sites/demosite/Uploads/Neyavnaya%20funktsiya.B6EF5654E2C8486FBA1EF9F186B32F1A.pdf
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2014 Владимир Георгиевич Иванов, Юрий Вячеславович Ломоносов, Михаил Григорьевич Любарский
This work is licensed under a Creative Commons Attribution 4.0 International License.
The consolidation and conditions for the transfer of copyright (identification of authorship) is carried out in the License Agreement. In particular, the authors reserve the right to the authorship of their manuscript and transfer the first publication of this work to the journal under the terms of the Creative Commons CC BY license. At the same time, they have the right to conclude on their own additional agreements concerning the non-exclusive distribution of the work in the form in which it was published by this journal, but provided that the link to the first publication of the article in this journal is preserved.
A license agreement is a document in which the author warrants that he/she owns all copyright for the work (manuscript, article, etc.).
The authors, signing the License Agreement with TECHNOLOGY CENTER PC, have all rights to the further use of their work, provided that they link to our edition in which the work was published.
According to the terms of the License Agreement, the Publisher TECHNOLOGY CENTER PC does not take away your copyrights and receives permission from the authors to use and dissemination of the publication through the world's scientific resources (own electronic resources, scientometric databases, repositories, libraries, etc.).
In the absence of a signed License Agreement or in the absence of this agreement of identifiers allowing to identify the identity of the author, the editors have no right to work with the manuscript.
It is important to remember that there is another type of agreement between authors and publishers – when copyright is transferred from the authors to the publisher. In this case, the authors lose ownership of their work and may not use it in any way.