Combined method for scanned documents images segmentation using sequential extraction of regions

Authors

DOI:

https://doi.org/10.15587/1729-4061.2018.142735

Keywords:

image segmentation, scanned document, block method, graphics, photographic image, text fragment, connected component, bloomberg method

Abstract

We propose a combined method to segment the images of scanned documents, which, in contrast to known methods, implies a preliminary separation of the graphics and photograph regions from the text regions and a background. In this case, an analysis of the connected components is performed, which are different for graph­ics, photographs, and text regions. In order to classify the selected regions into the photograph and graphics regions, a block method is employed. It was established that such a technique for splitting the regions into blocks less affects the quality of segmentation when compared to applying the block method directly to the original im­age. To extract the text regions that are more complex in their shape from the background, the neighborhood of each pixel was processed.

To detect the boundaries of illustrations on the images of scanned documents, we applied the bloomberg method. In order to classify into photographs and graphics, it is proposed to split an illustration into blocks of pixels. Each block of pixels is identified with a vector of two features: the mean value of the local gradient magnitude, and the mean value of the function that localizes at the images of scanned documents the linear objects (graphics and text characters). The derived feature vectors were classified using a sup­port vector machine.

When extracting the text regions, we applied a low-frequency filtering and a thresholding.

The combined method was implemented in practice to segment the test images of scanned newspaper articles from the document da­tabase mediateam at oulu university (finland). It was established that the combined method is characterized by an increase in perfor­mance speed during image segmentation at high quality processing.

Author Biographies

Marina Polyakova, Odessa National Polytechnic University Shevchenka ave., 1, Odessa, Ukraine, 65044

Doctor of Technical Sciences, Associate Professor

Department of Applied Mathematics and Information Technologies

Alesya Ishchenko, Odessa National Polytechnic University Shevchenka ave., 1, Odessa, Ukraine, 65044

Senior Lecturer

Department of Applied Mathematics and Information Technologies

Natalya Volkova, Odessa National Polytechnic University Shevchenka ave., 1, Odessa, Ukraine, 65044

Senior Lecturer

Department of Applied Mathematics and Information Technologies

Oleg Pavlov, Odessa National Polytechnic University Shevchenka ave., 1, Odessa, Ukraine, 65044

Senior Lecturer

Department of Applied Mathematics and Information Technologies

References

  1. Haneda, E., Bouman, C. A. (2011). Text Segmentation for MRC Document Compression. IEEE Transactions on Image Processing, 20 (6), 1611–1626. doi: https://doi.org/10.1109/tip.2010.2101611
  2. Polyakova, M., Ishchenko, A., Huliaieva, N. (2018). Document image segmentation using averaging filtering and mathematical morphology. 2018 14th International Conference on Advanced Trends in Radioelecrtronics, Telecommunications and Computer Engineering (TCSET). doi: https://doi.org/10.1109/tcset.2018.8336354
  3. El-Omari, N. K. T., Omari, A., Al-Badarneh, O., Abdel-Jaber, H. (2012). Scanned Document Image Segmentation Using Back-Propagation Artificial Neural Network Based Technique. International Journal of Computers and Communications, 6 (4), 183–190. Available at: https://www.naun.org/main/UPress/cc/16-060.pdf
  4. Sasirekha, D., Chandra, E. (2012). Enhanced techniques for PDF image segmentation and text extraction. International Journal of Electronics and Computer Science Engineering. 2012. Vol. 10, Issue 9. P. 1833–1838.
  5. Korennoy, A. V., Yudakov, D. S., Dedov, S. V., Strazhnik, V. P. (2015). Obnaruzhenie i lokalizaciya tekstovyh oblastey na polutonovyh cifrovyh izobrazheniyah. Vestnik VGU. Sistemnyy analiz i informacionnye tekhnologii, 4, 65–72.
  6. Kundu, M. K., Dhar, S., Banerjee, M. (2012). A new approach for segmentation of image and text in natural and commercial color documents. 2012 International Conference on Communications, Devices and Intelligent Systems (CODIS). doi: https://doi.org/10.1109/codis.2012.6422142
  7. Abdullah, H. S., Jassim, A. H. (2016). Improved fuzzy c-means for document image segmentation. British Journal of Science, 14 (2), 1–15.
  8. Abdullah, H. S., Jasim, A. H. (2016). Improved Ant Colony Optimization for Document Image Segmentation. International Journal of Computer Science and Information Security (IJCSIS), 14 (11), 775–785.
  9. Erkilinc, M. S., Jaber, M., Saber, E., Bauer, P., Depalov, D. (2012). Text, photo, and line extraction in scanned documents. Journal of Electronic Imaging, 21 (3), 033006. doi: https://doi.org/10.1117/1.jei.21.3.033006
  10. Bukhari, S. S., Shafait, F., Breuel, T. M. (2011). Improved document image segmentation algorithm using multiresolution morphology. Document Recognition and Retrieval XVIII. doi: https://doi.org/10.1117/12.873461
  11. Zirari, F., Ennaji, A., Nicolas, S., Mammass, D. (2013). A Document Image Segmentation System Using Analysis of Connected Components. 2013 12th International Conference on Document Analysis and Recognition. doi: https://doi.org/10.1109/icdar.2013.154
  12. Bukhari, S. S., Al Azawi, M. I. A., Shafait, F., Breuel, T. M. (2010). Document image segmentation using discriminative learning over connected components. Proceedings of the 8th IAPR International Workshop on Document Analysis Systems – DAS ’10. doi: https://doi.org/10.1145/1815330.1815354
  13. Gonsales, R., Vuds, R. (2005). Cifrovaya obrabotka izobrazheniy. Moscow: Tekhnosfera, 1072.
  14. Frangi, A. F., Niessen, W. J., Vincken, K. L., Viergever, M. A. (1998). Multiscale vessel enhancement filtering. Lecture Notes in Computer Science, 130–137. doi: https://doi.org/10.1007/bfb0056195
  15. Mandel', I. D. (1988). Klasterniy analiz. Moscow: Finansy i statistika, 176.
  16. Chu, W., Keerthi, S. S., Ong, C. J. (2002). A general formulation for support vector machines. Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02. doi: https://doi.org/10.1109/iconip.2002.1201949
  17. Otsu, N. (1979). A Threshold Selection Method from Gray-Level Histograms. IEEE Transactions on Systems, Man, and Cybernetics, 9 (1), 62–66. doi: https://doi.org/10.1109/tsmc.1979.4310076
  18. Sauvola, J., Kauniskangas, H. (1999). MediaTeam Document Database II: a collection of document images. University of Oulu. Finland.

Downloads

Published

2018-09-21

How to Cite

Polyakova, M., Ishchenko, A., Volkova, N., & Pavlov, O. (2018). Combined method for scanned documents images segmentation using sequential extraction of regions. Eastern-European Journal of Enterprise Technologies, 5(2 (95), 6–15. https://doi.org/10.15587/1729-4061.2018.142735