Development of the descriptive binary model and its application for identification of clumps of toxic cyanobacteria

Authors

DOI:

https://doi.org/10.15587/1729-4061.2017.108285

Keywords:

descriptive models, dynamical systems, binary data, parsimony, data mining, clumps of toxic cyanobacteria

Abstract

In the paper, a descriptive model of system dynamics for binary data is presented. Binary or dichotomous data are widely spread across various fields of research – in decision making and data mining, marketing, solving of many natural, social and technical problems. The initial data for building the model is a set of states of an autonomous dynamical system with components taking binary values. At the same time, the time order of the states is permissible. The following objectives were stated: to identify the relationships between the components of the system defining its dynamics; on the basis of the identified dynamics, to recover the true order of the system states; to apply the developed model to the problem of visual identification of cyanobacteria in water areas using digital photography.

To solve the problem, we used a mathematical model that enables to describe the relationships between components and transitions between the system states at a simple-for-understanding level. The principle of parsimony underlies the model. According to this principle, the most appropriate model is described by the simplest relations in the sense defined in the work.

As the case study, the problem of recognizing clumps of cyanobacteria from digital satellite imagery was considered. This is a complex, practically important problem that does not have a satisfactory experimental and theoretical solution at the moment. Applying system approaches to the measured colorimetric parameters of digital photography, we developed the index for identification of such clumps. This index uses the parameters of the digital RGB model of (various parts of) an image and allows us to reveal clumps of cyanobacteria on digital images obtained by aerospace methods. Additionally, digital photography can be performed in the conditions of insufficient visibility (due to precipitation, fog, and other factors), for imitation of which in the case study the original image was distorted by the digital noise.

The studied model can find useful applications in the areas requiring binary dynamical data insights

Author Biographies

Konstantin Nosov, V. N. Karazin Kharkiv National University Svobody sq., 4, Kharkiv, Ukraine, 61022

PhD, Research Fellow

Scientific research unit

Grygoriy Zholtkevych, V. N. Karazin Kharkiv National University Svobody sq., 4, Kharkiv, Ukraine, 61022

Doctor of Technical Sciences, PhD, Professor

Department of Theoretical and Applied Computer Science.

Marine Georgiyants, Kharkiv Medical Academy of Postgraduate Education Amosova str., 58, Kharkіv, Ukraine, 61176

MD, Professor

Department of Pediatrics Anesthesiology and Intensive Therapy

Olena Vysotska, Kharkiv National University of Radio Electronics Nauky ave., 14, Kharkiv, Ukraine, 61166

Doctor of Technical Sciences, Professor

Department of Biomedical Engineering

Yuriі Balym, Kharkiv State Zooveterinary Academy Academichna str., 1, Malaya Danylivka, Dergachi district, Kharkiv region, Ukraine, 62341

Doctor of Veterinary Science, Professor

Department of Reproductology

Andrеi Porvan, Kharkiv National University of Radio Electronics Nauky ave., 14, Kharkiv, Ukraine, 61166

PhD

Department of Biomedical Engineering

References

  1. Zholtkevych, G. N., Bespalov, G. Y., Nosov, K. V., Abhishek, M. (2013). Discrete Modeling of Dynamics of Zooplankton Community at the Different Stages of an Antropogeneous Eutrophication. Acta Biotheoretica, 61 (4), 449–465. doi: 10.1007/s10441-013-9184-6
  2. Zholtkevych, G., Nosov, K., Bespalov, Yu., Rak, L., Vysotskaya, E., Balkova, Y., Kolomiychenko, V. (2016) Descriptive Models of System Dynamics. Proceedings of the 12th International Conference on ICT in Education, Research and Industrial Applications. Integration, Harmonization and Knowledge Transfer, 1614, 57–72
  3. Carmichael, W. W. (2001). Health Effects of Toxin-Producing Cyanobacteria: “The CyanoHABs.” Human and Ecological Risk Assessment: An International Journal, 7 (5), 1393–1407. doi: 10.1080/20018091095087
  4. Zohary, T., Robarts, R. D. (1990). Hyperscums and the population dynamics of Microcystis aeruginosa. Journal of Plankton Research, 12 (2), 423–432. doi: 10.1093/plankt/12.2.423
  5. Karabashev, G. S., Evdoshenko, M. A. (2015). Spektral'nye priznaki cveteniya cianobakterij v Baltijskom more po dannym skanera MODIS. Sovremennye problemy distancionnogo zondirovaniya Zemli iz kosmosa, 12 (3), 158–170.
  6. Vysotska, O. V., Bespalov, Yu. G., Pecherska, A. I., Parvadov, D. A. (2016). Using of margalef succession model in remote detection technologies for indications of human impact on vegetation cover. Radioelektronni i komp'uterni sistemi, 2 (76), 15–19.
  7. Vysotskaya, E. V., Zholtkevych, G. N., Klochko, T. A., Bespalov, Yu. G., Nosov, K. V. (2016). Unmasking the soil cover’s disruption by modeling the dynamics of ground vegetation parameters. Visnyk Natsionalnoho Tekhnichnoho Universytetu Ukrayiny «KPI». Seriya – Radiotekhnika. Radioaparatobuduvannya, 64, 101–109.
  8. Shulika, B., Porvan, A., Vysotska, O., Nekos, A., Zhemerov, A. (2017). Control over grape yield in the north-eastern region of Ukraine using mathematical modeling. Eastern-European Journal of Enterprise Technologies, 2 (3 (86)), 51–59. doi: 10.15587/1729-4061.2017.97969
  9. De Menezes, F. S., Liska, G. R., Cirillo, M. A., Vivanco, M. J. F. (2017). Data classification with binary response through the Boosting algorithm and logistic regression. Expert Systems with Applications, 69, 62–73. doi: 10.1016/j.eswa.2016.08.014
  10. Pierola, A., Epifanio, I., Alemany, S. (2016). An ensemble of ordered logistic regression and random forest for child garment size matching. Computers & Industrial Engineering, 101, 455–465. doi: 10.1016/j.cie.2016.10.013
  11. Ghattas, B., Michel, P., Boyer, L. (2017). Clustering nominal data using unsupervised binary decision trees: Comparisons with the state of the art methods. Pattern Recognition, 67, 177–185. doi: 10.1016/j.patcog.2017.01.031
  12. Yamamoto, M., Hayashi, K. (2015). Clustering of multivariate binary data with dimension reduction via L1-regularized likelihood maximization. Pattern Recognition, 48 (12), 3959–3968. doi: 10.1016/j.patcog.2015.05.026
  13. Katahira, K. (2016). How hierarchical models improve point estimates of model parameters at the individual level. Journal of Mathematical Psychology, 73, 37–58. doi: 10.1016/j.jmp.2016.03.007
  14. Erkan Ozkaya, H., Dabas, C., Kolev, K., Hult, G. T. M., Dahlquist, S. H., Manjeshwar, S. A. (2013). An assessment of hierarchical linear modeling in international business, management, and marketing. International Business Review, 22 (4), 663–677. doi: 10.1016/j.ibusrev.2012.10.002
  15. Van Oirbeek, R., Lesaffre, E. (2012). Assessing the predictive ability of a multilevel binary regression model. Computational Statistics & Data Analysis, 56 (6), 1966–1980. doi: 10.1016/j.csda.2011.11.023
  16. Asar, Ö., İlk, Ö. (2014). Flexible multivariate marginal models for analyzing multivariate longitudinal data, with applications in R. Computer Methods and Programs in Biomedicine, 115 (3), 135–146. doi: 10.1016/j.cmpb.2014.04.005
  17. Zou, G., Donner, A. (2011). Extension of the modified Poisson regression model to prospective studies with correlated binary data. Statistical Methods in Medical Research, 22 (6), 661–670. doi: 10.1177/0962280211427759
  18. Schildcrout, J. S., Heagerty, P. J. (2006). Marginalized Models for Moderate to Long Series of Longitudinal Binary Response Data. Biometrics, 63 (2), 322–331. doi: 10.1111/j.1541-0420.2006.00680.x
  19. Kupek, E. (2006). Beyond logistic regression: structural equations modelling for binary variables and its application to investigating unobserved confounders. BMC Medical Research Methodology, 6 (1). doi: 10.1186/1471-2288-6-13
  20. Rhemtulla, M., Brosseau-Liard, P. É., Savalei, V. (2012). When can categorical variables be treated as continuous? A comparison of robust continuous and categorical SEM estimation methods under suboptimal conditions. Psychological Methods, 17 (3), 354–373. doi: 10.1037/a0029315
  21. Pearl, J. (2009). Causal inference in statistics: An overview. Statistics Surveys, 3, 96–146. doi: 10.1214/09-ss057
  22. Bollen, K. A., Pearl, J. (2013). Eight Myths About Causality and Structural Equation Models. Handbook of Causal Analysis for Social Research, 301–328. doi: 10.1007/978-94-007-6094-3_15
  23. LANCE-MODIS Collection 6. Available at: https://lance3.modaps.eosdis.nasa.gov

Downloads

Published

2017-08-30

How to Cite

Nosov, K., Zholtkevych, G., Georgiyants, M., Vysotska, O., Balym, Y., & Porvan, A. (2017). Development of the descriptive binary model and its application for identification of clumps of toxic cyanobacteria. Eastern-European Journal of Enterprise Technologies, 4(4 (88), 4–11. https://doi.org/10.15587/1729-4061.2017.108285

Issue

Section

Mathematics and Cybernetics - applied aspects