Development of the descriptive binary model and its application for identification of clumps of toxic cyanobacteria
DOI:
https://doi.org/10.15587/1729-4061.2017.108285Keywords:
descriptive models, dynamical systems, binary data, parsimony, data mining, clumps of toxic cyanobacteriaAbstract
In the paper, a descriptive model of system dynamics for binary data is presented. Binary or dichotomous data are widely spread across various fields of research – in decision making and data mining, marketing, solving of many natural, social and technical problems. The initial data for building the model is a set of states of an autonomous dynamical system with components taking binary values. At the same time, the time order of the states is permissible. The following objectives were stated: to identify the relationships between the components of the system defining its dynamics; on the basis of the identified dynamics, to recover the true order of the system states; to apply the developed model to the problem of visual identification of cyanobacteria in water areas using digital photography.
To solve the problem, we used a mathematical model that enables to describe the relationships between components and transitions between the system states at a simple-for-understanding level. The principle of parsimony underlies the model. According to this principle, the most appropriate model is described by the simplest relations in the sense defined in the work.
As the case study, the problem of recognizing clumps of cyanobacteria from digital satellite imagery was considered. This is a complex, practically important problem that does not have a satisfactory experimental and theoretical solution at the moment. Applying system approaches to the measured colorimetric parameters of digital photography, we developed the index for identification of such clumps. This index uses the parameters of the digital RGB model of (various parts of) an image and allows us to reveal clumps of cyanobacteria on digital images obtained by aerospace methods. Additionally, digital photography can be performed in the conditions of insufficient visibility (due to precipitation, fog, and other factors), for imitation of which in the case study the original image was distorted by the digital noise.
The studied model can find useful applications in the areas requiring binary dynamical data insightsReferences
- Zholtkevych, G. N., Bespalov, G. Y., Nosov, K. V., Abhishek, M. (2013). Discrete Modeling of Dynamics of Zooplankton Community at the Different Stages of an Antropogeneous Eutrophication. Acta Biotheoretica, 61 (4), 449–465. doi: 10.1007/s10441-013-9184-6
- Zholtkevych, G., Nosov, K., Bespalov, Yu., Rak, L., Vysotskaya, E., Balkova, Y., Kolomiychenko, V. (2016) Descriptive Models of System Dynamics. Proceedings of the 12th International Conference on ICT in Education, Research and Industrial Applications. Integration, Harmonization and Knowledge Transfer, 1614, 57–72
- Carmichael, W. W. (2001). Health Effects of Toxin-Producing Cyanobacteria: “The CyanoHABs.” Human and Ecological Risk Assessment: An International Journal, 7 (5), 1393–1407. doi: 10.1080/20018091095087
- Zohary, T., Robarts, R. D. (1990). Hyperscums and the population dynamics of Microcystis aeruginosa. Journal of Plankton Research, 12 (2), 423–432. doi: 10.1093/plankt/12.2.423
- Karabashev, G. S., Evdoshenko, M. A. (2015). Spektral'nye priznaki cveteniya cianobakterij v Baltijskom more po dannym skanera MODIS. Sovremennye problemy distancionnogo zondirovaniya Zemli iz kosmosa, 12 (3), 158–170.
- Vysotska, O. V., Bespalov, Yu. G., Pecherska, A. I., Parvadov, D. A. (2016). Using of margalef succession model in remote detection technologies for indications of human impact on vegetation cover. Radioelektronni i komp'uterni sistemi, 2 (76), 15–19.
- Vysotskaya, E. V., Zholtkevych, G. N., Klochko, T. A., Bespalov, Yu. G., Nosov, K. V. (2016). Unmasking the soil cover’s disruption by modeling the dynamics of ground vegetation parameters. Visnyk Natsionalnoho Tekhnichnoho Universytetu Ukrayiny «KPI». Seriya – Radiotekhnika. Radioaparatobuduvannya, 64, 101–109.
- Shulika, B., Porvan, A., Vysotska, O., Nekos, A., Zhemerov, A. (2017). Control over grape yield in the north-eastern region of Ukraine using mathematical modeling. Eastern-European Journal of Enterprise Technologies, 2 (3 (86)), 51–59. doi: 10.15587/1729-4061.2017.97969
- De Menezes, F. S., Liska, G. R., Cirillo, M. A., Vivanco, M. J. F. (2017). Data classification with binary response through the Boosting algorithm and logistic regression. Expert Systems with Applications, 69, 62–73. doi: 10.1016/j.eswa.2016.08.014
- Pierola, A., Epifanio, I., Alemany, S. (2016). An ensemble of ordered logistic regression and random forest for child garment size matching. Computers & Industrial Engineering, 101, 455–465. doi: 10.1016/j.cie.2016.10.013
- Ghattas, B., Michel, P., Boyer, L. (2017). Clustering nominal data using unsupervised binary decision trees: Comparisons with the state of the art methods. Pattern Recognition, 67, 177–185. doi: 10.1016/j.patcog.2017.01.031
- Yamamoto, M., Hayashi, K. (2015). Clustering of multivariate binary data with dimension reduction via L1-regularized likelihood maximization. Pattern Recognition, 48 (12), 3959–3968. doi: 10.1016/j.patcog.2015.05.026
- Katahira, K. (2016). How hierarchical models improve point estimates of model parameters at the individual level. Journal of Mathematical Psychology, 73, 37–58. doi: 10.1016/j.jmp.2016.03.007
- Erkan Ozkaya, H., Dabas, C., Kolev, K., Hult, G. T. M., Dahlquist, S. H., Manjeshwar, S. A. (2013). An assessment of hierarchical linear modeling in international business, management, and marketing. International Business Review, 22 (4), 663–677. doi: 10.1016/j.ibusrev.2012.10.002
- Van Oirbeek, R., Lesaffre, E. (2012). Assessing the predictive ability of a multilevel binary regression model. Computational Statistics & Data Analysis, 56 (6), 1966–1980. doi: 10.1016/j.csda.2011.11.023
- Asar, Ö., İlk, Ö. (2014). Flexible multivariate marginal models for analyzing multivariate longitudinal data, with applications in R. Computer Methods and Programs in Biomedicine, 115 (3), 135–146. doi: 10.1016/j.cmpb.2014.04.005
- Zou, G., Donner, A. (2011). Extension of the modified Poisson regression model to prospective studies with correlated binary data. Statistical Methods in Medical Research, 22 (6), 661–670. doi: 10.1177/0962280211427759
- Schildcrout, J. S., Heagerty, P. J. (2006). Marginalized Models for Moderate to Long Series of Longitudinal Binary Response Data. Biometrics, 63 (2), 322–331. doi: 10.1111/j.1541-0420.2006.00680.x
- Kupek, E. (2006). Beyond logistic regression: structural equations modelling for binary variables and its application to investigating unobserved confounders. BMC Medical Research Methodology, 6 (1). doi: 10.1186/1471-2288-6-13
- Rhemtulla, M., Brosseau-Liard, P. É., Savalei, V. (2012). When can categorical variables be treated as continuous? A comparison of robust continuous and categorical SEM estimation methods under suboptimal conditions. Psychological Methods, 17 (3), 354–373. doi: 10.1037/a0029315
- Pearl, J. (2009). Causal inference in statistics: An overview. Statistics Surveys, 3, 96–146. doi: 10.1214/09-ss057
- Bollen, K. A., Pearl, J. (2013). Eight Myths About Causality and Structural Equation Models. Handbook of Causal Analysis for Social Research, 301–328. doi: 10.1007/978-94-007-6094-3_15
- LANCE-MODIS Collection 6. Available at: https://lance3.modaps.eosdis.nasa.gov
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2017 Konstantin Nosov, Grygoriy Zholtkevych, Olena Vysotska, Marine Georgiyants, Yuriі Balym, Andrеi Porvan
This work is licensed under a Creative Commons Attribution 4.0 International License.
The consolidation and conditions for the transfer of copyright (identification of authorship) is carried out in the License Agreement. In particular, the authors reserve the right to the authorship of their manuscript and transfer the first publication of this work to the journal under the terms of the Creative Commons CC BY license. At the same time, they have the right to conclude on their own additional agreements concerning the non-exclusive distribution of the work in the form in which it was published by this journal, but provided that the link to the first publication of the article in this journal is preserved.
A license agreement is a document in which the author warrants that he/she owns all copyright for the work (manuscript, article, etc.).
The authors, signing the License Agreement with TECHNOLOGY CENTER PC, have all rights to the further use of their work, provided that they link to our edition in which the work was published.
According to the terms of the License Agreement, the Publisher TECHNOLOGY CENTER PC does not take away your copyrights and receives permission from the authors to use and dissemination of the publication through the world's scientific resources (own electronic resources, scientometric databases, repositories, libraries, etc.).
In the absence of a signed License Agreement or in the absence of this agreement of identifiers allowing to identify the identity of the author, the editors have no right to work with the manuscript.
It is important to remember that there is another type of agreement between authors and publishers – when copyright is transferred from the authors to the publisher. In this case, the authors lose ownership of their work and may not use it in any way.