METRICS FOR EVALUATING CONSISTENCY IN DISTRIBUTED DATASTORES

Authors

DOI:

https://doi.org/10.30837/2522-9818.2020.12.040

Keywords:

distributed datastore, response time, CAP theorem, stochastic consistency metric, methods of building distributed networks

Abstract

The subject of this paper is metrics for evaluating the consistency of a distributed datastore, consistency being one of the main CAP guarantees and, more precisely, a criterion for a reliable distributed datastore. The goal of the research is to investigate whether a program for such evaluation can be developed at an early stage of building a distributed network, and to build some components of a decision-making algorithm whose purpose is to construct an optimal network topology. This decision-making algorithm should be suitable for any business model and its requirements. To that end, the following tasks were completed: a mathematical model of a stochastic metric for consistency in a distributed datastore was built, and the conditions governing consistency convergence time were investigated in an initially perfect datastore environment. The methods used are the theory of integer partitions, the basics of graph theory and probability theory, computer modeling, and a program for running sets of experiments. As a result, it is established that, in the absence of data loss, the consistency convergence time after the first write request is less than or equal to the diameter of the graph that represents the topology of the distributed network. This convergence time has the same unit of measure as the link cost of each link in the network. A stochastic model is also proposed for a metric to evaluate consistency. In conclusion, this metric makes it possible to investigate or monitor the current state of the system in a given time interval. The research forms the basis for some elements of a decision-making algorithm for building the topology of a distributed network and for elements of an algorithm for monitoring such a system. In addition, based on trends in the frequency of data modification and read requests, a strategy for allocating nodes in the topology is suggested, which can improve the response time and the speed at which the distributed storage converges to a fully consistent state or one close to it.
The practical role of the components of the decision-making algorithm is that a network architect could apply the algorithm at the stage of building the network for a distributed database, so that the CAP characteristics are optimized in the context of specific business needs. The mathematical model for the stochastic metric of distributed storage consistency can be applied both at the system design stage, to test whether the level of consistency is satisfactory, and at the system operation stage, as a component of the network monitoring system.
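
The bound stated in the abstract (convergence time after the first write is at most the graph diameter) can be illustrated with a minimal sketch. This is not the paper's program. It assumes that an update spreads from the writing node to every replica along cheapest paths, that no data is lost, and that link costs are symmetric and non-negative. The topology `TOPOLOGY` and the function names `shortest_times`, `convergence_time`, and `diameter` are hypothetical and chosen only for illustration.

```python
import heapq

# Hypothetical 4-node topology: node -> {neighbour: link cost}.
# The costs are illustrative assumptions, not data from the paper.
TOPOLOGY = {
    "a": {"b": 2, "c": 5},
    "b": {"a": 2, "c": 1, "d": 4},
    "c": {"a": 5, "b": 1, "d": 1},
    "d": {"b": 4, "c": 1},
}


def shortest_times(graph, source):
    """Dijkstra: earliest time an update written at `source` reaches each node."""
    dist = {source: 0}
    heap = [(0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry, a cheaper path was already found
        for v, cost in graph[u].items():
            nd = d + cost
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist


def convergence_time(graph, source):
    """Time until every replica holds the value written at `source`."""
    times = shortest_times(graph, source)
    if len(times) != len(graph):
        raise ValueError("topology is not connected")
    return max(times.values())


def diameter(graph):
    """Weighted diameter: the largest cheapest-path cost between any two nodes."""
    return max(convergence_time(graph, node) for node in graph)


if __name__ == "__main__":
    d = diameter(TOPOLOGY)
    for node in TOPOLOGY:
        # Under the stated assumptions this never exceeds the diameter.
        print(node, convergence_time(TOPOLOGY, node), "<=", d)
```

Under these assumptions the convergence time for a write at a given node is that node's eccentricity, which by definition cannot exceed the diameter. Both quantities are sums of link costs, so they share the link-cost unit, as the abstract notes.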

Author Biography

Galyna Zholtkevych, Private Entrepreneur

Researcher, Software Engineer

Published

2020-06-24

How to Cite

Zholtkevych, G. (2020). METRICS FOR EVALUATING CONSISTENCY IN DISTRIBUTED DATASTORES. INNOVATIVE TECHNOLOGIES AND SCIENTIFIC SOLUTIONS FOR INDUSTRIES, (2 (12)), 40–48. https://doi.org/10.30837/2522-9818.2020.12.040

Section

INFORMATION TECHNOLOGY