Development of multibase data storages on the basis of data and queries structuredness
DOI:
https://doi.org/10.15587/1729-4061.2015.36646
Keywords:
multibase data storages, building, data structuredness, queries, genetic algorithms, gene-based adaptation of search
Abstract
The study focuses on building multibase data storages that take into account the correlation between the properties of the data and the queries performed on them. This kind of data storage has so far neither been treated as a distinct approach nor studied. In particular, little attention has been paid to representing data in different models in order to optimize query response.
We suggest a method of designing multibase data storages on the basis of data structuredness. The method allows the source data to be placed in storage media whose data models facilitate the queries run against them. The efficiency of the designed storage is then optimized using query-processing statistics: data placement in the storage media is tuned by means of indexing, materialized views, fragmentation, and merging. We have studied the impact of the design and optimization phases on storage performance, as well as the parameters of the modified genetic algorithm, including the gene adaptation threshold.
The research has shown that applying the suggested approach increases the integral index of query processing by 10 %. The storage building time can be reduced to 50 %, which matters considerably when a storage is built for a very large amount of data. An important advantage of the approach is its flexibility: any storage media and optimization mechanisms can be used with the suggested models.
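The abstract does not give the details of the modified genetic algorithm, so the following is only a toy sketch of the general idea, not the author's method. It assumes that each gene assigns one dataset to one storage data model, that fitness is the estimated query time taken from a hypothetical cost table (`COST`), and that a gene counts as "adapted" when its cost lies within a relative `threshold` of the best cost for its dataset. Adapted genes are excluded from mutation. All names, numbers, and the exact meaning given to the threshold are assumptions made for illustration.

```python
import random

# Hypothetical data models offered by the storage media.
MODELS = ["relational", "document", "key-value"]

# Hypothetical statistics: COST[dataset][model] is the estimated total time
# of the logged queries on that dataset when it is stored under that model.
COST = [
    [4.0, 9.0, 7.0],
    [8.0, 3.0, 6.0],
    [9.0, 7.0, 2.0],
    [5.0, 6.0, 8.0],
    [7.0, 2.0, 9.0],
]


def total_cost(assignment):
    """Fitness of a chromosome: summed query cost (lower is better)."""
    return sum(COST[d][m] for d, m in enumerate(assignment))


def gene_is_adapted(dataset, model, threshold):
    """Assumed rule: a gene is adapted if its cost is within
    `threshold` (relative) of the best cost for that dataset."""
    best = min(COST[dataset])
    return COST[dataset][model] <= best * (1.0 + threshold)


def mutate(assignment, threshold, rng):
    """Re-draw only the genes that are not yet adapted."""
    return [
        m if gene_is_adapted(d, m, threshold) else rng.randrange(len(MODELS))
        for d, m in enumerate(assignment)
    ]


def crossover(a, b, rng):
    """One-point crossover of two parent chromosomes."""
    cut = rng.randrange(1, len(a))
    return a[:cut] + b[cut:]


def optimize(generations=30, population_size=12, threshold=0.1, seed=0):
    """Elitist genetic search for a dataset-to-model assignment."""
    rng = random.Random(seed)
    population = [
        [rng.randrange(len(MODELS)) for _ in COST]
        for _ in range(population_size)
    ]
    for _ in range(generations):
        population.sort(key=total_cost)
        parents = population[: population_size // 2]  # keep the best half
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng),
                   threshold, rng)
            for _ in range(population_size - len(parents))
        ]
        population = parents + children
    return min(population, key=total_cost)


if __name__ == "__main__":
    best = optimize()
    print([MODELS[m] for m in best], total_cost(best))
```

In this toy the cost of each gene is independent of the others, so the optimum could be found directly. A real fitness function built from query statistics would couple the genes, for example through joins across storage media, which is where a genetic search becomes useful.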
License
Copyright (c) 2015 Andrii Yuriiovych Yatsyshyn
This work is licensed under a Creative Commons Attribution 4.0 International License.