IMPLEMENTATION OF THE INTELLECTUAL SYSTEM OF SENTIMENT ANALYSIS AND CLUSTERIZATION OF PUBLICATIONS IN THE TWITTER SOCIAL NETWORK
DOI:
https://doi.org/10.30837/ITSSI.2023.23.025Keywords:
neural network; LSTM; sentiment analysis of publications; cluster analysis; social network TwitterAbstract
Thanks to the intensive development of social networks, the intensity of exchange of short electronic text messages is constantly increasing, the tone of which can serve as a sensitive indicator of public mood and important social phenomena, interesting for sociologists, politicians, economists, and specialists in other fields. In this regard, the task of automating the processing of such natural language messages is of significant scientific and practical interest. The object of this study is the sentiment of user publications in the Twitter social network. Due to the great popularity of the social network itself and the large number of user messages, which are short in nature, it is possible to conveniently determine the mood of user posts and combine them into clusters according to the given parameters of the intelligent system. The subject of the study is methods and algorithms for analysing the sentiment of large arrays of messages containing the necessary keywords and relating to a certain specific topic, determining the factors and distributions of the sentiment of messages based on the input array of system data, dividing messages into main groups and providing estimates within certain defined limits in to each group, division into clusters according to the obtained search point and display of the obtained results in the desired format. The purpose of the work is to implement an intelligent system of sentiment analysis and clustering of publications based on a recurrent neural network of long short-term memory (LSTM) and the k-means clustering algorithm. The following main tasks are solved in the work: 1. To analyse the most used and newest algorithms, methods, approaches and means of implementing tasks of sentiment analysis and clustering of publications in social networks. 2. To develop a conceptual structure of an intellectual system of sentiment analysis and clustering of publications. 3. To form functional tasks for the key modules of the created intelligent system of sentiment analysis and clustering of publications in the Twitter social network. 4. Implement an intelligent system of sentiment analysis and clustering of publications based on a recurrent neural network and the k-means clustering algorithm and conduct experimental verification. Among the methods used for this purpose are the recurrent neural network of long short-term memory; k-means clustering algorithm. The following results were obtained: the general structure of the intellectual system of sentiment analysis and clustering of publications was analyzed, designed and built. The main task of creating the system, first of all, was to improve the recurrent neural network of long-short-term memory, which, thanks to the improved algorithm, significantly facilitates text processing by natural language processors according to text data of a certain size. Also, a special clustering algorithm, namely k-means, was used in parallel, thanks to which it was possible to change the general approach to clustering and the creation of final clusters, in accordance with the obtained results of the work of the recurrent neural network. Conclusions: As a result of applying a combination of LSTM neural network and k-means clustering algorithm, it was possible to speed up the process of sentiment analysis and clustering of posts in the Twitter social network by 10...15% compared to similar convolutional neural networks and hierarchical clustering.
References
Список літератури
Almahmood R. J. K., Tekerek A. Issues and Solutions in Deep Learning-Enabled Recommendation Systems within the E-Commerce Field. Applied Sciences. 2022. № 12 (21). Р. 256–264. DOI: https://doi.org/10.3390/app122111256
Xie W., Damiano L., Jong C.-H. Emotional appeals and social support in organizational YouTube videos during COVID-19. Telematics and Informatics reports. 2022. № 8 (1). Р. 100–128.
Abbas A. F., Jusoh A., Mas’od A., Alsharif A. H., Ali J. Bibliometrix analysis of information sharing in social media. Cogent Business & Management. 2022. № 9 (1). Р. 521–543. DOI: https://doi.org/10.1080/23311975.2021.2016556
Villegas-Ch. W., Erazo D. M., Ortiz-Garces I., Gaibor-Naranjo W., Palacios-Pacheco X. Artificial Intelligence Model for the Identification of the Personality of Twitter Users through the Analysis of Their Behavior in the Social Network. Electronics. 2022. № 11 (22). Р. 381–399. DOI: https://doi.org/10.3390/electronics11223811
Malkawi R., Daradkeh M., El-Hassan A., Petrov P. A Semantic Similarity-Based Identification Method for Implicit Citation Functions and Sentiments Information. Information. 2022. № 13 (11). Р. 546–561. DOI: https://doi.org/10.3390/info13110546
Yuan Y., You T., Xu T., Yu X. Customer-Oriented Strategic Planning for Hotel Competitiveness Improvement Based on Online Reviews. Sustainability. 2022. № 14 (22). Р. 152–199.
Yin J. Y. B., Saad N. H. M., Yaacob Z. Exploring Sentiment Analysis on E-Commerce Business: Lazada and Shopee. Tem journal. 2022. № 11 (4). Р. 1508–1519. DOI: https://doi.org/10.18421/TEM114-11
Hinduja S., Afrin M., Mistry S., Krishna A. Machine learning-based proactive social-sensor service for mental health monitoring using twitter data. International journal of Information Management Data insights. 2022. № 2 (2). Р. 103–124.
Bhadamkar A., Bhattacharya S. Tesla Inc. Stock Prediction Using Sentiment Analysis. Australasian Accounting, Business and Finance journal. 2022. № 16 (5). Р. 52–66. DOI: https://doi.org/10.14453/aabfj.v16i5.05
Alhakiem H. R., Setiawan E. B. Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion. Jurnal resti (Rekayasa sistem dan Teknologi Informasi). 2022. № 6 (5). Р. 840–846. DOI: https://doi.org/10.29207/resti.v6i5.4429
Pawełoszek I. Towards a Smart City—The Study of Car-Sharing Services in Poland. Energies. 2022. № 15 (22). Р 845–859. DOI: https://doi.org/10.3390/en15228459
Huang X., Gong P., Wang S., White M., Zhang B. Machine Learning Modeling of Vitality Characteristics in Historical Preservation Zones with Multi-Source Data. Buildings. 2022. № 12 (11). Р. 1978–1989. DOI: https://doi.org/10.3390/buildings12111978
Li C., Renda M., Yusuf F., Geller J., Chun S. A. Public Health Policy Monitoring through Public Perceptions: A Case of COVID-19 Tweet Analysis. Information. 2022. № 13 (11). Р. 443–457. DOI: https://doi.org/10.3390/info13110543
Vysotska V. Information Technology for Internet Resources Promotion in Search Systems Based on Content Analysis of Web-Page Keywords. Radio Electronics, Computer Science, Control. 2021. № 3. Р. 133–151.
Corti L., Zanetti M., Tricella G., Bonati M. Social media analysis of Twitter tweets related to ASD in 2019–2020, with particular attention to COVID-19: topic modelling and sentiment analysis. Journal of Big Data. 2022. № 9 (1). Р. 1–17. DOI: https://doi.org/10.1186/s40537-022-00666-4
Lampropoulos G., Keramopoulos E. Virtual Reality in Education: A Comparative Social Media Data and Sentiment Analysis Study. International journal of recent contributions from Engineering, Science & IT. 2007. № 10 (3). Р. 221–235. DOI: https://doi.org/10.3991/ijes.v10i03.34057
Liu H. Online review analysis on various networks’ consumer feedback using deep learning. IET networks. 2022. № 11 (6). Р. 234–244. DOI: https://doi.org/10.1049/ntw2.12045
Wang Y., Chen Z., Fu C. Synergy Masks of Domain Attribute Model DaBERT: Emotional Tracking on Time-Varying Virtual Space Communication. Sensors. 2022. № 22 (21). Р. 450–471. DOI: https://doi.org/10.3390/s22218450
References
Almahmood R. J. K., Tekerek A. Issues and Solutions in Deep Learning-Enabled Recommendation Systems within the E-Commerce Field. Applied Sciences. 2022. № 12 (21). Р. 256–264. DOI: https://doi.org/10.3390/app122111256
Xie W., Damiano L., Jong C.-H. Emotional appeals and social support in organizational YouTube videos during COVID-19. Telematics and Informatics reports. 2022. № 8 (1). Р. 100–128.
Abbas A. F., Jusoh A., Mas’od A., Alsharif A. H., Ali J. Bibliometrix analysis of information sharing in social media. Cogent Business & Management. 2022. № 9 (1). Р. 521–543. DOI: https://doi.org/10.1080/23311975.2021.2016556
Villegas-Ch. W., Erazo D. M., Ortiz-Garces I., Gaibor-Naranjo W., Palacios-Pacheco X. Artificial Intelligence Model for the Identification of the Personality of Twitter Users through the Analysis of Their Behavior in the Social Network. Electronics. 2022. № 11 (22). Р. 381–399. DOI: https://doi.org/10.3390/electronics11223811
Malkawi R., Daradkeh M., El-Hassan A., Petrov P. A Semantic Similarity-Based Identification Method for Implicit Citation Functions and Sentiments Information. Information. 2022. № 13 (11). Р. 546–561. DOI: https://doi.org/10.3390/info13110546
Yuan Y., You T., Xu T., Yu X. Customer-Oriented Strategic Planning for Hotel Competitiveness Improvement Based on Online Reviews. Sustainability. 2022. № 14 (22). Р. 152–199.
Yin J. Y. B., Saad N. H. M., Yaacob Z. Exploring Sentiment Analysis on E-Commerce Business: Lazada and Shopee. Tem journal. 2022. № 11 (4). Р. 1508–1519. DOI: https://doi.org/10.18421/TEM114-11
Hinduja S., Afrin M., Mistry S., Krishna A. Machine learning-based proactive social-sensor service for mental health monitoring using twitter data. International journal of Information Management Data insights. 2022. № 2 (2). Р. 103–124.
Bhadamkar A., Bhattacharya S. Tesla Inc. Stock Prediction Using Sentiment Analysis. Australasian Accounting, Business and Finance journal. 2022. № 16 (5). Р. 52–66. DOI: https://doi.org/10.14453/aabfj.v16i5.05
Alhakiem H. R., Setiawan E. B. Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion. Jurnal resti (Rekayasa sistem dan Teknologi Informasi). 2022. № 6 (5). Р. 840–846. DOI: https://doi.org/10.29207/resti.v6i5.4429
Pawełoszek I. Towards a Smart City—The Study of Car-Sharing Services in Poland. Energies. 2022. № 15 (22). Р 845–859. DOI: https://doi.org/10.3390/en15228459
Huang X., Gong P., Wang S., White M., Zhang B. Machine Learning Modeling of Vitality Characteristics in Historical Preservation Zones with Multi-Source Data. Buildings. 2022. № 12 (11). Р. 1978–1989. DOI: https://doi.org/10.3390/buildings12111978
Li C., Renda M., Yusuf F., Geller J., Chun S. A. Public Health Policy Monitoring through Public Perceptions: A Case of COVID-19 Tweet Analysis. Information. 2022. № 13 (11). Р. 443–457. DOI: https://doi.org/10.3390/info13110543
Vysotska V. Information Technology for Internet Resources Promotion in Search Systems Based on Content Analysis of Web-Page Keywords. Radio Electronics, Computer Science, Control. 2021. № 3. Р. 133–151.
Corti L., Zanetti M., Tricella G., Bonati M. Social media analysis of Twitter tweets related to ASD in 2019–2020, with particular attention to COVID-19: topic modelling and sentiment analysis. Journal of Big Data. 2022. № 9 (1). Р. 1–17. DOI: https://doi.org/10.1186/s40537-022-00666-4
Lampropoulos G., Keramopoulos E. Virtual Reality in Education: A Comparative Social Media Data and Sentiment Analysis Study. International journal of recent contributions from Engineering, Science & IT. 2007. № 10 (3). Р. 221–235. DOI: https://doi.org/10.3991/ijes.v10i03.34057
Liu H. Online review analysis on various networks’ consumer feedback using deep learning. IET networks. 2022. № 11 (6). Р. 234–244. DOI: https://doi.org/10.1049/ntw2.12045
Wang Y., Chen Z., Fu C. Synergy Masks of Domain Attribute Model DaBERT: Emotional Tracking on Time-Varying Virtual Space Communication. Sensors. 2022. № 22 (21). Р. 450–471. DOI: https://doi.org/10.3390/s22218450
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Our journal abides by the Creative Commons copyright rights and permissions for open access journals.
Authors who publish with this journal agree to the following terms:
Authors hold the copyright without restrictions and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-commercial and non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their published work online (e.g., in institutional repositories or on their website) as it can lead to productive exchanges, as well as earlier and greater citation of published work.