IMPLEMENTATION OF THE INTELLECTUAL SYSTEM OF SENTIMENT ANALYSIS AND CLUSTERIZATION OF PUBLICATIONS IN THE TWITTER SOCIAL NETWORK

Authors

DOI:

https://doi.org/10.30837/ITSSI.2023.23.025

Keywords:

neural network; LSTM; sentiment analysis of publications; cluster analysis; social network Twitter

Abstract

Thanks to the intensive development of social networks, the intensity of exchange of short electronic text messages is constantly increasing, the tone of which can serve as a sensitive indicator of public mood and important social phenomena, interesting for sociologists, politicians, economists, and specialists in other fields. In this regard, the task of automating the processing of such natural language messages is of significant scientific and practical interest. The object of this study is the sentiment of user publications in the Twitter social network. Due to the great popularity of the social network itself and the large number of user messages, which are short in nature, it is possible to conveniently determine the mood of user posts and combine them into clusters according to the given parameters of the intelligent system. The subject of the study is methods and algorithms for analysing the sentiment of large arrays of messages containing the necessary keywords and relating to a certain specific topic, determining the factors and distributions of the sentiment of messages based on the input array of system data, dividing messages into main groups and providing estimates within certain defined limits in to each group, division into clusters according to the obtained search point and display of the obtained results in the desired format. The purpose of the work is to implement an intelligent system of sentiment analysis and clustering of publications based on a recurrent neural network of long short-term memory (LSTM) and the k-means clustering algorithm. The following main tasks are solved in the work: 1. To analyse the most used and newest algorithms, methods, approaches and means of implementing tasks of sentiment analysis and clustering of publications in social networks. 2. To develop a conceptual structure of an intellectual system of sentiment analysis and clustering of publications. 3. To form functional tasks for the key modules of the created intelligent system of sentiment analysis and clustering of publications in the Twitter social network. 4. Implement an intelligent system of sentiment analysis and clustering of publications based on a recurrent neural network and the k-means clustering algorithm and conduct experimental verification. Among the methods used for this purpose are the recurrent neural network of long short-term memory; k-means clustering algorithm. The following results were obtained: the general structure of the intellectual system of sentiment analysis and clustering of publications was analyzed, designed and built. The main task of creating the system, first of all, was to improve the recurrent neural network of long-short-term memory, which, thanks to the improved algorithm, significantly facilitates text processing by natural language processors according to text data of a certain size. Also, a special clustering algorithm, namely k-means, was used in parallel, thanks to which it was possible to change the general approach to clustering and the creation of final clusters, in accordance with the obtained results of the work of the recurrent neural network. Conclusions: As a result of applying a combination of LSTM neural network and k-means clustering algorithm, it was possible to speed up the process of sentiment analysis and clustering of posts in the Twitter social network by 10...15% compared to similar convolutional neural networks and hierarchical clustering.

Author Biographies

Taras Batiuk, Lviv Polytechnic National University

Postgraduate Student of the Information Systems and Networks Department

Dmytro Dosyn, Lviv Polytechnic National University

Doctor of Sciences (Engineering)

References

Список літератури

Almahmood R. J. K., Tekerek A. Issues and Solutions in Deep Learning-Enabled Recommendation Systems within the E-Commerce Field. Applied Sciences. 2022. № 12 (21). Р. 256–264. DOI: https://doi.org/10.3390/app122111256

Xie W., Damiano L., Jong C.-H. Emotional appeals and social support in organizational YouTube videos during COVID-19. Telematics and Informatics reports. 2022. № 8 (1). Р. 100–128.

Abbas A. F., Jusoh A., Mas’od A., Alsharif A. H., Ali J. Bibliometrix analysis of information sharing in social media. Cogent Business & Management. 2022. № 9 (1). Р. 521–543. DOI: https://doi.org/10.1080/23311975.2021.2016556

Villegas-Ch. W., Erazo D. M., Ortiz-Garces I., Gaibor-Naranjo W., Palacios-Pacheco X. Artificial Intelligence Model for the Identification of the Personality of Twitter Users through the Analysis of Their Behavior in the Social Network. Electronics. 2022. № 11 (22). Р. 381–399. DOI: https://doi.org/10.3390/electronics11223811

Malkawi R., Daradkeh M., El-Hassan A., Petrov P. A Semantic Similarity-Based Identification Method for Implicit Citation Functions and Sentiments Information. Information. 2022. № 13 (11). Р. 546–561. DOI: https://doi.org/10.3390/info13110546

Yuan Y., You T., Xu T., Yu X. Customer-Oriented Strategic Planning for Hotel Competitiveness Improvement Based on Online Reviews. Sustainability. 2022. № 14 (22). Р. 152–199.

Yin J. Y. B., Saad N. H. M., Yaacob Z. Exploring Sentiment Analysis on E-Commerce Business: Lazada and Shopee. Tem journal. 2022. № 11 (4). Р. 1508–1519. DOI: https://doi.org/10.18421/TEM114-11

Hinduja S., Afrin M., Mistry S., Krishna A. Machine learning-based proactive social-sensor service for mental health monitoring using twitter data. International journal of Information Management Data insights. 2022. № 2 (2). Р. 103–124.

Bhadamkar A., Bhattacharya S. Tesla Inc. Stock Prediction Using Sentiment Analysis. Australasian Accounting, Business and Finance journal. 2022. № 16 (5). Р. 52–66. DOI: https://doi.org/10.14453/aabfj.v16i5.05

Alhakiem H. R., Setiawan E. B. Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion. Jurnal resti (Rekayasa sistem dan Teknologi Informasi). 2022. № 6 (5). Р. 840–846. DOI: https://doi.org/10.29207/resti.v6i5.4429

Pawełoszek I. Towards a Smart City—The Study of Car-Sharing Services in Poland. Energies. 2022. № 15 (22). Р 845–859. DOI: https://doi.org/10.3390/en15228459

Huang X., Gong P., Wang S., White M., Zhang B. Machine Learning Modeling of Vitality Characteristics in Historical Preservation Zones with Multi-Source Data. Buildings. 2022. № 12 (11). Р. 1978–1989. DOI: https://doi.org/10.3390/buildings12111978

Li C., Renda M., Yusuf F., Geller J., Chun S. A. Public Health Policy Monitoring through Public Perceptions: A Case of COVID-19 Tweet Analysis. Information. 2022. № 13 (11). Р. 443–457. DOI: https://doi.org/10.3390/info13110543

Vysotska V. Information Technology for Internet Resources Promotion in Search Systems Based on Content Analysis of Web-Page Keywords. Radio Electronics, Computer Science, Control. 2021. № 3. Р. 133–151.

Corti L., Zanetti M., Tricella G., Bonati M. Social media analysis of Twitter tweets related to ASD in 2019–2020, with particular attention to COVID-19: topic modelling and sentiment analysis. Journal of Big Data. 2022. № 9 (1). Р. 1–17. DOI: https://doi.org/10.1186/s40537-022-00666-4

Lampropoulos G., Keramopoulos E. Virtual Reality in Education: A Comparative Social Media Data and Sentiment Analysis Study. International journal of recent contributions from Engineering, Science & IT. 2007. № 10 (3). Р. 221–235. DOI: https://doi.org/10.3991/ijes.v10i03.34057

Liu H. Online review analysis on various networks’ consumer feedback using deep learning. IET networks. 2022. № 11 (6). Р. 234–244. DOI: https://doi.org/10.1049/ntw2.12045

Wang Y., Chen Z., Fu C. Synergy Masks of Domain Attribute Model DaBERT: Emotional Tracking on Time-Varying Virtual Space Communication. Sensors. 2022. № 22 (21). Р. 450–471. DOI: https://doi.org/10.3390/s22218450

References

Almahmood R. J. K., Tekerek A. Issues and Solutions in Deep Learning-Enabled Recommendation Systems within the E-Commerce Field. Applied Sciences. 2022. № 12 (21). Р. 256–264. DOI: https://doi.org/10.3390/app122111256

Xie W., Damiano L., Jong C.-H. Emotional appeals and social support in organizational YouTube videos during COVID-19. Telematics and Informatics reports. 2022. № 8 (1). Р. 100–128.

Abbas A. F., Jusoh A., Mas’od A., Alsharif A. H., Ali J. Bibliometrix analysis of information sharing in social media. Cogent Business & Management. 2022. № 9 (1). Р. 521–543. DOI: https://doi.org/10.1080/23311975.2021.2016556

Villegas-Ch. W., Erazo D. M., Ortiz-Garces I., Gaibor-Naranjo W., Palacios-Pacheco X. Artificial Intelligence Model for the Identification of the Personality of Twitter Users through the Analysis of Their Behavior in the Social Network. Electronics. 2022. № 11 (22). Р. 381–399. DOI: https://doi.org/10.3390/electronics11223811

Malkawi R., Daradkeh M., El-Hassan A., Petrov P. A Semantic Similarity-Based Identification Method for Implicit Citation Functions and Sentiments Information. Information. 2022. № 13 (11). Р. 546–561. DOI: https://doi.org/10.3390/info13110546

Yuan Y., You T., Xu T., Yu X. Customer-Oriented Strategic Planning for Hotel Competitiveness Improvement Based on Online Reviews. Sustainability. 2022. № 14 (22). Р. 152–199.

Yin J. Y. B., Saad N. H. M., Yaacob Z. Exploring Sentiment Analysis on E-Commerce Business: Lazada and Shopee. Tem journal. 2022. № 11 (4). Р. 1508–1519. DOI: https://doi.org/10.18421/TEM114-11

Hinduja S., Afrin M., Mistry S., Krishna A. Machine learning-based proactive social-sensor service for mental health monitoring using twitter data. International journal of Information Management Data insights. 2022. № 2 (2). Р. 103–124.

Bhadamkar A., Bhattacharya S. Tesla Inc. Stock Prediction Using Sentiment Analysis. Australasian Accounting, Business and Finance journal. 2022. № 16 (5). Р. 52–66. DOI: https://doi.org/10.14453/aabfj.v16i5.05

Alhakiem H. R., Setiawan E. B. Aspect-Bas1ed Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion. Jurnal resti (Rekayasa sistem dan Teknologi Informasi). 2022. № 6 (5). Р. 840–846. DOI: https://doi.org/10.29207/resti.v6i5.4429

Pawełoszek I. Towards a Smart City—The Study of Car-Sharing Services in Poland. Energies. 2022. № 15 (22). Р 845–859. DOI: https://doi.org/10.3390/en15228459

Huang X., Gong P., Wang S., White M., Zhang B. Machine Learning Modeling of Vitality Characteristics in Historical Preservation Zones with Multi-Source Data. Buildings. 2022. № 12 (11). Р. 1978–1989. DOI: https://doi.org/10.3390/buildings12111978

Li C., Renda M., Yusuf F., Geller J., Chun S. A. Public Health Policy Monitoring through Public Perceptions: A Case of COVID-19 Tweet Analysis. Information. 2022. № 13 (11). Р. 443–457. DOI: https://doi.org/10.3390/info13110543

Vysotska V. Information Technology for Internet Resources Promotion in Search Systems Based on Content Analysis of Web-Page Keywords. Radio Electronics, Computer Science, Control. 2021. № 3. Р. 133–151.

Corti L., Zanetti M., Tricella G., Bonati M. Social media analysis of Twitter tweets related to ASD in 2019–2020, with particular attention to COVID-19: topic modelling and sentiment analysis. Journal of Big Data. 2022. № 9 (1). Р. 1–17. DOI: https://doi.org/10.1186/s40537-022-00666-4

Lampropoulos G., Keramopoulos E. Virtual Reality in Education: A Comparative Social Media Data and Sentiment Analysis Study. International journal of recent contributions from Engineering, Science & IT. 2007. № 10 (3). Р. 221–235. DOI: https://doi.org/10.3991/ijes.v10i03.34057

Liu H. Online review analysis on various networks’ consumer feedback using deep learning. IET networks. 2022. № 11 (6). Р. 234–244. DOI: https://doi.org/10.1049/ntw2.12045

Wang Y., Chen Z., Fu C. Synergy Masks of Domain Attribute Model DaBERT: Emotional Tracking on Time-Varying Virtual Space Communication. Sensors. 2022. № 22 (21). Р. 450–471. DOI: https://doi.org/10.3390/s22218450

Published

2023-04-21

How to Cite

Batiuk, T., & Dosyn, D. (2023). IMPLEMENTATION OF THE INTELLECTUAL SYSTEM OF SENTIMENT ANALYSIS AND CLUSTERIZATION OF PUBLICATIONS IN THE TWITTER SOCIAL NETWORK. INNOVATIVE TECHNOLOGIES AND SCIENTIFIC SOLUTIONS FOR INDUSTRIES, (1 (23), 25–44. https://doi.org/10.30837/ITSSI.2023.23.025