Clustering based sentiment analysis on Twitter data for COVID-19 vaccines in India

https://doi.org/10.53730/ijhs.v6nS2.6126

Authors

  • Ponmani K Bharathiar University, Coimbatore, India
  • Thangaraj M Madurai Kamaraj University, Madurai, India

Keywords:

COVID-19 vaccines, improved random forest, machine learning, MEEM, sentiment analysis

Abstract

Coronavirus is a new and rapidly spreading viral disease. It is essential to have a vaccine in order to reduce the virus's impact. Vaccination-related sentiments can influence an individual's decision to accept the vaccines. Evaluating the sentiments is a time-consuming and challenging process. Sentiment analysis (SA) could have an impact on the vaccination initiatives as well as changes in people's opinions and behaviour around immunizations. Since social media is widely utilized to disseminate information, mining this data is a popular area of study these days. On Twitter, a wide range of opinions about the negative effects of licensed vaccines have been expressed over time. In this research, tweets are gathered, pre-processed to remove extraneous data, and then utilized for sentiments analysis utilizing the Lexicons-based technique and machine learning. After feature extraction, the clustering is performed using MEEM approach. This research proposed a Clustering Based Twitter sentiments analysis of COVID 19 (C-SAT COVID 19) vaccinations in India. An enhanced random forest classifier is implemented in this research to classify the sentiment scores provided by the sentiment analysis. A classification is performed based on the negative, neutral, and positive  sentiment analysis to examine people's emotions towards vaccinations accessible in India. 

Downloads

Download data is not yet available.

References

Ansari, M. T., & Khan, N. A. (2021). Worldwide COVID-19 Vaccines Sentiment Analysis Through Twitter Content. Electronic Journal of General Medicine, 18.

Augustyniak, Ł., Szymański, P., Kajdanowicz, T., & Tuligłowicz, W. (2015). Comprehensive study on lexicon-based ensemble classification sentiment analysis. Entropy, 18, 4.

Chaudhri, A. A., Saranya, S. S., & Dubey, S. (2021). Implementation paper on analyzing COVID-19 vaccines on twitter dataset using tweepy and text blob. Annals of the Romanian Society for Cell Biology, 8393–8396.

Fauzi, M. A. (2018). Random Forest Approach fo Sentiment Analysis in Indonesian. Indonesian Journal of Electrical Engineering and Computer Science, 12, 46–50.

Hananto, A. R., Rahayu, S. A., & Hariguna, T. (2022). COVID-19 Vaccination: A Retrospective Observation and Sentiment Analysis of the Twitter Social Media Platform in Indonesia. International Journal of Informatics and Information Systems, 5, 56–68.

Hasan, A., Moin, S., Karim, A., & Shamshirband, S. (2018). Machine learning-based sentiment analysis for twitter accounts. Mathematical and Computational Applications, 23, 11.

Jalil, Z., Abbasi, A., Javed, A. R., Khan, M. B., Hasanat, M. H., Malik, K. M., et al. (2021). COVID-19 Related Sentiment Analysis Using State-of-the-Art Machine Learning and Deep Learning Techniques. Frontiers in Public Health, 9.

Jianqiang, Z., Xiaolin, G., & Xuejun, Z. (2018). Deep convolution neural networks for twitter sentiment analysis. IEEE Access, 6, 23253–23260.

Kaur, H., Ahsaan, S. U., Alankar, B., & Chang, V. (2021). A proposed sentiment analysis deep learning algorithm for analyzing COVID-19 tweets. Information Systems Frontiers, 23, 1417–1429.

Kausar, M. A., Soosaimanickam, A., & Nasar, M. (2021). Public sentiment analysis on Twitter data during COVID-19 outbreak. Int. J. Adv. Comput. Sci. Appl, 12, 415–422.

Narasamma, V. L., & Sreedevi, M. (2021). Twitter based Data Analysis in Natural Language Processing using a Novel Catboost Recurrent Neural Framework. International Journal of Advanced Computer Science and Applications, 440–447.

Neogi, A. S., Garg, K. A., Mishra, R. K., & Dwivedi, Y. K. (2021). Sentiment analysis and classification of Indian farmers’ protest using twitter data. International Journal of Information Management Data Insights, 1, 100019.

Paliwal, S., Parveen, S., Afshar Alam, M., & Ahmed, J. (2022). Sentiment Analysis of COVID-19 Vaccine Rollout in India. In ICT Systems and Sustainability (pp. 21–33). Springer.

Ramamoorthy, T., Karmegam, D., & Mappillairaju, B. (2021). Use of social media data for disease based social network analysis and network modeling: A Systematic Review. Informatics for Health and Social Care, 46, 443–454.

Robnik-Šikonja, M. (2004). Improving random forests. European conference on machine learning, (pp. 359–370).

Scannell, D., Desens, L., Guadagno, M., Tra, Y., Acker, E., Sheridan, K., et al. (2021). COVID-19 vaccine discourse on Twitter: A content analysis of persuasion techniques, sentiment and mis/disinformation. Journal of health communication, 26, 443–459.

Shamrat, F. M., Chakraborty, S., Imran, M. M., Muna, J. N., Billah, M. M., Das, P., et al. (2021). Sentiment analysis on twitter tweets about COVID-19 vaccines using NLP and supervised KNN classification algorithm. Indones. J. Electr. Eng. Comput. Sci, 23.

Singh, G. (2021). Sentiment Analysis of Code-Mixed Social Media Text (Hinglish). arXiv preprint arXiv:2102.12149.

Villavicencio, C., Macrohon, J. J., Inbaraj, X. A., Jeng, J.-H., & Hsieh, J.-G. (2021). Twitter sentiment analysis towards covid-19 vaccines in the Philippines using naı̈ve bayes. Information, 12, 204.

Zhang, L., Fan, H., Peng, C., Rao, G., & Cong, Q. (2020). Sentiment analysis methods for hpv vaccines related tweets based on transfer learning. Healthcare, 8, p. 307.

Published

16-04-2022

How to Cite

Ponmani, K., & Thangaraj, M. (2022). Clustering based sentiment analysis on Twitter data for COVID-19 vaccines in India. International Journal of Health Sciences, 6(S2), 4732–4748. https://doi.org/10.53730/ijhs.v6nS2.6126

Issue

Section

Peer Review Articles