Theme extraction based on NLP

https://doi.org/10.53730/ijhs.v6nS5.9654

Authors

  • Pranali Rahul Dandekar Assistant Professor, Shri Ramdeobaba College of Engineering and Management, Nagpur
  • Yoginee Surendra Pethe Assistant Professor, Shri Ramdeobaba College of Engineering and Management, Nagpur

Abstract

Theme Extraction is a method of identifying, evaluating, and understanding human perception about a product in form of key features or themes that we extract dynamically from a given set of reviews within a data set. These themes are then categories to form an opinion about a given feature inside of the product through which we can analyze the advantages as well as the short comings of a given product or organization. These key features are then displayed on to the user for them to make a wise decision based on their likes and dislikes which they can compare with the user base that have already formed a review. All of this happens seamlessly with the help of Natural language Techniques that enables us to dynamically extract features or themes and generalizes an opinion score alongside it to represent thousands of reviews in a small concise manner. To make this happen, we consider seven different steps: (i)Text Pre-Processing, (ii) Removal of Stop-Words, (iii) Vader Sentiment Analysis, (iv) Feature Extraction using HAC, (v) Classification of Key Features using MOS, (vi) Testing the Accuracy of the Score and (vii) Creation of Word-Cloud using Features.

Downloads

Download data is not yet available.

References

D. Winder, The importance of social mobility, Gemalto Review Magazine (2010).

V. Hatzivassiloglou, K.R. McKeown, Predicting the semantic orientation of adjectives, in: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics, ACL’98, 1997, pp. 174–181.

P.D. Turney, M.L. Littman, Measuring praise and criticism: Inference of semantic orientation from association, ACM Trans. Inf. Syst. 21 (2003) 315–346.

H. Kanayama, T. Nasukawa, Fully automatic lexicon expansion for domain-oriented sentiment analysis, in: Proceedings of the 2006 Conference onEmpirical Methods in Natural Language Processing, EMNLP’06, 2006, pp. 355–363.

Rahma, D. Y., & Atmaja, M. H. S. (2022). Peritoneal carcinomatosis and mimicking on CT scan findings. International Journal of Health & Medical Sciences, 5(1), 154-162. https://doi.org/10.21744/ijhms.v5n1.1864

B. Pang, L. Lee, Opinion mining and sentiment analysis, Found. Trends Inf. Retr. 2 (2008) 1–135.

Rinartha, K., & Suryasa, W. (2017). Comparative study for better result on query suggestion of article searching with MySQL pattern matching and Jaccard similarity. In 2017 5th International Conference on Cyber and IT Service Management (CITSM) (pp. 1-4). IEEE.

A. Esuli, F. Sebastiani, Determining the semantic orientation of terms through gloss classification, in: Proceedings of the 14th ACM International Conference on Information and Knowledge Management, CIKM’05, 2005, pp. 617–624.

S.-M. Kim, E. Hovy, Determining the sentiment of opinions, in: Proceedings of the 20th International Conference on Computational Linguistics, COLING’04, 2004.

A. Meena, T.V. Prabhakar, Sentence level sentiment analysis in the presence of conjuncts using linguistic analysis, in: Proceedings of the 29th European Conference on IR Research, ECIR’07, 2007.

Published

24-06-2022

How to Cite

Dandekar, P. R., & Pethe, Y. S. (2022). Theme extraction based on NLP. International Journal of Health Sciences, 6(S5), 4885–4894. https://doi.org/10.53730/ijhs.v6nS5.9654

Issue

Section

Peer Review Articles