Generalized feature transformative dependency magnitude decision tree classification for COVID-19 medical data analysis

Authors

  • A. Mary Theresa Assistant Professor, Department of Information Technology, Nirmala College for Women, Coimbatore - 641 018. Tamil Nadu, India
  • V. Saravanan Professor & HEAD, Department of Information Technology, Hindusthan College of Arts and Science, Coimbatore - 641 028. Tamil Nadu, India

Keywords:

big data analytics, COVID-19 prediction, hosmer–lemeshow, log transform-based feature, engineering model, hellinger generalized, multidimensional scaling, feature map, rand dependency magnitude, iterative dichotomiser-3, decision tree classification model

Abstract

Data mining is a method for identifying attractive patterns in an understandable format from big data. Big data mining is an integration of structured and unstructured data that are mined for extracting valuable information and used for predictive analytics. Big data analytics are used in many applications, especially in the healthcare sector.  With a variety of data analytics devices and procedures, the healthcare domain uses big data to update health prevention and management. Big data analytics for healthcare create it feasible to invent smarter evaluations of patient heath conditions. One of the most current and relevant big data analytics in healthcare is the global COVID-19 prediction. Many works have been introduced for COVID-19 prediction with big data analytics.   But, the precision results and time consumption was not focused on by existing methods. Therefore, a novel Log Transformative Generalized feature Mapping based Rand Dependency Magnitude Iterative Dichotomiser-3 Decision Tree Classification (LTGFM-RDMDDTC) is introduced for COVID-19 prediction with higher accuracy, precision, and lesser time consumption. In medical data analytics, the performance of the model is said to be improved by including three major processes namely feature engineering, feature selection, and classification. 

Downloads

Download data is not yet available.

References

Abdu Gumaei, Walaa N. Ismail, Md. Rafiul Hassan, Mohammad Mehedi Hassan, Ebtsam Mohamed, Abdullah Alelaiwi, Giancarlo Fortino, “A Decision-Level Fusion Method for COVID-19 Patient Health Prediction”, Big Data Research, Elsevier, Volume 27, 2022, Pages 1-11. https://doi.org/10.1016/j.bdr.2021.100287

Aditya Gupta, Vibha Jain & Amritpal Singh, “Stacking Ensemble-Based Intelligent Machine Learning Model for Predicting Post-COVID-19 Complications”, New Generation Computing, 2021, Pages 1-21. https://doi.org/10.1007/s00354-021-00144-0

Ahmed Hamed, Ahmed Sobhy, Hamed Nassar, “Accurate Classification of COVID-19 Based on Incomplete Heterogeneous Data using a KNN Variant Algorithm”, Arabian Journal for Science and Engineering, Springer, Volume 46, 2021, Pages 8261–8272. https://doi.org/10.1007/s13369-020-05212-z

Ameer Sardar Kwekha-Rashid, Heamn N. Abduljabbar & Bilal Alhayani, “Coronavirus disease (COVID-19) cases analysis using machine-learning applications”, Applied Nanoscience, 2021, Pages 1-13. https://doi.org/10.1007/s13204-021-01868-7

Guohui Li, Kang Chen, Hong Yang, “A new hybrid prediction model of cumulative COVID-19 confirmed data”, Process Safety and Environmental Protection, Elsevier, Volume 157, January 2022, Pages1-19.https://doi.org/10.1016/j.psep.2021.10.047

Hua Ye, Peiliang Wu, Tianru Zhu, Zhongxiang Xiao, Xie Zhang, Long Zheng, Rongwei Zheng, Yangjie Sun, Weilong Zhou, Qinlei Fu, Xinxin Ye, Ali Chen, Shuang Zheng, Ali Asghar Heidari Mingjing Wang, Jiandong Zhu, Huiling Chen, and Jifa L, “Diagnosing Coronavirus Disease 2019 (COVID-19): Efficient Harris Hawks-Inspired Fuzzy K-Nearest Neighbor Prediction Methods”, IEEE Access, Volume 9, 2021, Pages 17787 – 17802. DOI: 10.1109/ACCESS.2021.3052835

Informatics in Medicine Unlocked, Elsevier, Volume 31,2022,Pages1-7. https://doi.org/10.1016/j.imu.2022.100990

Jialu Song, Hujin Xie, Bingbing Gao, Yongmin Zhong, Chengfan Gu, Kup-Sze Choi, “Maximum likelihood-based extended Kalman filter for COVID-19 prediction”, Chaos, Solitons and Fractals, Elsevier, Volume 146, 2021, Pages 1-9. https://doi.org/10.1016/j.chaos.2021.110922

Kim Tien Ly, “A COVID-19 forecasting system using adaptive neuro-fuzzy inference”, Finance Research Letters, Elsevier, Volume 41, 2021, Pages 1-10. https://doi.org/10.1016/j.frl.2020.101844

Luis Fernando Castillo Ossa, Pablo Chamoso, Jeferson Arango-López, Francisco Pinto-Santos,Gustavo Adolfo Isaza, Cristina Santa-Cruz-González, Alejandro Ceballos-Marquez, Guillermo Hernández and Juan M. Corchado, “A Hybrid Model for COVID-19 Monitoring and Prediction”, Electronics, Volume 10, Issue 7, 2021, Pages 1-13. https://doi.org/10.3390/electronics10070799

Makram Soui, Nesrine Mansouri, Raed Alhamad, Marouane Kessentini, Khaled Ghedira, “NSGA-II as feature selection technique and AdaBoost classifier for COVID-19 prediction using patient’s symptoms”, Nonlinear Dynamics, Springer, Volume 106, 2021, Pages 1453–1475 . https://doi.org/10.1007/s11071-021-06504-1

Md. Abdul Awal, Mehedi Masud, Md. Shahadat Hossain, Abdullah Al-Mamun Bulbul, S. M. Hasan Mahmud, Anupam Kumar Bairagi, “A Novel Bayesian Optimization-Based Machine Learning Framework for COVID-19 Detection From Inpatient Facility Data”, IEEE Access, Jan 2021, Pages 10263 – 10281. DOI: 10.1109/ACCESS.2021.3050852

N. Deepa a,⇑, J. Sathya Priya b, Devi T, “Towards applying internet of things and machine learning for the risk prediction of COVID-19 in pandemic situation using Naive Bayes classifier for improving accuracy”, Materials Today: Proceedings, Elsevier, 2022, Pages 1-15. https://doi.org/10.1016/j.matpr.2022.03.345

Olusola O. Abayomi-Alli, Robertas Damaševiˇcius, Rytis Maskeliunas and Sanjay Misra, “An Ensemble Learning Model for COVID-19 Detection from Blood Test Samples”, Sensors, Volume 22, Issue 6, 2022, Pages 1-18. https://doi.org/10.3390/s22062224

Prabh Deep Singh, Rajbir Kaur, Kiran Deep Singh, Gaurav Dhiman, “A Novel Ensemble-based Classifier for Detecting the COVID-19 Disease for Infected Patients”, Information Systems Frontiers, Springer, Volume 23, 2021, Pages 1385-1401. https://doi.org/10.1007/s10796-021-10132-w

Pradeep Kumar Roy and Abhinav Kumar, “Early prediction of COVID-19 using ensemble of transfer learning” Computers and Electrical Engineering, Elsevier, Volume 101, 2022, Pages 1-15. https://doi.org/10.1016/j.compeleceng.2022.108018

Saratu Yusuf Ilu, Prasad Rajesh, Hassan Mohammed, “Prediction of COVID-19 using long short-term memory by integrating principal component analysis and clustering techniques”,

Shivam Bhardwaj, Sunil Sharma, Rashmi Bhardwaj, “Machine Learned Hybrid Gaussian Analysis of COVID-19 Pandemic in India”, Results in Physics, Elsevier, Volume 30, November 2021, Pages1-22.https://doi.org/10.1016/j.rinp.2021.104630

Shwet Ketu and Pramod Kumar Mishra, “Enhanced Gaussian process regression-based forecasting model for COVID-19 outbreak and significance of IoT for its detection”, Applied Intelligence, Springer, Volume 51, 2021, Pages 1492–1512. https://doi.org/10.1007/s10489-020-01889-9

Vasyl Martsenyuk, Marcin Bernas, Aleksandra Klos-Witkowska, “Two-Strain COVID-19 Model Using Delayed Dynamic System and Big Data”, IEEE Access, Volume 9, 2021, Pages 113866 – 113878. DOI: 10.1109/ACCESS.2021.3104519

Vishan Kumar Gupta, Avdhesh Gupta, Dinesh Kumar, Anjali Sardana, “Prediction of COVID-19 confirmed, death, and cured cases in India using random forest model”, Big Data Mining and Analytics , Volume 4, Issue 2, 2021, Pages 116 – 123. DOI: 10.26599/BDMA.2020.9020016

Published

19-10-2022

How to Cite

Theresa, A. M., & Saravanan, V. (2022). Generalized feature transformative dependency magnitude decision tree classification for COVID-19 medical data analysis. International Journal of Health Sciences, 6(S10), 429–445. Retrieved from https://sciencescholar.us/journal/index.php/ijhs/article/view/13493

Issue

Section

Peer Review Articles