An efficient filter based classification approach for microarray disease detection
Keywords:
Microarray data, feature selection, data transformation and classificationAbstract
As the size of the biomedical databases areincreasing day-by-day, finding anessential featureset for classification problem is complex due to large data size and sparsity problems. Microarray feature ranking and classification is one of the major challenges to scientific and medical researchers due to its high dimensional feature space and limited number of samples. Feature transformation, feature ranking and data classification are the essential components to improve the microarray cancer prediction on high dimensional datasets. In this work, a novel framework is designed and implemented to classify the high dimensional data with high true positive rate. In the proposed work, a hybrid feature transformation, hybrid feature selection and advance classification approach are implemented to improve the true positive rate and error rate of the disease prediction. A novel principal component ranking measure is integratedin order to find the subset of features for classification problem. Finally, a hybrid decision tree classifier is used to predict the classification accuracy on the selected features set. Experimental results proved that the present framework has better performance compared to the traditional models for variable microarray datasets.
Downloads
References
M. Ghosh S. Begum R. Sarkar D. Chakraborty U. Maulik "Recursive memetic algorithm for gene selection in microarray data" Expert Systems with Applications vol. 116 pp. 172-185 2019.
Z. Rustam I. Primasari D. Widya "Classification of cancer data based on support vectors machines with feature selection using genetic algorithm and laplacian score" AIP Conference Proceedings vol. 2023 no. 1 pp. 020234 2018.
V.B. Canedo N.S. Marono "A Review of Microarray Datasets and Applied Feature Selection Methods" Information Sciences pp. 111-135 2014.
Q. Su "A Cancer Gene Selection Algorithm Based on the K-S Test and CFS" Biomed Research International pp. 1-6 2017.
M. Morovvat A. Osareh "An Ensemble of Filters and Wrappers for Microarray Data Classification" Machine Learning and Applications: An International Journal (MLAIJ) vol. 3 no. 2 June 2016.
N Matamala MT Vargas R González-Cámpora R Miñambres et al. "Tumor microRNA expression profiling identifies circulating microRNAs for early breast cancer detection" Clin Chem vol. 61 no. 8 pp. 1098-106 Aug 2015.
K. Yan L. Ma Y. Dai W. Shen Z. Ji D. Xie "Cost-sensitive and sequential feature selection for chiller fault detection and diagnosis" International Journal of Refrigeration vol. 86 pp. 401-409 2018.
H. Lu J. Chen K. Yan Q. Jin Y. Xue Z. Gao "A hybrid feature selection algorithm for gene expression data classification" Neurocomputing vol. 256 pp. 56-62 2017.
K. Yan Z. Ji H. Lu J. Huang W. Shen Y. Xue "Fast and accurate classification of time series data using extended ELM: Application in fault diagnosis of air handling units" IEEE Transactions on Systems Man and Cybernetics: Systems 2017.
Y. Liu H. Lu K. Yan H. Xia C. An "Applying cost-sensitive extreme learning machine and dissimilarity integration to gene expression data classification" Computational intelligence and neuroscience 2016.
C. Braicu D. Gulei B. De Melo Maia I. Berindan-Neagoe G. A. Calin "Mirna expression assays" in Genomic Applications in Pathology Springer pp. 65-92 2019.
T. Setoyama H. Ling S. Natsugoe G. A. Calin "Non-coding rnas for medical practice in oncology" The Keio journal of medicine vol. 60 no. 4 pp. 106-113 2011.
M. Ghosh S. Begum R. Sarkar D. Chakraborty U. Maulik "Recursive memetic algorithm for gene selection in microarray data" Expert Systems with Applications vol. 116 pp. 172-185 2019.
J. Krawczuk T. Łukaszuk "The feature selection bias problem in relation to high-dimensional gene data" Artif. Intell. Med. vol. 66 pp. 63-71 2016.
H. Öztoprak M. Toycan Y.K. Alp et al. "Machine-based classification of ADHD and non-ADHD participants using time/frequency features of event-related neuroelectric activity" Clin. Neurophysiol. vol. 128 no. 12 pp. 2400-2410 2017.
P. Viday Sagar, Nageswara Rao Moparthi, Ch. Mukesh “Smart Meter Analytics for Optimizing the Utilization of Electricity using Arima, Navie & Holt Winter” International Journal of Innovative Technology and Exploring Engineering Vol 8, PP 585-590 (2019)
Published
How to Cite
Issue
Section
Copyright (c) 2022 International journal of health sciences

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Articles published in the International Journal of Health Sciences (IJHS) are available under Creative Commons Attribution Non-Commercial No Derivatives Licence (CC BY-NC-ND 4.0). Authors retain copyright in their work and grant IJHS right of first publication under CC BY-NC-ND 4.0. Users have the right to read, download, copy, distribute, print, search, or link to the full texts of articles in this journal, and to use them for any other lawful purpose.
Articles published in IJHS can be copied, communicated and shared in their published form for non-commercial purposes provided full attribution is given to the author and the journal. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
This copyright notice applies to articles published in IJHS volumes 4 onwards. Please read about the copyright notices for previous volumes under Journal History.








