An optimized random forest for multi class classification to classify the students using machine learning approaches
Keywords:
Multi Class, Hyper-Parameter, Feature Selection, Feature Important Score, Random ForestAbstract
In recent years, Multi-class classification in Educational Data Mining (EDM) has been continued to be a focusing research to deal the issues in the imbalanced data set and less number of unifying classification algorithms. Machine-learning algorithms take prominent part in multi-class classification. This research studyintroduced an Optimized Random Forest for Multi-Class Classification (ORFMCC) to classify the students based on the higher education programs enrolment. The base classifier Random Forest is optimized by hyper-parameter tuning and feature selection processes. The Optimized RFMCC is developed in Python 3.3 using Spider IDE 4.1.5. The optimum parameter extracted by tuning and relevant features significantly improved classification accuracy. The classification performance of Optimized Random Forest for Multi-Class Classifieris compared with RF, NB, DT, LR, and KNN. The experiments result of Accuracy, F1-Score, precision and Recall revealed that the ORFMCC outperformed in Multi-Class classification compared to the other five Classifiers.
Downloads
References
Bujang, SitiDianah Abdul, et al. "Multiclass prediction model for student grade prediction using machine learning." IEEE Access 9 (2021): 95608-95621.
Tripathi, Ankita, et al. "A multi class random forest (MCRF) model for classification of small plant peptides." International Journal of Information Management Data Insights 1.2 (2021): 100029.
P. Amutha, R. Priya,, Analysis of Higher Education Counselling and its awareness among Higher Secondary Students in Tamil Nadu, India, International Journal of Modern Agriculture, Volume 10, No.2, 2021.
Sevastyanov, Leonid A., and Eugene Yu Shchetinin. "On methods for improving the accuracy of multi-class classification on imbalanced data." ITTMM. 2020.
Hassan, Hasniza, NorBahiah Ahmad, and SyahidAnuar. "Improved students’ performance prediction for multi-class imbalanced problems using hybrid and ensemble approach in educational data mining." Journal of Physics: Conference Series. Vol. 1529.No. 5.IOP Publishing, 2020.
Thomas, Roy, and J. E. Judith. "A Novel Ensemble Method for Detecting Outliers in Categorical Data." International Journal 9.4 (2020).
Thomas, Roy, and J. E. Judith. "Voting-Based Ensemble of Unsupervised Outlier Detectors." Advances in Communication Systems and Networks.Springer, Singapore, 2020.501-511.
P. Amutha, R. Priya,, Conceptual Course Selection Framework for Post-Secondary Students’ Enrolment in Indian Universities and Colleges, Journal of Advanced Research in Dynamical & Control Systems, Vol. 12, 03-Special Issue, 2020.
Do Thi Thu Hien, Cu Thi Thu Thuy, Tran Kim Anh, Dao The Son and Cu Nguyen Giap, “Optimize the Combination of Categorical Variable Encoding and Deep Learning Technique for the Problem of Prediction of Vietnamese Student Academic Performance” International Journal of Advanced Computer Science and Applications(IJACSA), 11(11), 2020. http://dx.doi.org/10.14569/IJACSA.2020.0111135
Dalton Ndirangu et al. “A Hybrid Ensemble Method for Multiclass Classification and Outlier Detection”. International Journal of Sciences: Basic and Applied Research (IJSBAR) · ISSN 2307-4531, January 2019.
Latha, C. Beulah Christalin, and S. CarolinJeeva. "Improving the accuracy of prediction of heart disease risk based on ensemble classification techniques." Informatics in Medicine Unlocked 16 (2019): 100203.
Basu K, Basu T, Buckmire R, Lal N, Predictive Models of Student College Commitment Decisions Using Machine Learning. Data.2019; 4(2):65. https://doi.org/10.3390/data4020065
Mustaqeem, Anam, Syed Muhammad Anwar, and MuahammadMajid. "Multiclass classification of cardiac arrhythmia using improved feature selection and SVM invariants." Computational and mathematical methods in medicine 2018 (2018).
Lango, Mateusz, and Jerzy Stefanowski. "Multi-class and feature selection extensions of roughly balanced bagging for imbalanced data." Journal of Intelligent Information Systems 50.1 (2018): 97-127.
AshishDutt and MaizatulAkmarIsmail , “Can We Predict Student Learning Performance from LMS data? A Classification Approach”, Advances in Social Science, Education and Humanities Research, volume 326, 2018.
Hota, Soudamini, and SudhirPathak. "KNN classifier based approach for multi-class sentiment analysis of twitter data." International Journal of Engineering & Technology 7.3 (2018): 1372-1375.
Katuwal, Rakesh, and Ponnuthurai N. Suganthan. "Enhancing multi-class classification of random forest using random vector functional neural network and oblique decision surfaces." 2018 International Joint Conference on Neural Networks (IJCNN).IEEE, 2018.
P. Amutha, R. Priya, A survey on educational data mining techniques in predicting student’s academic performance, International Journal of Engineering & Technology, 7 (2.33) (2018) 634-636.
Wang, Ying, et al. "Using multiclass classification to automate the identification of patient safety incident reports by type and severity." BMC medical informatics and decision making 17.1 (2017): 1-12.
Maryam, Noor AkhmadSetiawan, and OyasWahyunggoro. "A hybrid feature selection method using multiclass SVM for diagnosis of erythemato-squamous disease." AIP Conference Proceedings.Vol. 1867.No. 1.AIP Publishing LLC, 2017.
Potdar, Kedar, Taher S. Pardawala, and Chinmay D. Pai. "A comparative study of categorical variable encoding techniques for neural network classifiers." International journal of computer applications 175.4 (2017): 7-9.
Kannan, Ramakrishnan, et al. "Outlier detection for text data." Proceedings of the 2017 siam international conference on data mining. Society for Industrial and Applied Mathematics, 2017.
Chaudhary, Archana, SavitaKolhe, and Raj Kamal. "An improved random forest classifier for multi-class classification." Information Processing in Agriculture 3.4 (2016): 215-222.
Farid, DewanMd, et al. "Hybrid decision tree and naïve Bayes classifiers for multi-class classification tasks." Expert systems with applications 41.4 (2014): 1937-1946.
Published
How to Cite
Issue
Section
Copyright (c) 2022 International journal of health sciences

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Articles published in the International Journal of Health Sciences (IJHS) are available under Creative Commons Attribution Non-Commercial No Derivatives Licence (CC BY-NC-ND 4.0). Authors retain copyright in their work and grant IJHS right of first publication under CC BY-NC-ND 4.0. Users have the right to read, download, copy, distribute, print, search, or link to the full texts of articles in this journal, and to use them for any other lawful purpose.
Articles published in IJHS can be copied, communicated and shared in their published form for non-commercial purposes provided full attribution is given to the author and the journal. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
This copyright notice applies to articles published in IJHS volumes 4 onwards. Please read about the copyright notices for previous volumes under Journal History.








