An optimized random forest for multi class classification to classify the students using machine learning approaches

https://doi.org/10.53730/ijhs.v6nS7.13116

Authors

  • P. Amutha Research Scholar, Dept. of ComputerScience VISTAS, India
  • R. Priya Professor, Dept. of ComputerScience VISTAS, India

Keywords:

Multi Class, Hyper-Parameter, Feature Selection, Feature Important Score, Random Forest

Abstract

In recent years, Multi-class classification in Educational Data Mining (EDM) has been continued to be a focusing research to deal the issues in the imbalanced data set and less number of unifying classification algorithms. Machine-learning algorithms take prominent part in multi-class classification. This research studyintroduced an Optimized Random Forest for Multi-Class Classification (ORFMCC) to classify the students based on the higher education programs enrolment. The base classifier Random Forest is optimized by hyper-parameter tuning and feature selection processes. The Optimized RFMCC is developed in Python 3.3 using Spider IDE 4.1.5. The optimum parameter extracted by tuning and relevant features significantly improved classification accuracy. The classification performance of Optimized Random Forest for Multi-Class Classifieris compared with RF, NB, DT, LR, and KNN. The experiments result of Accuracy, F1-Score, precision and Recall revealed that the ORFMCC outperformed in Multi-Class classification compared to the other five Classifiers.

Downloads

Download data is not yet available.

References

Bujang, SitiDianah Abdul, et al. "Multiclass prediction model for student grade prediction using machine learning." IEEE Access 9 (2021): 95608-95621.

Tripathi, Ankita, et al. "A multi class random forest (MCRF) model for classification of small plant peptides." International Journal of Information Management Data Insights 1.2 (2021): 100029.

P. Amutha, R. Priya,, Analysis of Higher Education Counselling and its awareness among Higher Secondary Students in Tamil Nadu, India, International Journal of Modern Agriculture, Volume 10, No.2, 2021.

Sevastyanov, Leonid A., and Eugene Yu Shchetinin. "On methods for improving the accuracy of multi-class classification on imbalanced data." ITTMM. 2020.

Hassan, Hasniza, NorBahiah Ahmad, and SyahidAnuar. "Improved students’ performance prediction for multi-class imbalanced problems using hybrid and ensemble approach in educational data mining." Journal of Physics: Conference Series. Vol. 1529.No. 5.IOP Publishing, 2020.

Thomas, Roy, and J. E. Judith. "A Novel Ensemble Method for Detecting Outliers in Categorical Data." International Journal 9.4 (2020).

Thomas, Roy, and J. E. Judith. "Voting-Based Ensemble of Unsupervised Outlier Detectors." Advances in Communication Systems and Networks.Springer, Singapore, 2020.501-511.

P. Amutha, R. Priya,, Conceptual Course Selection Framework for Post-Secondary Students’ Enrolment in Indian Universities and Colleges, Journal of Advanced Research in Dynamical & Control Systems, Vol. 12, 03-Special Issue, 2020.

Do Thi Thu Hien, Cu Thi Thu Thuy, Tran Kim Anh, Dao The Son and Cu Nguyen Giap, “Optimize the Combination of Categorical Variable Encoding and Deep Learning Technique for the Problem of Prediction of Vietnamese Student Academic Performance” International Journal of Advanced Computer Science and Applications(IJACSA), 11(11), 2020. http://dx.doi.org/10.14569/IJACSA.2020.0111135

Dalton Ndirangu et al. “A Hybrid Ensemble Method for Multiclass Classification and Outlier Detection”. International Journal of Sciences: Basic and Applied Research (IJSBAR) · ISSN 2307-4531, January 2019.

Latha, C. Beulah Christalin, and S. CarolinJeeva. "Improving the accuracy of prediction of heart disease risk based on ensemble classification techniques." Informatics in Medicine Unlocked 16 (2019): 100203.

Basu K, Basu T, Buckmire R, Lal N, Predictive Models of Student College Commitment Decisions Using Machine Learning. Data.2019; 4(2):65. https://doi.org/10.3390/data4020065

Mustaqeem, Anam, Syed Muhammad Anwar, and MuahammadMajid. "Multiclass classification of cardiac arrhythmia using improved feature selection and SVM invariants." Computational and mathematical methods in medicine 2018 (2018).

Lango, Mateusz, and Jerzy Stefanowski. "Multi-class and feature selection extensions of roughly balanced bagging for imbalanced data." Journal of Intelligent Information Systems 50.1 (2018): 97-127.

AshishDutt and MaizatulAkmarIsmail , “Can We Predict Student Learning Performance from LMS data? A Classification Approach”, Advances in Social Science, Education and Humanities Research, volume 326, 2018.

Hota, Soudamini, and SudhirPathak. "KNN classifier based approach for multi-class sentiment analysis of twitter data." International Journal of Engineering & Technology 7.3 (2018): 1372-1375.

Katuwal, Rakesh, and Ponnuthurai N. Suganthan. "Enhancing multi-class classification of random forest using random vector functional neural network and oblique decision surfaces." 2018 International Joint Conference on Neural Networks (IJCNN).IEEE, 2018.

P. Amutha, R. Priya, A survey on educational data mining techniques in predicting student’s academic performance, International Journal of Engineering & Technology, 7 (2.33) (2018) 634-636.

Wang, Ying, et al. "Using multiclass classification to automate the identification of patient safety incident reports by type and severity." BMC medical informatics and decision making 17.1 (2017): 1-12.

Maryam, Noor AkhmadSetiawan, and OyasWahyunggoro. "A hybrid feature selection method using multiclass SVM for diagnosis of erythemato-squamous disease." AIP Conference Proceedings.Vol. 1867.No. 1.AIP Publishing LLC, 2017.

Potdar, Kedar, Taher S. Pardawala, and Chinmay D. Pai. "A comparative study of categorical variable encoding techniques for neural network classifiers." International journal of computer applications 175.4 (2017): 7-9.

Kannan, Ramakrishnan, et al. "Outlier detection for text data." Proceedings of the 2017 siam international conference on data mining. Society for Industrial and Applied Mathematics, 2017.

Chaudhary, Archana, SavitaKolhe, and Raj Kamal. "An improved random forest classifier for multi-class classification." Information Processing in Agriculture 3.4 (2016): 215-222.

Farid, DewanMd, et al. "Hybrid decision tree and naïve Bayes classifiers for multi-class classification tasks." Expert systems with applications 41.4 (2014): 1937-1946.

Published

30-09-2022

How to Cite

Amutha, P., & Priya, R. (2022). An optimized random forest for multi class classification to classify the students using machine learning approaches. International Journal of Health Sciences, 6(S7), 5312–5326. https://doi.org/10.53730/ijhs.v6nS7.13116

Issue

Section

Peer Review Articles