An OCR for Arabic character recognition with advanced principal component analysis based on feature extraction and fuzzy-KNN based classification

https://doi.org/10.53730/ijhs.v6nS1.7918

Authors

  • Ashiq V M Research scholar, Dept. of Computer Science, Karpagam Academy of Higher Education, Coimbatore
  • E J Thomson Fredrik Professor, Dept. of Computer Applications, Karpagam Academy of Higher Education, Coimbatore

Keywords:

ACR, Fuzzy, KNN, Principal Component Analysis

Abstract

Offline character recognition has become a highly important study field for various pattern recognition applications in recent years. Several handwritten character recognition systems have been suggested, with the complexity of these systems varying depending on the recognizing units' writing styles. In reality, identifying letters or numerals is far simpler than recognizing cursive sentences or lines of text. As a result, early handwriting recognition algorithms could only distinguish a few characters with limited vocabularies. Nowadays Arabic handwritten character recognition is very important as it is very difficult to identify. The cursive writing and variety of styles make this recognition more complex. This paper presents an automated model for ACR. This ACR is constructed from four phases: Preprocessing, Segmentation, Feature Extraction, and Classification. In this research article, we compare the advanced EPSO EKNN Algorithm with the earlier DBN Algorithm and also suggest a new method having higher Accuracy. In this research article, the "Enhanced K-Nearest Neighbor" (EKNN) classification model was used to identify and classify or simply recognize the particular Arabic character, and the "Extended Particle Swarm Optimization" (EPSO) methodology was introduced to select the best feature from feature extraction to comply with Arabic-Character Classification.

Downloads

Download data is not yet available.

References

S. Elsaid, H. Alharthi, R. Alrubaia, S. Abutile, R. Aljres, A. Alanazi, and A. Albrikan(2019) "Arabic real-time license plate recognition system", in Proc.1st Int. Conf. Comput. pp. 126-143.

M. Awel, M. Ahmed, and A. Abidi,(2019) "Review on optical character recognition", Int. J. Comput. Appl., vol. 6, no. 5, pp. 3666-3669,

F. Solamani and A. Mohamed,(2019) "Off-line optical character recognition system for Arabic handwritten text", J. Pure Appl. Sci., vol. 18, pp. 52-58,

A. Lawgali,(2015) "A survey on Arabic character recognition", Int. J. Signal Process., Image Process. Pattern Recognit., vol. 8, no. 2, pp. 401-426

I. A. Doush, F. AIKhateeb, and A. H. Gharibeh(2018) "Yarmouk Arabic OCR dataset", in Proc. 8th Int. Conf. Comput. Sci. Inf. Technol. (CSIT), pp. 150-154.

Ashiquzzaman A, Tushar AK, Rahman A, Mohsin F (2019) An efficient recognition method for handwritten Arabic numerals using CNN with data augmentation and dropout. In: Balas VE, Sharma N, Chakrabarti A (eds) Data management, analytics, and innovation, advances in intelligent systems and computing. Springer, Singapore, pp 299–309

R. Ahmad, S. Naz, M. Afzal, S. Rashid, M. Liwicki, and A. Dengel,(2020) "A deep learning-based Arabic script recognition system: Benchmark on KHAT", Int. Arab J. Inf. Technol., vol. 17, no. 3, pp. 299-305,

Alani A (2017) Arabic handwritten digit recognition based on restricted Boltzmann machine and convolutional neural networks. Information 8(4):142.

M. M. Mohamad, H. Hassan, D. Nasien, and H. Haron,(2015) "A review on feature extraction and feature selection for handwritten character recognition", Int. J. Adv. Comput. Sci. Appl., vol. 6, no. 2, pp. 204-213,

Mars A, Antoniadis G (2016) Arabic online handwriting recognition using neural network. Int J Artif Intell Appl(IJAIA) 7(5)

El-Sawy A, Loey M, Hazem E (2017) Arabic handwritten character recognition using convolutional neural network. WSEAS Trans Comput Res 5:11–19

Maalej R, Kherallah M (2018) Convolutional neural network and best for offline Arabic handwriting recognition. In: 2018 International Arab conference on information technology (ACIT), Werdanye, Lebanon, 2018, pp 1–6. https://doi.org/10.1109/ACIT. 2018.8672667

Latif G, Alghazo J, Alzubaidi L, Naseer MM, Alghazo Y (2018) Deep convolutional neural network for recognition of unified multi-language handwritten numerals. In: 2018 IEEE 2nd International workshop on Arabic and derived script analysis and recognition (ASAR), pp 90–95

F. Alotaibi, M. T. Abdullah, R. B. H. Abdullah, R. W. B. O. K. Rahmat, I. A. T. Hashem, and A. K. Sangaiah,(2018)``Optical character recognition for quranic image similarity matching,'' IEEE Access, vol. 6, pp. 554-562

Dr E J Thomson Fredrik, V. M. Ashiq (2021). An OCR for Arabic Character Recognition with Ensemble Approach based Feature Selection for Enhanced KNN Classification. Design Engineering, 4728-4750. Retrieved from http://www.thedesignengineering.com/index.php/DE/article/view/5425.

Published

26-05-2022

How to Cite

Ashiq, V. M., & Fredrik, E. J. T. (2022). An OCR for Arabic character recognition with advanced principal component analysis based on feature extraction and fuzzy-KNN based classification. International Journal of Health Sciences, 6(S1), 12205–12224. https://doi.org/10.53730/ijhs.v6nS1.7918

Issue

Section

Peer Review Articles