Survey on optimization algorithms in speech processing

M Manjutha; P Subashini

doi:10.53730/ijhs.v6nS5.9307

Authors

Manjutha M Department of Computer Science, Avinashilingam Institute for Home Science and Higher Education for Women, Coimbatore, Tamil Nadu, India
Subashini P Professor, Department of Computer Science, Avinashilingam Institute for Home Science and Higher Education for Women, Coimbatore, Tamil Nadu, India

Keywords:

Evolutionary Computation, Feature Selection, Fitness Function, Nature Inspired Algorithm, Optimization and Speech Recognition

Abstract

Nature is a tremendous provenance of resolving hard and complex problems that exist in the field of computer science because it reveals a very diverse, robust, dynamic, and interesting phenomenon. It constantly finds the optimal solution to resolve its problem which accomplishes exact equity among its element. Nature-inspired algorithms are the heuristic high-level procedure that interprets nature to solve the optimization problem which popularized in the new era of computation. The main objective of this paper is to evaluate the modern technology and enhancement in the nature-inspired algorithm, especially in the application of speech processing, speech recognition, and speech feature selection problems. This paper presents broad collections of global optimization algorithms which have been successfully applied to generate recognition systems that are integrated with metaheuristic algorithms.

Downloads

Download data is not yet available.

References

A. Dev and P. Bansal, Robust Features for Noisy Speech Recognition using MFCC Computation from Magnitude Spectrum of Higher Order Autocorrelation Coefficients, Journal of Computer Applications,10(8) (2010) 36-38.

A. Shahzadi, A. Ahmadyfard, and A. Harim, Speech emotion recognition using nonlinear dynamics features, Turkish Journal of Electrical Engineering & Computer Sciences, 23 (2015) 2056-2073. DOI:10.3906/elk-1302-90

A. V. Ermilov, Modeling Speech Features Via Simulated Annealing Algorithm, Discrete and continuous models and applied computational science, 2 (2014), 354-358.

A. Zabidi, Lee Yoot Khuan, W. Mansor, I. M. Yassin, R. Sahak, Optimization of MFCC parameters using Particle Swarm Optimization for diagnosis of infant hypothyroidism using Multi- Layer Perceptron, in Proc Annual International Conference of the IEEE Engineering in Medicine and Biology, (2010)1417-1420, DOI: 10.1109/IEMBS.2010.5626712.

Ahmed Al-Hmouz, Khaled Daqrouq, Rami Al-Hmouz and Jaafar Alghazo, Feature Reduction Method for Speaker Identification Systems Using Particle Swarm Optimization, International Journal of Engineering and Technology (IJET), 9 (3) (2017) 1714-1723. DOI:10.21817/ijet/2017/v9i3/170903045.

Asl, Laleh Badri, Nezhad, Vahid Ma, Speech Enhancement Using Particle Swarm Optimization Techniques, in Proc. International Conference on Measuring Technology and Mechatronics Automation, (2010) 441-444. DOI:10.1109/icmtma.2010.510

B. Xue, M. Zhang, W. N. Browne and X. Yao, A Survey on Evolutionary Computation Approaches to Feature Selection, IEEE Transactions on Evolutionary Computation, 20(4) (2016)606-626, DOI: 10.1109/TEVC.2015.2504420.

Bolun Chen, Ling Chen and Yixin Chen, Efficient ant colony optimization for image feature selection, Signal Processing, 93(6) (2013)1566-1576, DOI:10.1016/j.sigpro.2012.10.022.

Bright Kanisha, Ganesan Balarishnanan, Speech Recognition with Advanced Feature Extraction Methods Using Adaptive Particle Swarm Optimization, International Journal of Intelligent Engineering and Systems, 9(4) (2016) 21-30, DOI:10.22266/ijies2016.1231.03.

C.Poonkuzhali, R.Karthiprakash, Dr.S.Valarmathy, and M.Kalamani, An Approach to Feature Selection Algorithm Based on Ant Colony Optimization For Automatic Speech Recognition, International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering,2(11)(2013).

Czyzewski.A, Kaczmarek. A, Kostek.B, Intelligent processing of stuttered speech, Journal of Intelligent Information Systems, 21(2003)143–171, https://doi.org/10.1023/A:1024710532716.

E. Avci, Z. H. Akpolat, Speech recognition using a wavelet packet adaptive network based fuzzy inference system, In Expert Systems with Applications, 31(3) (2006)495-503. DOI:10.1016/j.eswa.2005.09.058.

E. Saraç and S. A. Özel, An Ant Colony Optimization Based Feature Selection for Web Page Classification, The Scientific World Journal, 2014 (2014)1-16, DOI:10.1155/2014/649260.

F. L. Huang, An Effective Approach for Chinese Speech Recognition on Small size of Vocabulary, Journal of signal and image processing, 2(2) (2011) 48-60. DOI:10.5121/sipij.2011.2205.

Fabíola Araújo, José Filho and Aldebaro Klautau, Genetic algorithm to estimate the input parameters of Klatt and HLSyn formant-based speech synthesizers, Biosystems, 150 (2016)190–193, DOI:10.1016/j.biosystems.2016.10.002.

Fooad Jalili and Milad Jafari Barani, Speech Recognition Using Combined Fuzzy and Ant Colony Algorithm, International Journal of Electrical and Computer Engineering (IJECE), 6(5) (2016) 2205-2210.

G. C. Batista, W. L. Santos Silva and A. G. Menezes, Automatic speech recognition using Support Vector Machine and Particle Swarm Optimization, in Proc. IEEE Symposium Series on Computational Intelligence (SSCI), Athens, (2016)1-6. DOI: 10.1109/SSCI.2016.7850125.

Gambardella L. and M. Dorigo, Ant-Q: A Reinforcement Learning approach to the traveling salesman problem, in Proc. of ML-95, Twelfth International Conference on Machine Learning, Tahoe City, CA, A. Prieditis and S. Russell (Eds.), Morgan Kaufmann,1733(1995), 252–260, DOI:10.1016/b978-1-55860-377-6.50039-6.

Ganapathiraju A, Hamaker J. E, and Picone J, Applications of support vector machines to speech recognition, IEEE Transactions on Signal Processing, 52(8) (2004) 2348-2355, DOI:10.1109/tsp.2004.831018.

Guowen Wang, Shixin Luo, Li He, Gang Yin, Application BP neural network in the speaker recognition Based on Chaos Particle Swarm Optimization Algorithm, Advanced Materials Research, 765-767 (2013) 2805-2808.

H. Kanan, K. Faez, S. M. Taheri, Feature Selection Using Ant Colony Optimization (ACO): A New Method and Comparative Study in the Application of Face Recognition System, P. Perner (Ed.): ICDM 2007, LNAI 4597, (2007) 63–76.

Hariharan Muthusamy, Kemal Polat, Sazali Yaacob, Particle Swarm Optimization Based Feature Enhancement and Feature Selection for Improved Emotion Recognition in Speech and Glottal Signals, PLoS ONE, 10(3) (2015) 1-20, DOI:10.1371/journal.pone.0120344. [29]

Ibrahim M. El-Henawy, Walid I. Khedr, Osama M. ELkomy, Al-Zahraa M.I. Abdalla, Recognition of phonetic Arabic figures via wavelet based Mel Frequency Cepstrum using HMMs, Journal of Housing and Building National Research Center, 10(2014) 49-54.

J. Jiang, Z. Wu, M. Xu, J. Jia and L. Cai, Comparing feature dimension reduction algorithms for GMM-SVM based speech emotion recognition, Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific, (2013) 1-4, doi: 10.1109/APSIPA.2013.6694336.

J. Sirisha Devi and Srinivas Yarramalle, Multi-Objective Optimization Problem resolution based on Hybrid Ant-Bee Colony for Text Independent Speaker Verification, I.J. Modern Education and Computer Science, 1(2015) 55-63.

J.S. Lee and C.H. Park, Hybrid simulated annealing and its application to optimization of hidden Markov models for visual speech recognition, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 40(4) (2010) 1188-1196.

K.M.Ravikumar, Balakrishna Reddy, R.Rajagopal, H.C.Nagaraj, Automatic detection of syllable repetition in read speech for objective assessment of stuttered disfluencies, in Proc. of World Academy Science, Engineering and Technology (2008) 270–273.

Kalamani, M., Valarmathy, S., Poonkuzhali, C., & Catherine J N. (2014), Feature selection algorithms for automatic speech recognition. 2014 International Conference on Computer Communication and Informatics, 1-7, DOI:10.1109/iccci.2014.6921797.

Korvel, G., O. Kurasova, and B. Kostek, Comparative analysis of spectral and cepstral feature extraction techniques for phoneme modelling, in Proc. 11th International conference MISSI (2018) 480-489.DOI:10.1007/978-3-319-98678-4_48.

Lihui DU and Yueguang Li, Recognition of practical English speech emotion using improved Quantum Ant Colony Algorithm, International Symposium on Computers & Informatics (ISCI 2015).

Lokesh Selvaraj and Balakrishnan Ganesan, Enhancing Speech Recognition Using Improved Particle Swarm Optimization Based Hidden Markov Model, Hindawi Publishing Corporation, The Scientific World Journal, Volume 2014, Article ID 270576, 1-10.

M. Benzeghiba, R. De Mori, O. Deroo, S. Dupont, T. Erbes, D. Jouvet, L. Fissore, P. Laface, A. Mertins, C. Ris, R. Rose, V. Tyagi, C. Wellekens, Automatic speech recognition and speech variability: A review, Speech Communication, 49(10-11) (2007)763-786.

M. Dorigo, M. Birattari and T.Stutzle, Ant Colony Optimization: Artificial Ants as Computational Intelligent Technique, IEEE Computational Intelligent Magazine, (2006).

M. Dua, R. Aggarwal and M. Biswas, Optimizing Integrated Features for Hindi Automatic Speech Recognition System, Journal Intelligent System, 29(1) 2020, 959–976, DOI:10.1515/jisys-2018-0057.

M. M Kabir, M. Shahjahan and K Murase, A new hybrid ant colony optimization algorithm for feature selection, Expert Systems with Applications, 39 (2012) 3747–3763.

M. Manjutha, Dr. P. Subashini, Particle Swarm Optimization based Voice Activity Detection for Stuttered Tamil Speech, International Journal of Computer Engineering and Applications, 11(9) (2017) 1-14.

M. Manjutha, P. Subashini, M. Krishnaveni and V. Narmadha, An Optimized Cepstral Feature Selection method for Dysfluencies Classification using Tamil Speech Dataset, in Proc. IEEE International Smart Cities Conference (ISC2), Casablanca, Morocco, (2019), 671-677.

M.Dorigo and K.Socha, An introduction to ant colony optimization, Handbook of Metaheuristic, Brussels: IRIDIA, 26(1) (2006).

M.H. Aghdam. An Improved Ant Colony Optimization Algorithm and its Application to Text-Independent Speaker Verification System, JAISCR,2 (4) (2012)301-315.

M.Sheikhan, Synthesizing Supra segmental Speech Information Using Hybrid of GA-ACO and Dynamic Neural Network, in Proc. 5th Conference on Information and Knowledge Technology (IKT) (2013) 175-180, doi: 10.1109/IKT.2013.6620060

Mahesha. P, Vinod.D.S, Support vector machine-based stuttering dysfluency classification using GMM supervectors, Int. J. Grid and Utility Computing, 6(3/4) (2015)143–149.

Mahesha.P, Vinod. D. S, Feature based classification of dysfluent and normal speech, in Proc. Second International Conference on Computational Science, Engineering and Information Technology - CCSEIT 12, (2012) 594-597.

Manikandan J, Venkataramani B, Girish K, Karthic H and Siddharth V, Hardware implementation of real-time speech recognition system using TMS320C6713 DSP, in Proceedings of IEEE International Conference on VLSI Design, 250-255, 2011. 45

Manjula.G, Shivakumar.M, Geetha.Y. V, Adaptive optimization based neural network for classification of stuttered speech, in Proc. 3rd International Conference on Cryptography, Security and Privacy - ICCSP ’19 (2019) 93-98.

N. Najkar, F. Razzazi and H. Sameti, A novel approach to HMM-based speech recognition systems using particle swarm optimization, Mathematical and Computer Modelling 52 (2010), 1910-1920, DOI:10.1016/j.mcm.2010.03.041.

R. Alhutaish and N.Omar, Feature Selection for Multi-Label Document Based on Wrapper Approach through Class Association Rules, International Journal on Advanced Science Engineering and Information Technology, 7(2) (2017),DOI: 10.18517/ijaseit.7.2.1040.

R. K. Aggarwal; M. Dave, Filterbank optimization for robust ASR using GA and PSO, International Journal of Speech Technology, 15(2), (2012),191–201, DOI:10.1007/s10772-012-9133-9

R.A. Shirvan and E. Tahami, Voice analysis for detecting Parkinson's disease using genetic algorithm and KNN classification method, Biomedical Engineering (ICBME), 2011 18th Iranian Conference of, 14-16 (2011), 278-283, DOI: 10.1109/ICBME.2011.6168572.

R.Mehmood, W.Shahzad and E. Ahmed, Maximum Relavancy Minimum Redundancy Based Feature Subset Selection using Ant Colony Optimization, Journal of Applied Environmental and Biological Sciences, 7(4), (2017)118-130.

S Rajarajeswari, Shree Devi B N, Sushma G, Optimal Feature Selection of Speech using Particle Swarm Optimization Integrated with mRMR for Determining Human Emotion State, International Journal of Computer Applications, 74(10) (2013) 48-52.

S. Nemati, M.E. Basiri, Particle Swarm Optimization for Feature Selection in Speaker Verification, in Proc. International Conference on the Applications of Evolutionary Computation, 6024, (2010) 371–380, DOI:10.1007/978-3-642-12239-2_39

T. A. Souza, M. A. Souza, W. C. d. A. Costa, S. C. Costa, S. E. N. Correia and V. J. D. Vieira, Feature Selection based on Binary Particle Swarm Optimization and Neural Networks for Pathological Voice Detection, 2015 Latin America Congress on Computational Intelligence (LA-CCI),(2015) 1-6, DOI: 10.1109/LA-CCI.2015.7435962.

Widana, I.K., Dewi, G.A.O.C., Suryasa, W. (2020). Ergonomics approach to improve student concentration on learning process of professional ethics. Journal of Advanced Research in Dynamical and Control Systems, 12(7), 429-445.

Thaer M. Taha, Hazem M. El-Bakry, Amal Ibrahim, Samir Abd-Elrazik, Magdi Z. Rashad, Fast Sound Verification Using Support Vector Machine and Particle Swarm Optimization Algorithms, International Journal of Advanced Research in Computer Science & Technology (IJARCST 2016), 4(1), (2016)78-83.

W. Zha and G. K. Venayagamoorthy, Comparison of Non-uniform Optimal Quantizes Designs for Speech Coding With Adaptive Critics and Particle Swarm, IEEE Transactions on Industry Applications, 43(1) (2005)674-679

Wisniewski.M, Kuniszyk-Jozkowiak.W, Smolka.E, Suszynski.W, Automatic detection of prolonged fricative phonemes with the hidden markov models approach, Journal of Medical Informatics and Technologies, 11(2007) 294-298.

X. Wei and X. Yang, Speech dynamic time warping based on ant colony optimization algorithm, 3rd International Conference on Consumer Electronics, Communications and Networks, (2013) 602-604, DOI: 10.1109/CECNet.2013.6703403.

X. Zhang and Y. Guo, Optimization of SVM Parameters Based on PSO Algorithm”, Fifth International Conference on Natural Computation, (2009) 536-539, DOI: 10.1109/ICNC.2009.257.