Open Access Open Access  Restricted Access Subscription Access

Methods and Algorithms of Speech Signals Processing and Compression and Their Implementation in Computer Systems


Affiliations
1 Shaqra University, Saudi Arabia
2 Irbed National University, Jordan
 

The review and comparative analysis of the methods of compression and recognition of speech signals is carried out. The result of the carried out analysis of the existing recognition methods indicates, that all of them are based on the use of “inflexible” algorithms, which are badly adapted to the characteristic features of speech signals, thus degrading the efficiency of the operation of the whole recognition system. The necessity of the use of algorithms for determination of recognition features along with the use of the wavelet packet analysis as one of the advanced directions of the creation of the effective methods and principles of the development of the speech signals recognition systems is substantiated. Analysis of the compression methods with the use of the orthogonal transformations at the complete exception of minimal decomposition factors is conducted; a maximal possible compression degree is defined. In this compression method the orthogonal transformation of the signal segment with the subsequent exception of the set of the smallest modulo decomposition factors, irrespective of the order of their distribution, is conducted. Therefore the additional transfer of the information on the factors distribution is required. As a result, two information streams appear, the first one corresponds to the information stream on the decomposition factors, and the second stream transfers information on the distribution of these factors. Method of the determination of the speech signals recognition features and the algorithm for nonlinear time normalization is proposed and proved. Wavelet-packet transformation is adaptive, i.e. it allows adapting to the signal features more accurately by means of the choice of the proper tree of the optimal decomposition form, which provides the minimal number of wavelet factors at the prescribed accuracy of signal reconstruction, thus eliminating the information-surplus and unnecessary details of the signals. Estimation of the informativeness of the set of wavelet factors is accomplished by the entropy. In order to obtain the recognition factors, the spectral analysis operation is used. In order to carry out the temporary normalization, the deforming function is found, the use of which minimizes the discrepancy between the standard and new words realization. Dedicated to the determination of admissible compression factors on the basis of the orthogonal transformations use at the incomplete elimination of the set of minimal decomposition factors, to the creation of the block diagram of the method of the recognition features formation, to the practical testing of the software- methods. In order to elevate the compression factor, the adaptive uniform quantization is used, where the adaptation is conducted for all the decomposition factors. The program testing of the recognition methods is carried out by means of determination of the classification error probability using Mahalanobis (Gonzales) distance.

Keywords

Recognition of Speech Signals, Compression, Recognition Features.
User
Notifications
Font Size

  • K Kinney, A., and J. Stevens. “Wavelet packet cepstral analysis for speaker recognition.” Conference Record of the Thirty-Sixth Asilomar Conference on Signals, Systems and Computers, 2002.
  • K Klein, ShmuelTomi. “Data Compression in Information Retrieval Systems.” Database and Data Communication Network Systems, 2002, pp. 573–633.
  • Fadi Alkalani,Compression of speech signal based on orthogonal transformations, selection and processing of information, 20(96),P. 137-142, 2004.
  • Car, J. “Improving quality and safety of telephone based delivery of care: teaching telephone consultation skills.” Quality and Safety in Health Care, vol. 13, no. 1, Jan. 2004, pp. 2–3.
  • Burrus, C.S., Gopinath, R.A. and Guo, H. (1998) Introduction to Wavelet and Wavelet Transforms. Prentice Hall, New Jersey. - References - Scientific Research Publishing.
  • Rao, Raghuveer M.; BorosTibor. “Wavelet Transforms: Introduction to Theory and Applications, Journal of Electronic Imaging.” DeepDyve, SPIE, 1 Oct. 1999.
  • https://www.krsu.edu.kg/vestnik/index.html
  • Sydow, A. “Tou, J. T./Gonzalez, R. C., Pattern Recognition Principles, Publishing Company. 1974. Journal of Applied Mathematics and Mechanics, 22 Nov. 200.
  • K Kapustii, B. E., et al. “Features in the design of optimal recognition systems.” Automatic Control and Computer Sciences, vol. 42, no. 2, 2008, pp. 64–70.

Abstract Views: 283

PDF Views: 0




  • Methods and Algorithms of Speech Signals Processing and Compression and Their Implementation in Computer Systems

Abstract Views: 283  |  PDF Views: 0

Authors

Fadi Alkalani
Shaqra University, Saudi Arabia
Raed Sahawneh
Irbed National University, Jordan

Abstract


The review and comparative analysis of the methods of compression and recognition of speech signals is carried out. The result of the carried out analysis of the existing recognition methods indicates, that all of them are based on the use of “inflexible” algorithms, which are badly adapted to the characteristic features of speech signals, thus degrading the efficiency of the operation of the whole recognition system. The necessity of the use of algorithms for determination of recognition features along with the use of the wavelet packet analysis as one of the advanced directions of the creation of the effective methods and principles of the development of the speech signals recognition systems is substantiated. Analysis of the compression methods with the use of the orthogonal transformations at the complete exception of minimal decomposition factors is conducted; a maximal possible compression degree is defined. In this compression method the orthogonal transformation of the signal segment with the subsequent exception of the set of the smallest modulo decomposition factors, irrespective of the order of their distribution, is conducted. Therefore the additional transfer of the information on the factors distribution is required. As a result, two information streams appear, the first one corresponds to the information stream on the decomposition factors, and the second stream transfers information on the distribution of these factors. Method of the determination of the speech signals recognition features and the algorithm for nonlinear time normalization is proposed and proved. Wavelet-packet transformation is adaptive, i.e. it allows adapting to the signal features more accurately by means of the choice of the proper tree of the optimal decomposition form, which provides the minimal number of wavelet factors at the prescribed accuracy of signal reconstruction, thus eliminating the information-surplus and unnecessary details of the signals. Estimation of the informativeness of the set of wavelet factors is accomplished by the entropy. In order to obtain the recognition factors, the spectral analysis operation is used. In order to carry out the temporary normalization, the deforming function is found, the use of which minimizes the discrepancy between the standard and new words realization. Dedicated to the determination of admissible compression factors on the basis of the orthogonal transformations use at the incomplete elimination of the set of minimal decomposition factors, to the creation of the block diagram of the method of the recognition features formation, to the practical testing of the software- methods. In order to elevate the compression factor, the adaptive uniform quantization is used, where the adaptation is conducted for all the decomposition factors. The program testing of the recognition methods is carried out by means of determination of the classification error probability using Mahalanobis (Gonzales) distance.

Keywords


Recognition of Speech Signals, Compression, Recognition Features.

References





DOI: https://doi.org/10.13005/ojcst%2F10.04.06