Applicational Areas of MFCC
Mel-frequency cepstral coefficients (MFCCs) are a common and effective feature representation in signal processing. MFCC extraction is the basic method for extracting features from a voice signal, and the resulting acoustic feature vector is widely used to represent and recognize the voice characteristics of a speaker. MFCCs are the coefficients that collectively represent the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency. In this paper, we study the various applications of MFCCs. The methods briefly studied include Vector Quantization (VQ), K-Nearest Neighbor (KNN), Dynamic Time Warping (DTW), Multi-Layer Perceptron (MLP), Gaussian Mixture Model (GMM), Support Vector Machine (SVM), and Hidden Markov Model (HMM).
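The definition above (a cosine transform of a log power spectrum on a mel scale) can be illustrated with a minimal sketch for a single frame. This is not the paper's implementation: the Hamming window, the 2595/700 mel formula, the triangular filterbank, the filter and coefficient counts, and all function names are assumptions chosen for illustration, and the naive DFT is written for clarity rather than speed.

```python
import math

def hz_to_mel(hz):
    # A commonly used mel-scale formula (assumed here; variants exist).
    return 2595.0 * math.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def power_spectrum(frame):
    """Power spectrum (bins 0..N/2) of one Hamming-windowed frame, naive DFT."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(windowed))
        spectrum.append((re * re + im * im) / n)
    return spectrum

def mel_filterbank(num_filters, n_fft, sample_rate):
    """Triangular filters whose edges are evenly spaced on the mel scale."""
    low, high = hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0)
    mel_points = [low + (high - low) * i / (num_filters + 1)
                  for i in range(num_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * mel_to_hz(m) / sample_rate))
            for m in mel_points]
    filters = []
    for j in range(1, num_filters + 1):
        left, center, right = bins[j - 1], bins[j], bins[j + 1]
        row = [0.0] * (n_fft // 2 + 1)
        for k in range(left, center):      # rising slope
            row[k] = (k - left) / (center - left)
        for k in range(center, right):     # falling slope
            row[k] = (right - k) / (right - center)
        filters.append(row)
    return filters

def dct2(values, num_coeffs):
    """Unnormalized DCT-II, the 'linear cosine transform' step."""
    n = len(values)
    return [sum(v * math.cos(math.pi * c * (i + 0.5) / n)
                for i, v in enumerate(values))
            for c in range(num_coeffs)]

def mfcc_frame(frame, sample_rate, num_filters=20, num_coeffs=13):
    """MFCCs of one frame: power spectrum -> mel energies -> log -> DCT."""
    spec = power_spectrum(frame)
    bank = mel_filterbank(num_filters, len(frame), sample_rate)
    # Floor the energies so the logarithm is always defined.
    log_energies = [math.log(max(sum(w * p for w, p in zip(row, spec)), 1e-10))
                    for row in bank]
    return dct2(log_energies, num_coeffs)

# Usage: one 256-sample frame of a 440 Hz tone sampled at 8 kHz (synthetic input).
sample_rate = 8000
frame = [math.sin(2 * math.pi * 440 * i / sample_rate) for i in range(256)]
coeffs = mfcc_frame(frame, sample_rate)
print(len(coeffs))
```

In practice a signal is split into overlapping frames and this computation is repeated per frame; the resulting sequence of coefficient vectors is what classifiers such as VQ, KNN, DTW, GMM, SVM, or HMM would consume.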