
Real-Time Speech Emotion Recognition Using Support Vector Machine


Affiliations
1 B.S. Abdur Rahman University, Chennai, Tamil Nadu, India
     



This paper presents an approach to real-time emotion recognition from speech that uses a Support Vector Machine (SVM) as the classification technique. Automatic Speech Emotion Recognition (ASER) is an emerging research area in the field of Human Computer Interaction Intelligence (HCII). Human emotions can be detected from speech signals by extracting acoustic and prosodic features such as pitch, Mel Frequency Cepstral Coefficients (MFCC), and Mel Energy Spectrum Dynamic Coefficients (MEDC). The SVM classifier is used to classify the emotions as anger, fear, neutral, sad, disgust, happy, and boredom. The UGA and LDC datasets are used for offline analysis of emotions with LIBSVM kernel functions. Based on this analysis, the machine is trained and designed to detect emotions in real-time speech.

Keywords

Support Vector Machine, Speech Signal, Experimentation, Emotion Analysis, Controller (PDC).
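
As an illustration of the mel-frequency warping that underlies MFCC features, the following is a minimal sketch, not the authors' implementation. It assumes the commonly used conversion mel = 2595 log10(1 + f/700); the function names, the filter count of 20, and the 0 to 4000 Hz range are hypothetical choices, since the abstract does not state the parameters used.

```python
import math
from typing import List


def hz_to_mel(freq_hz: float) -> float:
    """Convert a frequency in Hz to mels (assumed formula: 2595*log10(1 + f/700))."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel: convert mels back to Hz."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_center_frequencies(n_filters: int, f_min: float, f_max: float) -> List[float]:
    """Centre frequencies (Hz) of n_filters filters equally spaced on the mel scale.

    The band edges f_min and f_max are excluded, so the n_filters centres sit
    at the interior points of an even mel-scale grid with n_filters + 2 points.
    """
    mel_min = hz_to_mel(f_min)
    mel_max = hz_to_mel(f_max)
    step = (mel_max - mel_min) / (n_filters + 1)
    return [mel_to_hz(mel_min + step * (i + 1)) for i in range(n_filters)]


if __name__ == "__main__":
    # Hypothetical settings: 20 filters covering 0 to 4000 Hz.
    centers = mel_center_frequencies(n_filters=20, f_min=0.0, f_max=4000.0)
    print([round(c) for c in centers])
```

The centres are evenly spaced in mels but increasingly far apart in Hz, which is why mel-based features resolve low frequencies more finely than high ones.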




Authors

P. Vijayalakshmi
B.S. Abdur Rahman University, Chennai, Tamil Nadu, India
A. Anny Leema
B.S. Abdur Rahman University, Chennai, Tamil Nadu, India
