Open Access Open Access  Restricted Access Subscription Access

Phoneme Segmentation of Tamil Speech Signals Using Spectral Transition Measure


Affiliations
1 Department of Computer Science, D.J. Academy for Managerial Excellence, Coimbatore, 641 032, India
2 Department of Information Technology, Bharathiar University, India
 

Process of identifying the end points of the acoustic units of the speech signal is called speech segmentation.  Speech recognition systems can be designed using sub-word unit like phoneme. A Phoneme is the smallest unit of the language. It is context dependent and tedious to find the boundary.  Automated phoneme segmentation is carried in researches using Short term Energy, Convex hull, Formant, Spectral Transition Measure(STM), Group Delay Functions, Bayesian Information Criterion, etc.  In this research work, STM is used to find the phoneme boundary of Tamil speech utterances.  Tamil spoken word dataset was prepared with 30 words uttered by 4 native speakers with a high quality microphone. The performance of the segmentation is analysed and results are presented.

Keywords

Speech Recognition, Speech Segmentation, Spectral Transition Measure (STM), Phoneme Segmentation.
User
Notifications
Font Size

  • Gomez, Jon Ander, and María José Castro, Automatic segmentation of speech at the phonetic level. Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR). Springer, Berlin Heidelberg, 2002.
  • R. Rabiner, and B. H. Juang, Fundamentals of Speech Recognition, (Prentice-Hall International, 1993).
  • Thangarajan, R.,Natarajan, A.M. and Selvam, M., Word and Triphone Based Approach in Continuous Speech Recognition for Tamil Language, WSEAS Transaction on Signal Procesing,ISSN:1790-5022,4(3) , pp 76-85, 2008.
  • Qiao, Yu, and Nobuaki Minematsu, Metric learning for unsupervised phoneme segmentation, INTERSPEECH, 2008.
  • Dusan, Sorin, and Lawrence R. Rabiner,On integrating insights from human speech perception into automatic speech recognition, INTERSPEEC, 2005.
  • Sharma, Manish, and Richard Mammone. “Blind” speech segmentation: automatic segmentation of speech without linguistic knowledge.” Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on. 2. IEEE, 1996.
  • Scharenborg, Odette, Vincent Wan, and Mirjam Ernestus, Unsupervised speech segmentation: An analysis of the hypothesized phone boundaries, The Journal of the Acoustical Society of America, 127(2): 1084-1095, (2010).
  • B. Zio³ko, S. Manandhar, and R. C. Wilson, Phoneme segmentation of speech, In ´ Proceedings of the 18th International Conference on Pattern Recognition (ICPR’06), 4, pages 282–285, 2006.
  • Zió³ko, Bartosz, et al, Phoneme segmentation based on wavelet spectra analysis, Archives of Acoustics, 36(1): 29-47, (2011).
  • Kuo, Jen-Wei, Hung-Yi Lo, and Hsin-Min Wang, Improved HMM/SVM methods for automatic phoneme segmentation, INTERSPEECH, 2007.
  • Sarma, Mousmita, and Kandarpa Kumar Sarma. “Segmentation and classification of vowel phonemes of assamese speech using a hybrid neural framework.” Applied Computational Intelligence and Soft Computing 2012: 28, (2012).
  • Almpanidis, George, and Constantine Kotropoulos, Phonemic segmentation using the generalised Gamma distribution and small sample Bayesian information criterion, Speech Communication 50(1), 38-55, (2008).
  • Qiao, Yu, Dean Luo, and Nobuaki Minematsu, A study on unsupervised phoneme segmentation and its application to automatic evaluation of shadowed utterances, Technical report, 2012.
  • Dusan, Sorin, and Lawrence R. Rabiner, On the relation between maximum spectral transition positions and phone boundaries, INTERSPEECH. 2006.

Abstract Views: 317

PDF Views: 0




  • Phoneme Segmentation of Tamil Speech Signals Using Spectral Transition Measure

Abstract Views: 317  |  PDF Views: 0

Authors

K. Geetha
Department of Computer Science, D.J. Academy for Managerial Excellence, Coimbatore, 641 032, India
R. Vadivel
Department of Information Technology, Bharathiar University, India

Abstract


Process of identifying the end points of the acoustic units of the speech signal is called speech segmentation.  Speech recognition systems can be designed using sub-word unit like phoneme. A Phoneme is the smallest unit of the language. It is context dependent and tedious to find the boundary.  Automated phoneme segmentation is carried in researches using Short term Energy, Convex hull, Formant, Spectral Transition Measure(STM), Group Delay Functions, Bayesian Information Criterion, etc.  In this research work, STM is used to find the phoneme boundary of Tamil speech utterances.  Tamil spoken word dataset was prepared with 30 words uttered by 4 native speakers with a high quality microphone. The performance of the segmentation is analysed and results are presented.

Keywords


Speech Recognition, Speech Segmentation, Spectral Transition Measure (STM), Phoneme Segmentation.

References





DOI: https://doi.org/10.13005/ojcst%2F10.01.15