Diphone Speech Synthesis System for Arabic Using MARY TTS


Affiliations
1 Department of Computer Science, Mansoura University, Egypt
2 Department of Information Systems, Mansoura University, Egypt
 

Concatenative speech synthesis systems generate speech by concatenating small prerecorded speech units stored in a speech unit inventory. The most commonly used unit is the diphone, which starts at the middle of one phone and extends to the middle of the following one. Diphones model coarticulation by including the transition to the next phone inside the unit itself. This paper presents a diphone speech synthesis system for the Arabic language built using MARY TTS. The system was evaluated with two tests: the Diagnostic Rhyme Test (DRT), which measures the intelligibility of the synthesized speech, and the Categorical Estimation (CE) test, which measures its overall quality. The results of these tests are presented in the experiments and results section.
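
To make the diphone definition concrete, the following minimal sketch shows how a phone sequence might be mapped to the diphone units that a concatenative synthesizer would look up in its inventory. It is only an illustration under stated assumptions: the function name `phones_to_diphones`, the `_` silence marker, the `left-right` label format, and the example transcription are hypothetical and are not taken from the paper or from MARY TTS.

```python
def phones_to_diphones(phones, silence="_"):
    """Return the diphone labels covering a phone sequence.

    Each diphone spans from the middle of one phone to the middle of the
    next, so an utterance of N phones, padded with silence on both sides,
    needs N + 1 diphones. The silence marker and the label format are
    assumptions made for this sketch.
    """
    padded = [silence] + list(phones) + [silence]
    # Pair every phone with its right-hand neighbour.
    return [f"{left}-{right}" for left, right in zip(padded, padded[1:])]


# Hypothetical broad transcription of the Arabic word "kitaab" (book).
units = phones_to_diphones(["k", "i", "t", "a:", "b"])
print(units)
```

Because every label names the transition between two neighbouring phones, the coarticulation at each phone boundary lives inside a recorded unit, and the joins between units fall in the middle of phones.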

Keywords

Speech Synthesis, Concatenative Synthesis, Diphone Inventory, Natural Language Processing, Markup Language, Digital Signal Processing.
Authors

M. Z. Rashad
Department of Computer Science, Mansoura University, Egypt
Hazem M. El-Bakry
Department of Information Systems, Mansoura University, Egypt
Islam R. Isma'il
Department of Information Systems, Mansoura University, Egypt
