Open Access Open Access  Restricted Access Subscription Access

Text Dependent Speaker Identification Using a Bayesian network and Mel Frequency Cepstrum Coefficient


Affiliations
1 Dept. of CSE, BGC Trust University, Bangladesh
 

Speaker identification is a biometric technique. The objective of automatic speaker recognition is to extract, characterize and recognize the information about speaker identity. Speaker Recognition technology has recently been used in large number of commercial areas successfully such as in voice based biometrics; voice controlled appliances, security control for confidential information, remote access to computers and many more interesting areas. A speaker identification system has two phases which are the training phase and the testing phase. Feature extraction is the first step for each phase in speaker recognition. Many algorithms are suggested by the researchers for feature extraction. In this work, the Mel Frequency Cepstrum Coefficient (MFCC) feature has been used for designing a text dependent speaker identification system. While, in the identification phase, the existing reference templates are compared with the unknown voice input. In this thesis, a Bayesian network is used as the training/recognition algorithm which makes the final decision about the specification of the speaker by comparing unknown features to all models in the database and selecting the best matching model. i, e. the highest scored model. The speaker who obtains the highest score is selected as the target speaker.

Keywords

Mel Frequency Cepstrum Coefficient (MFCC), Bayesian Network(BN), Speaker Identification (SI). Graphical Models (GMs), Directed a Cyclic Graph(DAG), Joint Probability Distribution (JPD), Discrete Fourier Transform(DFT).
User
Notifications
Font Size

Abstract Views: 122

PDF Views: 0




  • Text Dependent Speaker Identification Using a Bayesian network and Mel Frequency Cepstrum Coefficient

Abstract Views: 122  |  PDF Views: 0

Authors

Mohd. Manjur Alam
Dept. of CSE, BGC Trust University, Bangladesh
Md. Salah Uddin Chowdury
Dept. of CSE, BGC Trust University, Bangladesh
Niaz Uddin Mahmud
Dept. of CSE, BGC Trust University, Bangladesh
Shamsun Nahar Shoma
Dept. of CSE, BGC Trust University, Bangladesh
Md. Abdul Wahab
Dept. of CSE, BGC Trust University, Bangladesh

Abstract


Speaker identification is a biometric technique. The objective of automatic speaker recognition is to extract, characterize and recognize the information about speaker identity. Speaker Recognition technology has recently been used in large number of commercial areas successfully such as in voice based biometrics; voice controlled appliances, security control for confidential information, remote access to computers and many more interesting areas. A speaker identification system has two phases which are the training phase and the testing phase. Feature extraction is the first step for each phase in speaker recognition. Many algorithms are suggested by the researchers for feature extraction. In this work, the Mel Frequency Cepstrum Coefficient (MFCC) feature has been used for designing a text dependent speaker identification system. While, in the identification phase, the existing reference templates are compared with the unknown voice input. In this thesis, a Bayesian network is used as the training/recognition algorithm which makes the final decision about the specification of the speaker by comparing unknown features to all models in the database and selecting the best matching model. i, e. the highest scored model. The speaker who obtains the highest score is selected as the target speaker.

Keywords


Mel Frequency Cepstrum Coefficient (MFCC), Bayesian Network(BN), Speaker Identification (SI). Graphical Models (GMs), Directed a Cyclic Graph(DAG), Joint Probability Distribution (JPD), Discrete Fourier Transform(DFT).