Estimating Social Background Profiling of Indian Speakers by Acoustic Speech Features

Mohammad Ali Humayun; Hayati Yassin; Pg Emeroylariffion Abas

Estimating Social Background Profiling of Indian Speakers by Acoustic Speech Features

Mohammad Ali Humayun , Hayati Yassin , Pg Emeroylariffion Abas

Affiliations
1 Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Brunei Darussalam

Social background profiling of speakers refers to estimating the geographical origin of speakers by their speech features. Methods for accent profiling that use linguistic features, require phoneme alignment and transcription of the speech samples. This paper proposes a purely acoustic accent profiling model, composed of multiple convolutional networks with global average-pooling layers, to classify the temporal sequence of acoustic features. The bottleneck representations of the convolutional networks, trained with the original signals and their low-pass filtered copies, are fed to a Support Vector Machine classifier for final prediction. The model has been analysed for a speech dataset of Indian speakers from social backgrounds spread across India. It has been shown that up to 85% accuracy is achievable for classifying the geographic origin of speakers corresponding to regional Indian languages; 17% higher than the benchmark deep learning model using the same features. Results have also indicated that classification of accents is easier using the second language of the speakers, as compared to their native language.

Keywords

Accent identification, Low pass filtering, Ensemble learning, Native language identification, Speaker profiling.

I-Scholar

Journal Help

User

Notifications

Journal Content
Browse

Font Size

Information

Estimating Social Background Profiling of Indian Speakers by Acoustic Speech Features

Abstract Views: 488 | PDF Views: 76

Authors

Mohammad Ali Humayun
Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Brunei Darussalam

Hayati Yassin
Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Brunei Darussalam

Pg Emeroylariffion Abas
Faculty of Integrated Technologies, Universiti Brunei Darussalam, Jalan Tungku Link, Brunei Darussalam

Abstract

Keywords

Accent identification, Low pass filtering, Ensemble learning, Native language identification, Speaker profiling.

Journal of Scientific and Industrial Research

Estimating Social Background Profiling of Indian Speakers by Acoustic Speech Features

Keywords

Estimating Social Background Profiling of Indian Speakers by Acoustic Speech Features

Authors

Abstract

Keywords

References

Username
Password
Remember me

Username
Password
Remember me