Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Speech Recognition in Noisy Conditions Using Radon Transform and Discrete Cosine Transform from the Features Derived from Gammatone Filter Bank (GTFB)


Affiliations
1 Department of Instrumentation Engineering, Padmashree Dr. D.Y. Patil Institute of Engineering and Technology, Pimpri, Pune-18, Maharashtra State (M.S.), India
2 S.G.G.S. Institute of Engineering and Technolgy, Nanded-431607, Maharashtra State (M.S.), India
3 Department of Instrumentation Engineering, S.G.G.S. Institute of Engineering and Technology, Nanded-431607, Maharashtra State (M.S.), India
     

   Subscribe/Renew Journal


This paper presents a new feature extraction technique based on a Gammatone Filter Bank (GTFB) for speech recognition using Radon Transform (RT) and Discrete Cosine Transform (DCT). In the proposed scheme speech specific features have been extracted by applying image processing technique to the patterns available from speech signal by applying Gammatone Filter Bank. Radon projections for twenty six orientations are captured. The acoustic characteristics of the Gammatone Filter Bank applied to the speech signal. DCT applied on Radon projections yields low dimensional feature vectors. The technique is computationally efficient and robust to session variations and insensitive to additive noise. The performance of the proposed algorithm is evaluated in presence of additive white Gaussian noise from (30dB to -5dB SNR) on Texas Instruments-46 (TI-46) speech database. The proposed algorithm improves the performance of the speech recognition system in noisy environment compared to the existing popular algorithms like Mel frequency Cepstral Coefficient (MFCC), Linear Predictive Cepstral Coefficients (LPCC), Perceptual Linear Prediction (PLP).

Keywords

Speech Recognition, Gammatone Filters, Feature Extraction, Radon Transform, Discrete Cosine Transform.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 223

PDF Views: 2




  • Speech Recognition in Noisy Conditions Using Radon Transform and Discrete Cosine Transform from the Features Derived from Gammatone Filter Bank (GTFB)

Abstract Views: 223  |  PDF Views: 2

Authors

Yogesh S. Angal
Department of Instrumentation Engineering, Padmashree Dr. D.Y. Patil Institute of Engineering and Technology, Pimpri, Pune-18, Maharashtra State (M.S.), India
Mangesh S. Deshpande
S.G.G.S. Institute of Engineering and Technolgy, Nanded-431607, Maharashtra State (M.S.), India
Raghunath S. Holambe
Department of Instrumentation Engineering, S.G.G.S. Institute of Engineering and Technology, Nanded-431607, Maharashtra State (M.S.), India
Rajan H. Chile
Department of Instrumentation Engineering, S.G.G.S. Institute of Engineering and Technology, Nanded-431607, Maharashtra State (M.S.), India

Abstract


This paper presents a new feature extraction technique based on a Gammatone Filter Bank (GTFB) for speech recognition using Radon Transform (RT) and Discrete Cosine Transform (DCT). In the proposed scheme speech specific features have been extracted by applying image processing technique to the patterns available from speech signal by applying Gammatone Filter Bank. Radon projections for twenty six orientations are captured. The acoustic characteristics of the Gammatone Filter Bank applied to the speech signal. DCT applied on Radon projections yields low dimensional feature vectors. The technique is computationally efficient and robust to session variations and insensitive to additive noise. The performance of the proposed algorithm is evaluated in presence of additive white Gaussian noise from (30dB to -5dB SNR) on Texas Instruments-46 (TI-46) speech database. The proposed algorithm improves the performance of the speech recognition system in noisy environment compared to the existing popular algorithms like Mel frequency Cepstral Coefficient (MFCC), Linear Predictive Cepstral Coefficients (LPCC), Perceptual Linear Prediction (PLP).

Keywords


Speech Recognition, Gammatone Filters, Feature Extraction, Radon Transform, Discrete Cosine Transform.