
Improving Performance of Multiclass Audio Classification Using SVM


Affiliations
1 Department of Electronics and Telecomm, College of Engineering, Pune, India
2 College of Engineering, Pune, India
3 Electronics and Telecommunication Department, India
     



Audio classification has found widespread use in many emerging applications. It involves extracting vital temporal, spectral and statistical features and using them to build an efficient classifier. Most audio classification work has addressed binary classification. In our work, we suggest the best-suited features for classifying different audio classes. We present an algorithm for audio classification that is capable of segmenting and classifying an audio stream into male speech, female speech, music, noise and silence. The speech clips are further segmented into voiced and unvoiced frames. A number of timbre features that distinguish the different audio formats are discussed. For pre-classification, a threshold-based method using the Probability Density Function (PDF) is applied to each audio clip. For further classification, K-Nearest Neighbor (KNN) and Support Vector Machine (SVM) classifiers are proposed. Experiments were performed to determine the best features for each binary class. Using these features in multiclass classification yielded an accuracy of 96.34% in audio discrimination.
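The pipeline outlined in the abstract (frame-level features, a threshold-based pre-classification step, then a distance-based classifier) can be illustrated with a minimal sketch. This is not the authors' implementation: the two features used here (short-time energy and zero-crossing rate), the threshold values `energy_floor` and `zcr_split`, the synthetic test tones, and all function names are assumptions chosen for illustration, and a plain KNN vote stands in for the KNN and SVM classifiers the paper proposes.

```python
import math
from collections import Counter

def frames(signal, size, hop):
    """Split a list of samples into fixed-size, possibly overlapping frames."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, hop)]

def short_time_energy(frame):
    """Mean squared amplitude of one frame (a temporal feature)."""
    return sum(x * x for x in frame) / len(frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose sign differs (frame needs >= 2 samples)."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def pre_classify(frame, energy_floor=1e-4, zcr_split=0.25):
    """Threshold rule with hypothetical cut-offs: silence, voiced or unvoiced."""
    if short_time_energy(frame) < energy_floor:
        return "silence"
    return "unvoiced" if zero_crossing_rate(frame) > zcr_split else "voiced"

def knn_predict(train, query, k=3):
    """Majority vote among the k training vectors closest to the query.

    train is a list of (feature_vector, label) pairs.
    """
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

def tone(freq_hz, sample_rate=8000, n_samples=400, amplitude=0.5):
    """Synthetic sine tone used only as stand-in test data."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * n / sample_rate)
            for n in range(n_samples)]

# Synthetic stand-ins: digital silence, a low tone, and a high tone.
silent = [0.0] * 400
low_tone = tone(100)
high_tone = tone(3000)
labels = [pre_classify(f) for f in (silent, low_tone, high_tone)]

# Toy two-dimensional feature vectors with placeholder class names.
toy_train = [
    ((0.10, 0.10), "class_a"), ((0.20, 0.10), "class_a"), ((0.15, 0.20), "class_a"),
    ((0.90, 0.80), "class_b"), ((0.80, 0.90), "class_b"), ((0.85, 0.85), "class_b"),
]
toy_prediction = knn_predict(toy_train, (0.12, 0.15))
```

In a real system the thresholds would be derived from the feature distributions of labelled clips rather than fixed by hand, and the feature vectors passed to the classifier would come from actual audio frames rather than toy points.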


Keywords

Audio Feature Extraction, Bayesian Classification, K-Nearest Neighbor, Support Vector Machine.





Authors

Shrinivas P. Mahajan
Department of Electronics and Telecomm, College of Engineering, Pune, India
Jyotsana Sahu
College of Engineering, Pune, India
Mukul S. Sutaone
Electronics and Telecommunication Department, India
V. K. Kokate
College of Engineering, Pune, India
