Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

Support Vector Machine Approach for Isomerases Prediction Problem


Affiliations
1 Maulana Azad National Institute of Technology, Bhopal, MP 462051, India
     

   Subscribe/Renew Journal


As the proteinic enzyme sequences are entering the databases at a prodigious rate, the functional annotation of these sequences has become a major challenge in the field of Bioinformatics. The dispersion in the data makes this task even tougher. The authors illustrate in this paper a simple yet efficient way for functionally characterizing a novel enzyme by the application of support vector machines. The best accuracy gained by this method on generalization test is 91.55% with Mathew's Correlation Coefficient (MCC) of 0.63. The method was further validated by three different types of testing. The resulting accuracy for the LOO estimate was found to be 91.05% with MCC of 0.62 henceforth resolving any over fitting of data that may be present in the instance sets.

Keywords

Isomerases, Support Vector Machine (SVM), Leave-One-Out Estimates, Amino Acid Composition.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 204

PDF Views: 4




  • Support Vector Machine Approach for Isomerases Prediction Problem

Abstract Views: 204  |  PDF Views: 4

Authors

Lavanya Rishishwar
Maulana Azad National Institute of Technology, Bhopal, MP 462051, India
Neha Mishra
Maulana Azad National Institute of Technology, Bhopal, MP 462051, India
Bhasker Pant
Maulana Azad National Institute of Technology, Bhopal, MP 462051, India
Kumud Pant
Maulana Azad National Institute of Technology, Bhopal, MP 462051, India
Kamal R. Pardasani
Maulana Azad National Institute of Technology, Bhopal, MP 462051, India

Abstract


As the proteinic enzyme sequences are entering the databases at a prodigious rate, the functional annotation of these sequences has become a major challenge in the field of Bioinformatics. The dispersion in the data makes this task even tougher. The authors illustrate in this paper a simple yet efficient way for functionally characterizing a novel enzyme by the application of support vector machines. The best accuracy gained by this method on generalization test is 91.55% with Mathew's Correlation Coefficient (MCC) of 0.63. The method was further validated by three different types of testing. The resulting accuracy for the LOO estimate was found to be 91.05% with MCC of 0.62 henceforth resolving any over fitting of data that may be present in the instance sets.

Keywords


Isomerases, Support Vector Machine (SVM), Leave-One-Out Estimates, Amino Acid Composition.