The Data Mining Approaches for Multi-Class Protein Fold Recognition


Affiliations
1 Department of Computer Science and Engineering, Rungta College of Engineering and Technology Bhilai, Chhattisgarh, India
     



Computational analysis of the biological data obtained from genome sequencing and other projects is essential for understanding cellular function and for discovering new drugs and therapies. Data mining has become an important tool for researchers in various fields, including bioinformatics. Protein fold recognition is an important approach to structure discovery in bioinformatics. This paper studies protein fold recognition methods. Supervised learning methods from data mining are applied and tested for multi-class protein fold recognition. Accuracy is measured with several statistical parameters, and the results are reported. In the cross-validation test, the Bayesian Network classifier performed better than the other methods. On the independent test data, the Bayesian Network and the Multi-Layer Perceptron were reasonably comparable, with similar accuracy. It was also observed that the one-versus-others and all-versus-all mechanisms each improve accuracy.
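
As a rough illustration of the two multi-class mechanisms named in the abstract, the following sketch shows how a K-class problem can be decomposed into binary tasks and how the binary outputs can be combined. It is a minimal sketch, not the authors' implementation: the function names, the example class labels, and the decision rules (highest score, majority vote) are assumptions made here for illustration.

```python
from collections import Counter
from itertools import combinations


def one_versus_others_tasks(classes):
    """One binary task per class: that class against all remaining classes."""
    return [(c, tuple(o for o in classes if o != c)) for c in classes]


def all_versus_all_tasks(classes):
    """One binary task per unordered pair of classes."""
    return list(combinations(classes, 2))


def predict_one_versus_others(scores):
    """Return the class whose binary classifier reports the highest score.

    scores: dict mapping class label -> confidence for that class
    (assumed to be produced by K separately trained binary classifiers).
    """
    return max(scores, key=scores.get)


def predict_all_versus_all(pair_winners):
    """Return the class that wins the most pairwise contests.

    pair_winners: dict mapping (class_a, class_b) -> winning class label
    (assumed to be produced by K*(K-1)/2 pairwise classifiers).
    """
    votes = Counter(pair_winners.values())
    return votes.most_common(1)[0][0]


# Hypothetical class labels, used only to show the task counts.
folds = ["fold_A", "fold_B", "fold_C", "fold_D"]
print(len(one_versus_others_tasks(folds)))  # K binary tasks
print(len(all_versus_all_tasks(folds)))     # K*(K-1)/2 binary tasks
```

With K classes, the one-versus-others scheme trains K binary classifiers, while the all-versus-all scheme trains K(K-1)/2, so the second grows much faster as the number of folds increases.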


Keywords

Protein Structure Recognition, Bioinformatics, Data Mining, Supervised Learning.

Authors

Lokesh K. Sharma
Department of Computer Science and Engineering, Rungta College of Engineering and Technology Bhilai, Chhattisgarh, India
Sourabh Rungta
Department of Computer Science and Engineering, Rungta College of Engineering and Technology Bhilai, Chhattisgarh, India
