Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

An Enhanced Method for Period-3 Based Exon and Gene Prediction


Affiliations
1 H.B.T.I., Kanpur, India
2 PEC, Chandigarh, India
     

   Subscribe/Renew Journal


Identification of gene locations in a DNA sequence is one of the important problems in the area of genomics. Nucleotides in exons of a DNA sequence show f=1/3 periodicity. The period-3 property in exons of eukaryotic gene sequences enables signal processing based time-domain and frequency domain methods to predict these regions. Identification of the period-3 regions helps in predicting the gene locations within the billions long DNA sequence of eukaryotic cells. In this paper the DNA symbolic-to-numeric representations are presented and the existing methods of gene prediction are also discussed. Finally, an enhancement over the existing methods has been proposed that combines the features of two best existing computationally efficient methods namely, AMDF (Average Magnitude difference function) and the optimized method. The proposed method improves upon the existing methods in terms of gene prediction accuracy.

Keywords

AMDF, Binary Indicator Sequence, Complex Indicator Sequence, DFT, DNA, EIIP Indicator Sequence, Gene Prediction, Paired Numeric, Period-3, Protein Coding, Roc.
User
Subscription Login to verify subscription
Notifications
Font Size

Abstract Views: 232

PDF Views: 2




  • An Enhanced Method for Period-3 Based Exon and Gene Prediction

Abstract Views: 232  |  PDF Views: 2

Authors

Anshu Vishnoi
H.B.T.I., Kanpur, India
Neelam Rup Prakash
PEC, Chandigarh, India

Abstract


Identification of gene locations in a DNA sequence is one of the important problems in the area of genomics. Nucleotides in exons of a DNA sequence show f=1/3 periodicity. The period-3 property in exons of eukaryotic gene sequences enables signal processing based time-domain and frequency domain methods to predict these regions. Identification of the period-3 regions helps in predicting the gene locations within the billions long DNA sequence of eukaryotic cells. In this paper the DNA symbolic-to-numeric representations are presented and the existing methods of gene prediction are also discussed. Finally, an enhancement over the existing methods has been proposed that combines the features of two best existing computationally efficient methods namely, AMDF (Average Magnitude difference function) and the optimized method. The proposed method improves upon the existing methods in terms of gene prediction accuracy.

Keywords


AMDF, Binary Indicator Sequence, Complex Indicator Sequence, DFT, DNA, EIIP Indicator Sequence, Gene Prediction, Paired Numeric, Period-3, Protein Coding, Roc.