Open Access Open Access  Restricted Access Subscription Access

Design of a New Deterministic Algorithm for Finding Common DNA Subsequence


Affiliations
1 Department of Computer Science and Engineering, University Calcutta, India
 

Computational methods have become especially important since the advent of genome projects, whose objective is to decode the entire DNA sequence. Sequence motifs are short, recurring patterns in DNA that are presumed to have a biological function. These motifs are often responsible for similarity or dissimilarity in biological features and their DNA patterns. In this paper, we start with a database containing the set of DNA patterns. Our aim is to search motifs that occur in the same order where as other characters or gaps might occur between the motifs. We generalize it as a common sub-sequence problem from the computational aspect. The exponential nature of the problem is ischolar_mained in its definition in the sense that the solution set itself is expected to be exponential in size. In this work, a new deterministic subsequence matching algorithm for a set of DNA strings has been proposed from the computational perspective. The proposed deterministic algorithm yields the exhaustive set of common sub-sequences that many of its commercial counterparts cannot guarantee.

Keywords

Motif Identification, Sub Sequence, Genome, Dynamic Programming, NP Hard Problem.
User
Notifications
Font Size

Abstract Views: 387

PDF Views: 166




  • Design of a New Deterministic Algorithm for Finding Common DNA Subsequence

Abstract Views: 387  |  PDF Views: 166

Authors

Bigyan Bhar
Department of Computer Science and Engineering, University Calcutta, India
Nabendu Chaki
Department of Computer Science and Engineering, University Calcutta, India

Abstract


Computational methods have become especially important since the advent of genome projects, whose objective is to decode the entire DNA sequence. Sequence motifs are short, recurring patterns in DNA that are presumed to have a biological function. These motifs are often responsible for similarity or dissimilarity in biological features and their DNA patterns. In this paper, we start with a database containing the set of DNA patterns. Our aim is to search motifs that occur in the same order where as other characters or gaps might occur between the motifs. We generalize it as a common sub-sequence problem from the computational aspect. The exponential nature of the problem is ischolar_mained in its definition in the sense that the solution set itself is expected to be exponential in size. In this work, a new deterministic subsequence matching algorithm for a set of DNA strings has been proposed from the computational perspective. The proposed deterministic algorithm yields the exhaustive set of common sub-sequences that many of its commercial counterparts cannot guarantee.

Keywords


Motif Identification, Sub Sequence, Genome, Dynamic Programming, NP Hard Problem.