Protein coding identification using modified gabor wavelet transform on multicore system
The gene identification problem, which identifies the protein-coding regions (exons) in DNA sequences through computational means, is of great importance nowadays. A DNA sequence can be divided into genes and intergenic spaces. In eukaryotic genes, these regions can be divided into two sub-regions c...
Main Author: | |
---|---|
Format: | Book Chapter |
Language: | English |
Published: |
IIUM Press
2011
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/21667/ http://irep.iium.edu.my/21667/ http://irep.iium.edu.my/21667/1/Chapter_34.pdf |
Summary: | The gene identification problem, which identifies the protein-coding regions (exons) in DNA sequences through computational means, is of great importance nowadays. A DNA sequence can be divided into genes and intergenic spaces. In eukaryotic genes, these regions can be divided into two sub-regions called coding regions (exons) and non-coding regions (introns). The intergenic and intronic regions make up most of the genome. For example, in the human genome, the exonic fraction is as low as 2%. It is well known that protein-coding regions of DNA sequences tend to exhibit a period-3 pattern because of the codon structure involved in
the translation of base sequences into amino acids [1-4]. Many researchers have regarded the period-3 property to be a good indicator ofgene location. |
---|