Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform
The identification of protein coding regions (exons) plays a critical role in eukaryotic gene structure prediction. Many techniques have been introduced for discriminating between exons and introns in eukaryotic DNA sequences, such as the discrete Fourier transform (DFT) based techniques...
Main Authors: | Guangchen Liu, Yihui Luan |
---|---|
Format: | Article |
Language: | English |
Published: | Hindawi Limited 2014-01-01 |
Series: | Abstract and Applied Analysis |
Online Access: | http://dx.doi.org/10.1155/2014/402567 |
id |
doaj-8e6d9677b47a46e198ab0c1651b7d16b |
---|---|
record_format |
Article |
spelling |
doaj-8e6d9677b47a46e198ab0c1651b7d16b 2020-11-25T00:12:39Z eng Hindawi Limited Abstract and Applied Analysis 1085-3375 1687-0409 2014-01-01 2014 10.1155/2014/402567 402567 Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform Guangchen Liu 0 Yihui Luan 1 School of Mathematics, Shandong University, Jinan, Shandong 250100, China School of Mathematics, Shandong University, Jinan, Shandong 250100, China The identification of protein coding regions (exons) plays a critical role in eukaryotic gene structure prediction. Many techniques have been introduced for discriminating between the exons and the introns in the eukaryotic DNA sequences, such as the discrete Fourier transform (DFT) based techniques, but these DFT-based methods rapidly lose their effectiveness in the case of short DNA sequences. In this paper, a novel integrated algorithm based on autoregressive spectrum analysis and wavelet packets transform is presented to improve the efficiency and accuracy of the coding regions identification. The experimental results show that the new algorithm outperforms the conventional DFT-based approaches in improving the prediction accuracy of protein coding regions distinctly by testing GENSCAN65, HMR195, and BG570 benchmark datasets. http://dx.doi.org/10.1155/2014/402567 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Guangchen Liu Yihui Luan |
title |
Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform |
publisher |
Hindawi Limited |
series |
Abstract and Applied Analysis |
issn |
1085-3375 1687-0409 |
publishDate |
2014-01-01 |
description |
The identification of protein coding regions (exons) plays a critical role in eukaryotic gene structure prediction. Many techniques have been introduced for discriminating between exons and introns in eukaryotic DNA sequences, such as those based on the discrete Fourier transform (DFT), but these DFT-based methods rapidly lose their effectiveness on short DNA sequences. In this paper, a novel integrated algorithm based on autoregressive spectrum analysis and the wavelet packets transform is presented to improve the efficiency and accuracy of coding region identification. Experimental results on the GENSCAN65, HMR195, and BG570 benchmark datasets show that the new algorithm distinctly outperforms conventional DFT-based approaches in prediction accuracy for protein coding regions. |
url |
http://dx.doi.org/10.1155/2014/402567 |
_version_ |
1725398316898844672 |
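
For readers unfamiliar with the DFT-based techniques that the abstract uses as its baseline, the following is a minimal sketch of one common form of that idea: coding regions tend to show a period-3 pattern, so the spectral power at frequency 1/3 is computed in a sliding window over binary indicator sequences for A, C, G, and T. This is not the authors' Marple or wavelet packets algorithm, which the record does not describe in detail. The function name `period3_power`, the default window length of 351, and the unweighted sum over the four bases are all assumptions made for illustration.

```python
import numpy as np


def period3_power(seq, window=351, step=1):
    """Sliding-window spectral power at frequency 1/3 (illustrative sketch).

    seq    : DNA string over the alphabet A, C, G, T
    window : window length in bases (assumed value; a multiple of 3)
    step   : shift between consecutive windows
    Returns one power value per window position.
    """
    seq = seq.upper()
    n = len(seq)
    if n < window:
        raise ValueError("sequence is shorter than the analysis window")

    # Binary indicator sequences: row b is 1 where seq has base b, else 0.
    indicators = np.array(
        [[1.0 if ch == base else 0.0 for ch in seq] for base in "ACGT"]
    )

    # Complex exponential that picks out the DFT coefficient at frequency 1/3.
    phase = np.exp(-2j * np.pi * np.arange(window) / 3.0)

    starts = range(0, n - window + 1, step)
    power = np.empty(len(starts))
    for i, s in enumerate(starts):
        # One DFT coefficient per base for this window.
        coeffs = indicators[:, s:s + window] @ phase
        # Total period-3 power: sum of squared magnitudes over the four bases.
        power[i] = np.sum(np.abs(coeffs) ** 2)
    return power


if __name__ == "__main__":
    periodic = "ATG" * 200   # strongly period-3 toy sequence
    flat = "A" * 600         # no period-3 structure
    print(period3_power(periodic)[0], period3_power(flat)[0])
```

On the toy inputs above, the perfectly periodic repeat yields a large value in every window, while the homopolymer yields a value that is zero up to rounding. The dependence on the window length also hints at why such estimates become unreliable for short sequences, which is the limitation the abstract attributes to DFT-based methods.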