Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform

The identification of protein coding regions (exons) plays a critical role in eukaryotic gene structure prediction. Many techniques have been introduced for discriminating between the exons and the introns in the eukaryotic DNA sequences, such as the discrete Fourier transform (DFT) based techniques...

Full description

Bibliographic Details
Main Authors: Guangchen Liu, Yihui Luan
Format: Article
Language:English
Published: Hindawi Limited 2014-01-01
Series:Abstract and Applied Analysis
Online Access:http://dx.doi.org/10.1155/2014/402567
id doaj-8e6d9677b47a46e198ab0c1651b7d16b
record_format Article
spelling doaj-8e6d9677b47a46e198ab0c1651b7d16b2020-11-25T00:12:39ZengHindawi LimitedAbstract and Applied Analysis1085-33751687-04092014-01-01201410.1155/2014/402567402567Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets TransformGuangchen Liu0Yihui Luan1School of Mathematics, Shandong University, Jinan, Shandong 250100, ChinaSchool of Mathematics, Shandong University, Jinan, Shandong 250100, ChinaThe identification of protein coding regions (exons) plays a critical role in eukaryotic gene structure prediction. Many techniques have been introduced for discriminating between the exons and the introns in the eukaryotic DNA sequences, such as the discrete Fourier transform (DFT) based techniques, but these DFT-based methods rapidly lose their effectiveness in the case of short DNA sequences. In this paper, a novel integrated algorithm based on autoregressive spectrum analysis and wavelet packets transform is presented to improve the efficiency and accuracy of the coding regions identification. The experimental results show that the new algorithm outperforms the conventional DFT-based approaches in improving the prediction accuracy of protein coding regions distinctly by testing GENSCAN65, HMR195, and BG570 benchmark datasets.http://dx.doi.org/10.1155/2014/402567
collection DOAJ
language English
format Article
sources DOAJ
author Guangchen Liu
Yihui Luan
spellingShingle Guangchen Liu
Yihui Luan
Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform
Abstract and Applied Analysis
author_facet Guangchen Liu
Yihui Luan
author_sort Guangchen Liu
title Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform
title_short Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform
title_full Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform
title_fullStr Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform
title_full_unstemmed Identification of Protein Coding Regions in the Eukaryotic DNA Sequences Based on Marple Algorithm and Wavelet Packets Transform
title_sort identification of protein coding regions in the eukaryotic dna sequences based on marple algorithm and wavelet packets transform
publisher Hindawi Limited
series Abstract and Applied Analysis
issn 1085-3375
1687-0409
publishDate 2014-01-01
description The identification of protein coding regions (exons) plays a critical role in eukaryotic gene structure prediction. Many techniques have been introduced for discriminating between the exons and the introns in the eukaryotic DNA sequences, such as the discrete Fourier transform (DFT) based techniques, but these DFT-based methods rapidly lose their effectiveness in the case of short DNA sequences. In this paper, a novel integrated algorithm based on autoregressive spectrum analysis and wavelet packets transform is presented to improve the efficiency and accuracy of the coding regions identification. The experimental results show that the new algorithm outperforms the conventional DFT-based approaches in improving the prediction accuracy of protein coding regions distinctly by testing GENSCAN65, HMR195, and BG570 benchmark datasets.
url http://dx.doi.org/10.1155/2014/402567
work_keys_str_mv AT guangchenliu identificationofproteincodingregionsintheeukaryoticdnasequencesbasedonmarplealgorithmandwaveletpacketstransform
AT yihuiluan identificationofproteincodingregionsintheeukaryoticdnasequencesbasedonmarplealgorithmandwaveletpacketstransform
_version_ 1725398316898844672