Global sequence features based translation initiation site prediction in human genomic sequences

Gene prediction has been increasingly important in genome annotation due to advancements in sequencing technology. Genome annotation further helps in determining the structure and function of these genes. Translation initiation site prediction (TIS) in human genomic sequences is one of the fundament...

Full description

Bibliographic Details
Main Authors: Neelam Goel, Shailendra Singh, Trilok Chand Aseri
Format: Article
Language:English
Published: Elsevier 2020-09-01
Series:Heliyon
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2405844020316686
id doaj-4e8a8083422b42f0b69972806d580898
record_format Article
spelling doaj-4e8a8083422b42f0b69972806d5808982020-11-25T01:19:18ZengElsevierHeliyon2405-84402020-09-0169e04825Global sequence features based translation initiation site prediction in human genomic sequencesNeelam Goel0Shailendra Singh1Trilok Chand Aseri2Department of Information Technology, University Institute of Engineering and Technology, Sector-25, Panjab University, Chandigarh 160014, India; Corresponding author.Department of Computer Science and Engineering, Punjab Engineering College (Deemed to be University), Sector-12, Chandigarh 160012, IndiaDepartment of Computer Science and Engineering, Punjab Engineering College (Deemed to be University), Sector-12, Chandigarh 160012, IndiaGene prediction has been increasingly important in genome annotation due to advancements in sequencing technology. Genome annotation further helps in determining the structure and function of these genes. Translation initiation site prediction (TIS) in human genomic sequences is one of the fundamental and essential steps in gene prediction. Thus, accurate prediction of TIS in these sequences is highly desirable. Although many computational methods were developed for this problem, none of them focused on finding these sites in human genomic sequences. In this paper, a new TIS prediction method is proposed by incorporating global sequence based features. Support vector machine is used to assess the prediction power of these features. The proposed method achieved accuracy of above 90% when tested for genomic as well as cDNA sequences. The experimental results indicate that the method works well for both genomic and cDNA sequences. The method can be integrated into gene prediction system in future.http://www.sciencedirect.com/science/article/pii/S2405844020316686Computer scienceGene predictioncDNAmRNAGenomic sequenceSupport vector machine
collection DOAJ
language English
format Article
sources DOAJ
author Neelam Goel
Shailendra Singh
Trilok Chand Aseri
spellingShingle Neelam Goel
Shailendra Singh
Trilok Chand Aseri
Global sequence features based translation initiation site prediction in human genomic sequences
Heliyon
Computer science
Gene prediction
cDNA
mRNA
Genomic sequence
Support vector machine
author_facet Neelam Goel
Shailendra Singh
Trilok Chand Aseri
author_sort Neelam Goel
title Global sequence features based translation initiation site prediction in human genomic sequences
title_short Global sequence features based translation initiation site prediction in human genomic sequences
title_full Global sequence features based translation initiation site prediction in human genomic sequences
title_fullStr Global sequence features based translation initiation site prediction in human genomic sequences
title_full_unstemmed Global sequence features based translation initiation site prediction in human genomic sequences
title_sort global sequence features based translation initiation site prediction in human genomic sequences
publisher Elsevier
series Heliyon
issn 2405-8440
publishDate 2020-09-01
description Gene prediction has been increasingly important in genome annotation due to advancements in sequencing technology. Genome annotation further helps in determining the structure and function of these genes. Translation initiation site prediction (TIS) in human genomic sequences is one of the fundamental and essential steps in gene prediction. Thus, accurate prediction of TIS in these sequences is highly desirable. Although many computational methods were developed for this problem, none of them focused on finding these sites in human genomic sequences. In this paper, a new TIS prediction method is proposed by incorporating global sequence based features. Support vector machine is used to assess the prediction power of these features. The proposed method achieved accuracy of above 90% when tested for genomic as well as cDNA sequences. The experimental results indicate that the method works well for both genomic and cDNA sequences. The method can be integrated into gene prediction system in future.
topic Computer science
Gene prediction
cDNA
mRNA
Genomic sequence
Support vector machine
url http://www.sciencedirect.com/science/article/pii/S2405844020316686
work_keys_str_mv AT neelamgoel globalsequencefeaturesbasedtranslationinitiationsitepredictioninhumangenomicsequences
AT shailendrasingh globalsequencefeaturesbasedtranslationinitiationsitepredictioninhumangenomicsequences
AT trilokchandaseri globalsequencefeaturesbasedtranslationinitiationsitepredictioninhumangenomicsequences
_version_ 1725139008106790912