Global sequence features based translation initiation site prediction in human genomic sequences
Gene prediction has been increasingly important in genome annotation due to advancements in sequencing technology. Genome annotation further helps in determining the structure and function of these genes. Translation initiation site prediction (TIS) in human genomic sequences is one of the fundament...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2020-09-01
|
Series: | Heliyon |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2405844020316686 |
id |
doaj-4e8a8083422b42f0b69972806d580898 |
---|---|
record_format |
Article |
spelling |
doaj-4e8a8083422b42f0b69972806d5808982020-11-25T01:19:18ZengElsevierHeliyon2405-84402020-09-0169e04825Global sequence features based translation initiation site prediction in human genomic sequencesNeelam Goel0Shailendra Singh1Trilok Chand Aseri2Department of Information Technology, University Institute of Engineering and Technology, Sector-25, Panjab University, Chandigarh 160014, India; Corresponding author.Department of Computer Science and Engineering, Punjab Engineering College (Deemed to be University), Sector-12, Chandigarh 160012, IndiaDepartment of Computer Science and Engineering, Punjab Engineering College (Deemed to be University), Sector-12, Chandigarh 160012, IndiaGene prediction has been increasingly important in genome annotation due to advancements in sequencing technology. Genome annotation further helps in determining the structure and function of these genes. Translation initiation site prediction (TIS) in human genomic sequences is one of the fundamental and essential steps in gene prediction. Thus, accurate prediction of TIS in these sequences is highly desirable. Although many computational methods were developed for this problem, none of them focused on finding these sites in human genomic sequences. In this paper, a new TIS prediction method is proposed by incorporating global sequence based features. Support vector machine is used to assess the prediction power of these features. The proposed method achieved accuracy of above 90% when tested for genomic as well as cDNA sequences. The experimental results indicate that the method works well for both genomic and cDNA sequences. The method can be integrated into gene prediction system in future.http://www.sciencedirect.com/science/article/pii/S2405844020316686Computer scienceGene predictioncDNAmRNAGenomic sequenceSupport vector machine |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Neelam Goel Shailendra Singh Trilok Chand Aseri |
spellingShingle |
Neelam Goel Shailendra Singh Trilok Chand Aseri Global sequence features based translation initiation site prediction in human genomic sequences Heliyon Computer science Gene prediction cDNA mRNA Genomic sequence Support vector machine |
author_facet |
Neelam Goel Shailendra Singh Trilok Chand Aseri |
author_sort |
Neelam Goel |
title |
Global sequence features based translation initiation site prediction in human genomic sequences |
title_short |
Global sequence features based translation initiation site prediction in human genomic sequences |
title_full |
Global sequence features based translation initiation site prediction in human genomic sequences |
title_fullStr |
Global sequence features based translation initiation site prediction in human genomic sequences |
title_full_unstemmed |
Global sequence features based translation initiation site prediction in human genomic sequences |
title_sort |
global sequence features based translation initiation site prediction in human genomic sequences |
publisher |
Elsevier |
series |
Heliyon |
issn |
2405-8440 |
publishDate |
2020-09-01 |
description |
Gene prediction has been increasingly important in genome annotation due to advancements in sequencing technology. Genome annotation further helps in determining the structure and function of these genes. Translation initiation site prediction (TIS) in human genomic sequences is one of the fundamental and essential steps in gene prediction. Thus, accurate prediction of TIS in these sequences is highly desirable. Although many computational methods were developed for this problem, none of them focused on finding these sites in human genomic sequences. In this paper, a new TIS prediction method is proposed by incorporating global sequence based features. Support vector machine is used to assess the prediction power of these features. The proposed method achieved accuracy of above 90% when tested for genomic as well as cDNA sequences. The experimental results indicate that the method works well for both genomic and cDNA sequences. The method can be integrated into gene prediction system in future. |
topic |
Computer science Gene prediction cDNA mRNA Genomic sequence Support vector machine |
url |
http://www.sciencedirect.com/science/article/pii/S2405844020316686 |
work_keys_str_mv |
AT neelamgoel globalsequencefeaturesbasedtranslationinitiationsitepredictioninhumangenomicsequences AT shailendrasingh globalsequencefeaturesbasedtranslationinitiationsitepredictioninhumangenomicsequences AT trilokchandaseri globalsequencefeaturesbasedtranslationinitiationsitepredictioninhumangenomicsequences |
_version_ |
1725139008106790912 |