TESTLoc: protein subcellular localization prediction from EST data

<p>Abstract</p> <p>Background</p> <p>The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how bio-processes are organized in di...

Full description

Bibliographic Details
Main Authors: Burger Gertraud, Shen Yao-Qing
Format: Article
Language:English
Published: BMC 2010-11-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/11/563
id doaj-a90e5ec328cb426b8f1438153eab3dcf
record_format Article
spelling doaj-a90e5ec328cb426b8f1438153eab3dcf2020-11-25T01:56:40ZengBMCBMC Bioinformatics1471-21052010-11-0111156310.1186/1471-2105-11-563TESTLoc: protein subcellular localization prediction from EST dataBurger GertraudShen Yao-Qing<p>Abstract</p> <p>Background</p> <p>The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how bio-processes are organized in different cellular compartments, but also contributes to unravelling the function of individual proteins. Computational localization prediction is possible based on sequence information alone, and has been successfully applied to proteins from virtually all subcellular compartments and all domains of life. However, we realized that current prediction tools do not perform well on partial protein sequences such as those inferred from Expressed Sequence Tag (EST) data, limiting the exploitation of the large and taxonomically most comprehensive body of sequence information from eukaryotes.</p> <p>Results</p> <p>We developed a new predictor, TESTLoc, suited for subcellular localization prediction of proteins based on their partial sequence conceptually translated from ESTs (EST-peptides). Support Vector Machine (SVM) is used as computational method and EST-peptides are represented by different features such as amino acid composition and physicochemical properties. When TESTLoc was applied to the most challenging test case (plant data), it yielded high accuracy (~85%).</p> <p>Conclusions</p> <p>TESTLoc is a localization prediction tool tailored for EST data. It provides a variety of models for the users to choose from, and is available for download at <url>http://megasun.bch.umontreal.ca/~shenyq/TESTLoc/TESTLoc.html</url></p> http://www.biomedcentral.com/1471-2105/11/563
collection DOAJ
language English
format Article
sources DOAJ
author Burger Gertraud
Shen Yao-Qing
spellingShingle Burger Gertraud
Shen Yao-Qing
TESTLoc: protein subcellular localization prediction from EST data
BMC Bioinformatics
author_facet Burger Gertraud
Shen Yao-Qing
author_sort Burger Gertraud
title TESTLoc: protein subcellular localization prediction from EST data
title_short TESTLoc: protein subcellular localization prediction from EST data
title_full TESTLoc: protein subcellular localization prediction from EST data
title_fullStr TESTLoc: protein subcellular localization prediction from EST data
title_full_unstemmed TESTLoc: protein subcellular localization prediction from EST data
title_sort testloc: protein subcellular localization prediction from est data
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2010-11-01
description <p>Abstract</p> <p>Background</p> <p>The eukaryotic cell has an intricate architecture with compartments and substructures dedicated to particular biological processes. Knowing the subcellular location of proteins not only indicates how bio-processes are organized in different cellular compartments, but also contributes to unravelling the function of individual proteins. Computational localization prediction is possible based on sequence information alone, and has been successfully applied to proteins from virtually all subcellular compartments and all domains of life. However, we realized that current prediction tools do not perform well on partial protein sequences such as those inferred from Expressed Sequence Tag (EST) data, limiting the exploitation of the large and taxonomically most comprehensive body of sequence information from eukaryotes.</p> <p>Results</p> <p>We developed a new predictor, TESTLoc, suited for subcellular localization prediction of proteins based on their partial sequence conceptually translated from ESTs (EST-peptides). Support Vector Machine (SVM) is used as computational method and EST-peptides are represented by different features such as amino acid composition and physicochemical properties. When TESTLoc was applied to the most challenging test case (plant data), it yielded high accuracy (~85%).</p> <p>Conclusions</p> <p>TESTLoc is a localization prediction tool tailored for EST data. It provides a variety of models for the users to choose from, and is available for download at <url>http://megasun.bch.umontreal.ca/~shenyq/TESTLoc/TESTLoc.html</url></p>
url http://www.biomedcentral.com/1471-2105/11/563
work_keys_str_mv AT burgergertraud testlocproteinsubcellularlocalizationpredictionfromestdata
AT shenyaoqing testlocproteinsubcellularlocalizationpredictionfromestdata
_version_ 1724978656047005696