GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.

Lysine succinylation is one of the dominant post-translational modification of the protein that contributes to many biological processes including cell cycle, growth and signal transduction pathways. Identification of succinylation sites is an important step for understanding the function of protein...

Full description

Bibliographic Details
Main Authors: Md Mehedi Hasan, Hiroyuki Kurata
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC6193575?pdf=render
id doaj-3135ffe76dec48a7b5cd459255bf3627
record_format Article
spelling doaj-3135ffe76dec48a7b5cd459255bf36272020-11-24T21:32:47ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-011310e020028310.1371/journal.pone.0200283GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.Md Mehedi HasanHiroyuki KurataLysine succinylation is one of the dominant post-translational modification of the protein that contributes to many biological processes including cell cycle, growth and signal transduction pathways. Identification of succinylation sites is an important step for understanding the function of proteins. The complicated sequence patterns of protein succinylation revealed by proteomic studies highlight the necessity of developing effective species-specific in silico strategies for global prediction succinylation sites. Here we have developed the generic and nine species-specific succinylation site classifiers through aggregating multiple complementary features. We optimized the consecutive features using the Wilcoxon-rank feature selection scheme. The final feature vectors were trained by a random forest (RF) classifier. With an integration of RF scores via logistic regression, the resulting predictor termed GPSuc achieved better performance than other existing generic and species-specific succinylation site predictors. To reveal the mechanism of succinylation and assist hypothesis-driven experimental design, our predictor serves as a valuable resource. To provide a promising performance in large-scale datasets, a web application was developed at http://kurata14.bio.kyutech.ac.jp/GPSuc/.http://europepmc.org/articles/PMC6193575?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Md Mehedi Hasan
Hiroyuki Kurata
spellingShingle Md Mehedi Hasan
Hiroyuki Kurata
GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.
PLoS ONE
author_facet Md Mehedi Hasan
Hiroyuki Kurata
author_sort Md Mehedi Hasan
title GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.
title_short GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.
title_full GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.
title_fullStr GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.
title_full_unstemmed GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.
title_sort gpsuc: global prediction of generic and species-specific succinylation sites by aggregating multiple sequence features.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2018-01-01
description Lysine succinylation is one of the dominant post-translational modification of the protein that contributes to many biological processes including cell cycle, growth and signal transduction pathways. Identification of succinylation sites is an important step for understanding the function of proteins. The complicated sequence patterns of protein succinylation revealed by proteomic studies highlight the necessity of developing effective species-specific in silico strategies for global prediction succinylation sites. Here we have developed the generic and nine species-specific succinylation site classifiers through aggregating multiple complementary features. We optimized the consecutive features using the Wilcoxon-rank feature selection scheme. The final feature vectors were trained by a random forest (RF) classifier. With an integration of RF scores via logistic regression, the resulting predictor termed GPSuc achieved better performance than other existing generic and species-specific succinylation site predictors. To reveal the mechanism of succinylation and assist hypothesis-driven experimental design, our predictor serves as a valuable resource. To provide a promising performance in large-scale datasets, a web application was developed at http://kurata14.bio.kyutech.ac.jp/GPSuc/.
url http://europepmc.org/articles/PMC6193575?pdf=render
work_keys_str_mv AT mdmehedihasan gpsucglobalpredictionofgenericandspeciesspecificsuccinylationsitesbyaggregatingmultiplesequencefeatures
AT hiroyukikurata gpsucglobalpredictionofgenericandspeciesspecificsuccinylationsitesbyaggregatingmultiplesequencefeatures
_version_ 1725956001561051136