GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.
Lysine succinylation is one of the dominant post-translational modification of the protein that contributes to many biological processes including cell cycle, growth and signal transduction pathways. Identification of succinylation sites is an important step for understanding the function of protein...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2018-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC6193575?pdf=render |
id |
doaj-3135ffe76dec48a7b5cd459255bf3627 |
---|---|
record_format |
Article |
spelling |
doaj-3135ffe76dec48a7b5cd459255bf36272020-11-24T21:32:47ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-011310e020028310.1371/journal.pone.0200283GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features.Md Mehedi HasanHiroyuki KurataLysine succinylation is one of the dominant post-translational modification of the protein that contributes to many biological processes including cell cycle, growth and signal transduction pathways. Identification of succinylation sites is an important step for understanding the function of proteins. The complicated sequence patterns of protein succinylation revealed by proteomic studies highlight the necessity of developing effective species-specific in silico strategies for global prediction succinylation sites. Here we have developed the generic and nine species-specific succinylation site classifiers through aggregating multiple complementary features. We optimized the consecutive features using the Wilcoxon-rank feature selection scheme. The final feature vectors were trained by a random forest (RF) classifier. With an integration of RF scores via logistic regression, the resulting predictor termed GPSuc achieved better performance than other existing generic and species-specific succinylation site predictors. To reveal the mechanism of succinylation and assist hypothesis-driven experimental design, our predictor serves as a valuable resource. To provide a promising performance in large-scale datasets, a web application was developed at http://kurata14.bio.kyutech.ac.jp/GPSuc/.http://europepmc.org/articles/PMC6193575?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Md Mehedi Hasan Hiroyuki Kurata |
spellingShingle |
Md Mehedi Hasan Hiroyuki Kurata GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features. PLoS ONE |
author_facet |
Md Mehedi Hasan Hiroyuki Kurata |
author_sort |
Md Mehedi Hasan |
title |
GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features. |
title_short |
GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features. |
title_full |
GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features. |
title_fullStr |
GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features. |
title_full_unstemmed |
GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features. |
title_sort |
gpsuc: global prediction of generic and species-specific succinylation sites by aggregating multiple sequence features. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2018-01-01 |
description |
Lysine succinylation is one of the dominant post-translational modification of the protein that contributes to many biological processes including cell cycle, growth and signal transduction pathways. Identification of succinylation sites is an important step for understanding the function of proteins. The complicated sequence patterns of protein succinylation revealed by proteomic studies highlight the necessity of developing effective species-specific in silico strategies for global prediction succinylation sites. Here we have developed the generic and nine species-specific succinylation site classifiers through aggregating multiple complementary features. We optimized the consecutive features using the Wilcoxon-rank feature selection scheme. The final feature vectors were trained by a random forest (RF) classifier. With an integration of RF scores via logistic regression, the resulting predictor termed GPSuc achieved better performance than other existing generic and species-specific succinylation site predictors. To reveal the mechanism of succinylation and assist hypothesis-driven experimental design, our predictor serves as a valuable resource. To provide a promising performance in large-scale datasets, a web application was developed at http://kurata14.bio.kyutech.ac.jp/GPSuc/. |
url |
http://europepmc.org/articles/PMC6193575?pdf=render |
work_keys_str_mv |
AT mdmehedihasan gpsucglobalpredictionofgenericandspeciesspecificsuccinylationsitesbyaggregatingmultiplesequencefeatures AT hiroyukikurata gpsucglobalpredictionofgenericandspeciesspecificsuccinylationsitesbyaggregatingmultiplesequencefeatures |
_version_ |
1725956001561051136 |