Prediction of Signal Peptides in Proteins from Malaria Parasites

Signal peptides are N-terminal presequences responsible for targeting proteins to the endomembrane system, and subsequent subcellular or extracellular compartments, and consequently condition their proper function. The significance of signal peptides stimulates development of new computational metho...

Full description

Bibliographic Details
Main Authors: Michał Burdukiewicz, Piotr Sobczyk, Jarosław Chilimoniuk, Przemysław Gagat, Paweł Mackiewicz
Format: Article
Language:English
Published: MDPI AG 2018-11-01
Series:International Journal of Molecular Sciences
Subjects:
Online Access:https://www.mdpi.com/1422-0067/19/12/3709
id doaj-f9977d46ef2b4036a31cfb4b7ab2ba63
record_format Article
spelling doaj-f9977d46ef2b4036a31cfb4b7ab2ba632020-11-24T21:49:14ZengMDPI AGInternational Journal of Molecular Sciences1422-00672018-11-011912370910.3390/ijms19123709ijms19123709Prediction of Signal Peptides in Proteins from Malaria ParasitesMichał Burdukiewicz0Piotr Sobczyk1Jarosław Chilimoniuk2Przemysław Gagat3Paweł Mackiewicz4Faculty of Mathematics and Information Science, Warsaw University of Technology, 00-661 Warszawa, PolandDepartment of Mathematics, Wrocław University of Technology, 50-370 Wrocław, PolandDepartment of Genomics, University of Wrocław, 50-383 Wrocław, PolandDepartment of Genomics, University of Wrocław, 50-383 Wrocław, PolandDepartment of Genomics, University of Wrocław, 50-383 Wrocław, PolandSignal peptides are N-terminal presequences responsible for targeting proteins to the endomembrane system, and subsequent subcellular or extracellular compartments, and consequently condition their proper function. The significance of signal peptides stimulates development of new computational methods for their detection. These methods employ learning systems trained on datasets comprising signal peptides from different types of proteins and taxonomic groups. As a result, the accuracy of predictions are high in the case of signal peptides that are well-represented in databases, but might be low in other, atypical cases. Such atypical signal peptides are present in proteins found in apicomplexan parasites, causative agents of malaria and toxoplasmosis. Apicomplexan proteins have a unique amino acid composition due to their AT-biased genomes. Therefore, we designed a new, more flexible and universal probabilistic model for recognition of atypical eukaryotic signal peptides. Our approach called signalHsmm includes knowledge about the structure of signal peptides and physicochemical properties of amino acids. It is able to recognize signal peptides from the malaria parasites and related species more accurately than popular programs. Moreover, it is still universal enough to provide prediction of other signal peptides on par with the best preforming predictors.https://www.mdpi.com/1422-0067/19/12/3709apicomplexaplasmodiummalariaHSMMhidden semi-Markov modelsignal peptides
collection DOAJ
language English
format Article
sources DOAJ
author Michał Burdukiewicz
Piotr Sobczyk
Jarosław Chilimoniuk
Przemysław Gagat
Paweł Mackiewicz
spellingShingle Michał Burdukiewicz
Piotr Sobczyk
Jarosław Chilimoniuk
Przemysław Gagat
Paweł Mackiewicz
Prediction of Signal Peptides in Proteins from Malaria Parasites
International Journal of Molecular Sciences
apicomplexa
plasmodium
malaria
HSMM
hidden semi-Markov model
signal peptides
author_facet Michał Burdukiewicz
Piotr Sobczyk
Jarosław Chilimoniuk
Przemysław Gagat
Paweł Mackiewicz
author_sort Michał Burdukiewicz
title Prediction of Signal Peptides in Proteins from Malaria Parasites
title_short Prediction of Signal Peptides in Proteins from Malaria Parasites
title_full Prediction of Signal Peptides in Proteins from Malaria Parasites
title_fullStr Prediction of Signal Peptides in Proteins from Malaria Parasites
title_full_unstemmed Prediction of Signal Peptides in Proteins from Malaria Parasites
title_sort prediction of signal peptides in proteins from malaria parasites
publisher MDPI AG
series International Journal of Molecular Sciences
issn 1422-0067
publishDate 2018-11-01
description Signal peptides are N-terminal presequences responsible for targeting proteins to the endomembrane system, and subsequent subcellular or extracellular compartments, and consequently condition their proper function. The significance of signal peptides stimulates development of new computational methods for their detection. These methods employ learning systems trained on datasets comprising signal peptides from different types of proteins and taxonomic groups. As a result, the accuracy of predictions are high in the case of signal peptides that are well-represented in databases, but might be low in other, atypical cases. Such atypical signal peptides are present in proteins found in apicomplexan parasites, causative agents of malaria and toxoplasmosis. Apicomplexan proteins have a unique amino acid composition due to their AT-biased genomes. Therefore, we designed a new, more flexible and universal probabilistic model for recognition of atypical eukaryotic signal peptides. Our approach called signalHsmm includes knowledge about the structure of signal peptides and physicochemical properties of amino acids. It is able to recognize signal peptides from the malaria parasites and related species more accurately than popular programs. Moreover, it is still universal enough to provide prediction of other signal peptides on par with the best preforming predictors.
topic apicomplexa
plasmodium
malaria
HSMM
hidden semi-Markov model
signal peptides
url https://www.mdpi.com/1422-0067/19/12/3709
work_keys_str_mv AT michałburdukiewicz predictionofsignalpeptidesinproteinsfrommalariaparasites
AT piotrsobczyk predictionofsignalpeptidesinproteinsfrommalariaparasites
AT jarosławchilimoniuk predictionofsignalpeptidesinproteinsfrommalariaparasites
AT przemysławgagat predictionofsignalpeptidesinproteinsfrommalariaparasites
AT pawełmackiewicz predictionofsignalpeptidesinproteinsfrommalariaparasites
_version_ 1725888562682920960