Proteome scanning to predict PDZ domain interactions using support vector machines

<p>Abstract</p> <p>Background</p> <p>PDZ domains mediate protein-protein interactions involved in important biological processes through the recognition of short linear motifs in their target proteins. Two recent independent studies have used protein microarray or phage...

Full description

Bibliographic Details
Main Authors: Bader Gary D, Hui Shirley
Format: Article
Language:English
Published: BMC 2010-10-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/11/507
id doaj-96cd45551697414ca9c879553fb0b8db
record_format Article
spelling doaj-96cd45551697414ca9c879553fb0b8db2020-11-25T00:06:17ZengBMCBMC Bioinformatics1471-21052010-10-0111150710.1186/1471-2105-11-507Proteome scanning to predict PDZ domain interactions using support vector machinesBader Gary DHui Shirley<p>Abstract</p> <p>Background</p> <p>PDZ domains mediate protein-protein interactions involved in important biological processes through the recognition of short linear motifs in their target proteins. Two recent independent studies have used protein microarray or phage display technology to detect PDZ domain interactions with peptide ligands on a large scale. Several computational predictors of PDZ domain interactions have been developed, however they are trained using only protein microarray data and focus on limited subsets of PDZ domains. An accurate predictor of genomic PDZ domain interactions would allow the proteomes of organisms to be scanned for potential binders. Such an application would require an accurate and precise predictor to avoid generating too many false positive hits given the large amount of possible interactors in a given proteome. Once validated these predictions will help to increase the coverage of current PDZ domain interaction networks and further our understanding of the roles that PDZ domains play in a variety of biological processes.</p> <p>Results</p> <p>We developed a PDZ domain interaction predictor using a support vector machine (SVM) trained with both protein microarray and phage display data. In order to use the phage display data for training, which only contains positive interactions, we developed a method to generate artificial negative interactions. Using cross-validation and a series of independent tests, we showed that our SVM successfully predicts interactions in different organisms. We then used the SVM to scan the proteomes of human, worm and fly to predict binders for several PDZ domains. Predictions were validated using known genomic interactions and published protein microarray experiments. Based on our results, new protein interactions potentially associated with Usher and Bardet-Biedl syndromes were predicted. A comparison of performance measures (F1 measure and FPR) for the SVM and published predictors demonstrated our SVM's improved accuracy and precision at proteome scanning.</p> <p>Conclusions</p> <p>We built an SVM using mouse and human experimental training data to predict PDZ domain interactions. We showed that it correctly predicts known interactions from proteomes of different organisms and is more accurate and precise at proteome scanning compared with published state-of-the-art predictors.</p> http://www.biomedcentral.com/1471-2105/11/507
collection DOAJ
language English
format Article
sources DOAJ
author Bader Gary D
Hui Shirley
spellingShingle Bader Gary D
Hui Shirley
Proteome scanning to predict PDZ domain interactions using support vector machines
BMC Bioinformatics
author_facet Bader Gary D
Hui Shirley
author_sort Bader Gary D
title Proteome scanning to predict PDZ domain interactions using support vector machines
title_short Proteome scanning to predict PDZ domain interactions using support vector machines
title_full Proteome scanning to predict PDZ domain interactions using support vector machines
title_fullStr Proteome scanning to predict PDZ domain interactions using support vector machines
title_full_unstemmed Proteome scanning to predict PDZ domain interactions using support vector machines
title_sort proteome scanning to predict pdz domain interactions using support vector machines
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2010-10-01
description <p>Abstract</p> <p>Background</p> <p>PDZ domains mediate protein-protein interactions involved in important biological processes through the recognition of short linear motifs in their target proteins. Two recent independent studies have used protein microarray or phage display technology to detect PDZ domain interactions with peptide ligands on a large scale. Several computational predictors of PDZ domain interactions have been developed, however they are trained using only protein microarray data and focus on limited subsets of PDZ domains. An accurate predictor of genomic PDZ domain interactions would allow the proteomes of organisms to be scanned for potential binders. Such an application would require an accurate and precise predictor to avoid generating too many false positive hits given the large amount of possible interactors in a given proteome. Once validated these predictions will help to increase the coverage of current PDZ domain interaction networks and further our understanding of the roles that PDZ domains play in a variety of biological processes.</p> <p>Results</p> <p>We developed a PDZ domain interaction predictor using a support vector machine (SVM) trained with both protein microarray and phage display data. In order to use the phage display data for training, which only contains positive interactions, we developed a method to generate artificial negative interactions. Using cross-validation and a series of independent tests, we showed that our SVM successfully predicts interactions in different organisms. We then used the SVM to scan the proteomes of human, worm and fly to predict binders for several PDZ domains. Predictions were validated using known genomic interactions and published protein microarray experiments. Based on our results, new protein interactions potentially associated with Usher and Bardet-Biedl syndromes were predicted. A comparison of performance measures (F1 measure and FPR) for the SVM and published predictors demonstrated our SVM's improved accuracy and precision at proteome scanning.</p> <p>Conclusions</p> <p>We built an SVM using mouse and human experimental training data to predict PDZ domain interactions. We showed that it correctly predicts known interactions from proteomes of different organisms and is more accurate and precise at proteome scanning compared with published state-of-the-art predictors.</p>
url http://www.biomedcentral.com/1471-2105/11/507
work_keys_str_mv AT badergaryd proteomescanningtopredictpdzdomaininteractionsusingsupportvectormachines
AT huishirley proteomescanningtopredictpdzdomaininteractionsusingsupportvectormachines
_version_ 1725423145705275392