Computational Prediction of PDZ Mediated Protein-protein Interactions

Many protein-protein interactions, especially those involved in eukaryotic signalling, are mediated by PDZ domains through the recognition of hydrophobic C-termini. The availability of experimental PDZ interaction data sets have led to the construction of computational methods to predict PDZ domain...

Full description

Bibliographic Details
Main Author: Hui, Shirley
Other Authors: Bader, Gary D.
Language:en_ca
Published: 2013
Subjects:
Online Access:http://hdl.handle.net/1807/43599
id ndltd-TORONTO-oai-tspace.library.utoronto.ca-1807-43599
record_format oai_dc
spelling ndltd-TORONTO-oai-tspace.library.utoronto.ca-1807-435992014-01-10T04:19:51ZComputational Prediction of PDZ Mediated Protein-protein InteractionsHui, Shirleyprotein-protein interactionsinteraction predictionPDZ domainsmachine learning0715Many protein-protein interactions, especially those involved in eukaryotic signalling, are mediated by PDZ domains through the recognition of hydrophobic C-termini. The availability of experimental PDZ interaction data sets have led to the construction of computational methods to predict PDZ domain-peptide interactions. Such predictors are ideally suited to predict interactions in single organisms or for limited subsets of PDZ domains. As a result, the goal of my thesis has been to build general predictors that can be used to scan the proteomes of multiple organisms for ligands for almost all PDZ domains from select model organisms. A framework consisting of four steps: data collection, feature encoding, predictor training and evaluation was developed and applied for all predictors built in this thesis. The first predictor utilized PDZ domain-peptide sequence information from two interaction data sets obtained from high throughput protein microarray and phage display experiments in mouse and human, respectively. The second predictor used PDZ domain structure and peptide sequence information. I showed that these predictors are complementary to each other, are capable of predicting unseen interactions and can be used for the purposes of proteome scanning in human, worm and fly. As both positive and negative interactions are required for building a successful predictor, a major obstacle I addressed was the generation of artificial negative interactions for training. In particular, I used position weight matrices to generate such negatives for the positive only phage display data and used a semi-supervised learning approach to overcome the problem of over-prediction (i.e. prediction of too many positives). These predictors are available as a community web resource: http://webservice.baderlab.org/domains/POW. Finally, a Bayesian integration method combining information from different biological evidence sources was used to filter the human proteome scanning predictions from both predictors. This resulted in the construction of a comprehensive physiologically relevant high confidence PDZ mediated protein-protein interaction network in human.Bader, Gary D.2013-112014-01-09T21:19:38ZNO_RESTRICTION2014-01-09T21:19:38Z2014-01-09Thesishttp://hdl.handle.net/1807/43599en_ca
collection NDLTD
language en_ca
sources NDLTD
topic protein-protein interactions
interaction prediction
PDZ domains
machine learning
0715
spellingShingle protein-protein interactions
interaction prediction
PDZ domains
machine learning
0715
Hui, Shirley
Computational Prediction of PDZ Mediated Protein-protein Interactions
description Many protein-protein interactions, especially those involved in eukaryotic signalling, are mediated by PDZ domains through the recognition of hydrophobic C-termini. The availability of experimental PDZ interaction data sets have led to the construction of computational methods to predict PDZ domain-peptide interactions. Such predictors are ideally suited to predict interactions in single organisms or for limited subsets of PDZ domains. As a result, the goal of my thesis has been to build general predictors that can be used to scan the proteomes of multiple organisms for ligands for almost all PDZ domains from select model organisms. A framework consisting of four steps: data collection, feature encoding, predictor training and evaluation was developed and applied for all predictors built in this thesis. The first predictor utilized PDZ domain-peptide sequence information from two interaction data sets obtained from high throughput protein microarray and phage display experiments in mouse and human, respectively. The second predictor used PDZ domain structure and peptide sequence information. I showed that these predictors are complementary to each other, are capable of predicting unseen interactions and can be used for the purposes of proteome scanning in human, worm and fly. As both positive and negative interactions are required for building a successful predictor, a major obstacle I addressed was the generation of artificial negative interactions for training. In particular, I used position weight matrices to generate such negatives for the positive only phage display data and used a semi-supervised learning approach to overcome the problem of over-prediction (i.e. prediction of too many positives). These predictors are available as a community web resource: http://webservice.baderlab.org/domains/POW. Finally, a Bayesian integration method combining information from different biological evidence sources was used to filter the human proteome scanning predictions from both predictors. This resulted in the construction of a comprehensive physiologically relevant high confidence PDZ mediated protein-protein interaction network in human.
author2 Bader, Gary D.
author_facet Bader, Gary D.
Hui, Shirley
author Hui, Shirley
author_sort Hui, Shirley
title Computational Prediction of PDZ Mediated Protein-protein Interactions
title_short Computational Prediction of PDZ Mediated Protein-protein Interactions
title_full Computational Prediction of PDZ Mediated Protein-protein Interactions
title_fullStr Computational Prediction of PDZ Mediated Protein-protein Interactions
title_full_unstemmed Computational Prediction of PDZ Mediated Protein-protein Interactions
title_sort computational prediction of pdz mediated protein-protein interactions
publishDate 2013
url http://hdl.handle.net/1807/43599
work_keys_str_mv AT huishirley computationalpredictionofpdzmediatedproteinproteininteractions
_version_ 1716623612049358848