Inferring interaction partners from protein sequences using mutual information.

Functional protein-protein interactions are crucial in most cellular processes. They enable multi-protein complexes to assemble and to remain stable, and they allow signal transduction in various pathways. Functional interactions between proteins result in coevolution between the interacting partner...

Full description

Bibliographic Details
Main Author: Anne-Florence Bitbol
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-11-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC6258550?pdf=render
id doaj-af6e1caa40404974aa7ea9716a959a22
record_format Article
spelling doaj-af6e1caa40404974aa7ea9716a959a222020-11-25T01:34:03ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582018-11-011411e100640110.1371/journal.pcbi.1006401Inferring interaction partners from protein sequences using mutual information.Anne-Florence BitbolFunctional protein-protein interactions are crucial in most cellular processes. They enable multi-protein complexes to assemble and to remain stable, and they allow signal transduction in various pathways. Functional interactions between proteins result in coevolution between the interacting partners, and thus in correlations between their sequences. Pairwise maximum-entropy based models have enabled successful inference of pairs of amino-acid residues that are in contact in the three-dimensional structure of multi-protein complexes, starting from the correlations in the sequence data of known interaction partners. Recently, algorithms inspired by these methods have been developed to identify which proteins are functional interaction partners among the paralogous proteins of two families, starting from sequence data alone. Here, we demonstrate that a slightly higher performance for partner identification can be reached by an approximate maximization of the mutual information between the sequence alignments of the two protein families. Our mutual information-based method also provides signatures of the existence of interactions between protein families. These results stand in contrast with structure prediction of proteins and of multi-protein complexes from sequence data, where pairwise maximum-entropy based global statistical models substantially improve performance compared to mutual information. Our findings entail that the statistical dependences allowing interaction partner prediction from sequence data are not restricted to the residue pairs that are in direct contact at the interface between the partner proteins.http://europepmc.org/articles/PMC6258550?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Anne-Florence Bitbol
spellingShingle Anne-Florence Bitbol
Inferring interaction partners from protein sequences using mutual information.
PLoS Computational Biology
author_facet Anne-Florence Bitbol
author_sort Anne-Florence Bitbol
title Inferring interaction partners from protein sequences using mutual information.
title_short Inferring interaction partners from protein sequences using mutual information.
title_full Inferring interaction partners from protein sequences using mutual information.
title_fullStr Inferring interaction partners from protein sequences using mutual information.
title_full_unstemmed Inferring interaction partners from protein sequences using mutual information.
title_sort inferring interaction partners from protein sequences using mutual information.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2018-11-01
description Functional protein-protein interactions are crucial in most cellular processes. They enable multi-protein complexes to assemble and to remain stable, and they allow signal transduction in various pathways. Functional interactions between proteins result in coevolution between the interacting partners, and thus in correlations between their sequences. Pairwise maximum-entropy based models have enabled successful inference of pairs of amino-acid residues that are in contact in the three-dimensional structure of multi-protein complexes, starting from the correlations in the sequence data of known interaction partners. Recently, algorithms inspired by these methods have been developed to identify which proteins are functional interaction partners among the paralogous proteins of two families, starting from sequence data alone. Here, we demonstrate that a slightly higher performance for partner identification can be reached by an approximate maximization of the mutual information between the sequence alignments of the two protein families. Our mutual information-based method also provides signatures of the existence of interactions between protein families. These results stand in contrast with structure prediction of proteins and of multi-protein complexes from sequence data, where pairwise maximum-entropy based global statistical models substantially improve performance compared to mutual information. Our findings entail that the statistical dependences allowing interaction partner prediction from sequence data are not restricted to the residue pairs that are in direct contact at the interface between the partner proteins.
url http://europepmc.org/articles/PMC6258550?pdf=render
work_keys_str_mv AT anneflorencebitbol inferringinteractionpartnersfromproteinsequencesusingmutualinformation
_version_ 1725074062263189504