An integration of deep learning with feature embedding for protein–protein interaction prediction
Protein–protein interactions are closely relevant to protein function and drug discovery. Hence, accurately identifying protein–protein interactions will help us to understand the underlying molecular mechanisms and significantly facilitate the drug discovery. However, the majority of existing compu...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
PeerJ Inc.
2019-06-01
|
Series: | PeerJ |
Subjects: | |
Online Access: | https://peerj.com/articles/7126.pdf |
id |
doaj-8486dafb0ef043ee9f01d289adc09aab |
---|---|
record_format |
Article |
spelling |
doaj-8486dafb0ef043ee9f01d289adc09aab2020-11-25T00:47:19ZengPeerJ Inc.PeerJ2167-83592019-06-017e712610.7717/peerj.7126An integration of deep learning with feature embedding for protein–protein interaction predictionYu YaoXiuquan DuYanyu DiaoHuaixu ZhuProtein–protein interactions are closely relevant to protein function and drug discovery. Hence, accurately identifying protein–protein interactions will help us to understand the underlying molecular mechanisms and significantly facilitate the drug discovery. However, the majority of existing computational methods for protein–protein interactions prediction are focused on the feature extraction and combination of features and there have been limited gains from the state-of-the-art models. In this work, a new residue representation method named Res2vec is designed for protein sequence representation. Residue representations obtained by Res2vec describe more precisely residue-residue interactions from raw sequence and supply more effective inputs for the downstream deep learning model. Combining effective feature embedding with powerful deep learning techniques, our method provides a general computational pipeline to infer protein–protein interactions, even when protein structure knowledge is entirely unknown. The proposed method DeepFE-PPI is evaluated on the S. Cerevisiae and human datasets. The experimental results show that DeepFE-PPI achieves 94.78% (accuracy), 92.99% (recall), 96.45% (precision), 89.62% (Matthew’s correlation coefficient, MCC) and 98.71% (accuracy), 98.54% (recall), 98.77% (precision), 97.43% (MCC), respectively. In addition, we also evaluate the performance of DeepFE-PPI on five independent species datasets and all the results are superior to the existing methods. The comparisons show that DeepFE-PPI is capable of predicting protein–protein interactions by a novel residue representation method and a deep learning classification framework in an acceptable level of accuracy. The codes along with instructions to reproduce this work are available from https://github.com/xal2019/DeepFE-PPI.https://peerj.com/articles/7126.pdfDeep learningprotein–protein interactionFeature embeddingMachine learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yu Yao Xiuquan Du Yanyu Diao Huaixu Zhu |
spellingShingle |
Yu Yao Xiuquan Du Yanyu Diao Huaixu Zhu An integration of deep learning with feature embedding for protein–protein interaction prediction PeerJ Deep learning protein–protein interaction Feature embedding Machine learning |
author_facet |
Yu Yao Xiuquan Du Yanyu Diao Huaixu Zhu |
author_sort |
Yu Yao |
title |
An integration of deep learning with feature embedding for protein–protein interaction prediction |
title_short |
An integration of deep learning with feature embedding for protein–protein interaction prediction |
title_full |
An integration of deep learning with feature embedding for protein–protein interaction prediction |
title_fullStr |
An integration of deep learning with feature embedding for protein–protein interaction prediction |
title_full_unstemmed |
An integration of deep learning with feature embedding for protein–protein interaction prediction |
title_sort |
integration of deep learning with feature embedding for protein–protein interaction prediction |
publisher |
PeerJ Inc. |
series |
PeerJ |
issn |
2167-8359 |
publishDate |
2019-06-01 |
description |
Protein–protein interactions are closely relevant to protein function and drug discovery. Hence, accurately identifying protein–protein interactions will help us to understand the underlying molecular mechanisms and significantly facilitate the drug discovery. However, the majority of existing computational methods for protein–protein interactions prediction are focused on the feature extraction and combination of features and there have been limited gains from the state-of-the-art models. In this work, a new residue representation method named Res2vec is designed for protein sequence representation. Residue representations obtained by Res2vec describe more precisely residue-residue interactions from raw sequence and supply more effective inputs for the downstream deep learning model. Combining effective feature embedding with powerful deep learning techniques, our method provides a general computational pipeline to infer protein–protein interactions, even when protein structure knowledge is entirely unknown. The proposed method DeepFE-PPI is evaluated on the S. Cerevisiae and human datasets. The experimental results show that DeepFE-PPI achieves 94.78% (accuracy), 92.99% (recall), 96.45% (precision), 89.62% (Matthew’s correlation coefficient, MCC) and 98.71% (accuracy), 98.54% (recall), 98.77% (precision), 97.43% (MCC), respectively. In addition, we also evaluate the performance of DeepFE-PPI on five independent species datasets and all the results are superior to the existing methods. The comparisons show that DeepFE-PPI is capable of predicting protein–protein interactions by a novel residue representation method and a deep learning classification framework in an acceptable level of accuracy. The codes along with instructions to reproduce this work are available from https://github.com/xal2019/DeepFE-PPI. |
topic |
Deep learning protein–protein interaction Feature embedding Machine learning |
url |
https://peerj.com/articles/7126.pdf |
work_keys_str_mv |
AT yuyao anintegrationofdeeplearningwithfeatureembeddingforproteinproteininteractionprediction AT xiuquandu anintegrationofdeeplearningwithfeatureembeddingforproteinproteininteractionprediction AT yanyudiao anintegrationofdeeplearningwithfeatureembeddingforproteinproteininteractionprediction AT huaixuzhu anintegrationofdeeplearningwithfeatureembeddingforproteinproteininteractionprediction AT yuyao integrationofdeeplearningwithfeatureembeddingforproteinproteininteractionprediction AT xiuquandu integrationofdeeplearningwithfeatureembeddingforproteinproteininteractionprediction AT yanyudiao integrationofdeeplearningwithfeatureembeddingforproteinproteininteractionprediction AT huaixuzhu integrationofdeeplearningwithfeatureembeddingforproteinproteininteractionprediction |
_version_ |
1725260623713927168 |