DeepBindRG: a deep learning based method for estimating effective protein–ligand affinity

Proteins interact with small molecules to modulate several important cellular functions. Many acute diseases were cured by small molecule binding in the active site of protein either by inhibition or activation. Currently, there are several docking programs to estimate the binding position and the b...

Full description

Bibliographic Details
Main Authors: Haiping Zhang, Linbu Liao, Konda Mani Saravanan, Peng Yin, Yanjie Wei
Format: Article
Language:English
Published: PeerJ Inc. 2019-07-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/7362.pdf
id doaj-c3a06a2ffbe246f9ba02206d847ef4f2
record_format Article
spelling doaj-c3a06a2ffbe246f9ba02206d847ef4f22020-11-24T21:28:26ZengPeerJ Inc.PeerJ2167-83592019-07-017e736210.7717/peerj.7362DeepBindRG: a deep learning based method for estimating effective protein–ligand affinityHaiping ZhangLinbu LiaoKonda Mani SaravananPeng YinYanjie WeiProteins interact with small molecules to modulate several important cellular functions. Many acute diseases were cured by small molecule binding in the active site of protein either by inhibition or activation. Currently, there are several docking programs to estimate the binding position and the binding orientation of protein–ligand complex. Many scoring functions were developed to estimate the binding strength and predict the effective protein–ligand binding. While the accuracy of current scoring function is limited by several aspects, the solvent effect, entropy effect, and multibody effect are largely ignored in traditional machine learning methods. In this paper, we proposed a new deep neural network-based model named DeepBindRG to predict the binding affinity of protein–ligand complex, which learns all the effects, binding mode, and specificity implicitly by learning protein–ligand interface contact information from a large protein–ligand dataset. During the initial data processing step, the critical interface information was preserved to make sure the input is suitable for the proposed deep learning model. While validating our model on three independent datasets, DeepBindRG achieves root mean squared error (RMSE) value of pKa (−logKd or −logKi) about 1.6–1.8 and R value around 0.5–0.6, which is better than the autodock vina whose RMSE value is about 2.2–2.4 and R value is 0.42–0.57. We also explored the detailed reasons for the performance of DeepBindRG, especially for several failed cases by vina. Furthermore, DeepBindRG performed better for four challenging datasets from DUD.E database with no experimental protein–ligand complexes. The better performance of DeepBindRG than autodock vina in predicting protein–ligand binding affinity indicates that deep learning approach can greatly help with the drug discovery process. We also compare the performance of DeepBindRG with a 4D based deep learning method “pafnucy”, the advantage and limitation of both methods have provided clues for improving the deep learning based protein–ligand prediction model in the future.https://peerj.com/articles/7362.pdfProtein–ligand binding affinityResNetDeep neural networkNative-like protein–ligand complexDrug design
collection DOAJ
language English
format Article
sources DOAJ
author Haiping Zhang
Linbu Liao
Konda Mani Saravanan
Peng Yin
Yanjie Wei
spellingShingle Haiping Zhang
Linbu Liao
Konda Mani Saravanan
Peng Yin
Yanjie Wei
DeepBindRG: a deep learning based method for estimating effective protein–ligand affinity
PeerJ
Protein–ligand binding affinity
ResNet
Deep neural network
Native-like protein–ligand complex
Drug design
author_facet Haiping Zhang
Linbu Liao
Konda Mani Saravanan
Peng Yin
Yanjie Wei
author_sort Haiping Zhang
title DeepBindRG: a deep learning based method for estimating effective protein–ligand affinity
title_short DeepBindRG: a deep learning based method for estimating effective protein–ligand affinity
title_full DeepBindRG: a deep learning based method for estimating effective protein–ligand affinity
title_fullStr DeepBindRG: a deep learning based method for estimating effective protein–ligand affinity
title_full_unstemmed DeepBindRG: a deep learning based method for estimating effective protein–ligand affinity
title_sort deepbindrg: a deep learning based method for estimating effective protein–ligand affinity
publisher PeerJ Inc.
series PeerJ
issn 2167-8359
publishDate 2019-07-01
description Proteins interact with small molecules to modulate several important cellular functions. Many acute diseases were cured by small molecule binding in the active site of protein either by inhibition or activation. Currently, there are several docking programs to estimate the binding position and the binding orientation of protein–ligand complex. Many scoring functions were developed to estimate the binding strength and predict the effective protein–ligand binding. While the accuracy of current scoring function is limited by several aspects, the solvent effect, entropy effect, and multibody effect are largely ignored in traditional machine learning methods. In this paper, we proposed a new deep neural network-based model named DeepBindRG to predict the binding affinity of protein–ligand complex, which learns all the effects, binding mode, and specificity implicitly by learning protein–ligand interface contact information from a large protein–ligand dataset. During the initial data processing step, the critical interface information was preserved to make sure the input is suitable for the proposed deep learning model. While validating our model on three independent datasets, DeepBindRG achieves root mean squared error (RMSE) value of pKa (−logKd or −logKi) about 1.6–1.8 and R value around 0.5–0.6, which is better than the autodock vina whose RMSE value is about 2.2–2.4 and R value is 0.42–0.57. We also explored the detailed reasons for the performance of DeepBindRG, especially for several failed cases by vina. Furthermore, DeepBindRG performed better for four challenging datasets from DUD.E database with no experimental protein–ligand complexes. The better performance of DeepBindRG than autodock vina in predicting protein–ligand binding affinity indicates that deep learning approach can greatly help with the drug discovery process. We also compare the performance of DeepBindRG with a 4D based deep learning method “pafnucy”, the advantage and limitation of both methods have provided clues for improving the deep learning based protein–ligand prediction model in the future.
topic Protein–ligand binding affinity
ResNet
Deep neural network
Native-like protein–ligand complex
Drug design
url https://peerj.com/articles/7362.pdf
work_keys_str_mv AT haipingzhang deepbindrgadeeplearningbasedmethodforestimatingeffectiveproteinligandaffinity
AT linbuliao deepbindrgadeeplearningbasedmethodforestimatingeffectiveproteinligandaffinity
AT kondamanisaravanan deepbindrgadeeplearningbasedmethodforestimatingeffectiveproteinligandaffinity
AT pengyin deepbindrgadeeplearningbasedmethodforestimatingeffectiveproteinligandaffinity
AT yanjiewei deepbindrgadeeplearningbasedmethodforestimatingeffectiveproteinligandaffinity
_version_ 1725970445882097664