DLIGAND2: an improved knowledge-based energy function for protein–ligand interactions using the distance-scaled, finite, ideal-gas reference state

Abstract Performance of structure-based molecular docking largely depends on the accuracy of scoring functions. One important type of scoring functions are knowledge-based potentials derived from known three-dimensional structures of proteins and/or protein–ligand complex structures. This study seek...

Full description

Bibliographic Details
Main Authors: Pin Chen, Yaobin Ke, Yutong Lu, Yunfei Du, Jiahui Li, Hui Yan, Huiying Zhao, Yaoqi Zhou, Yuedong Yang
Format: Article
Language:English
Published: BMC 2019-08-01
Series:Journal of Cheminformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13321-019-0373-4
id doaj-687173ee8201457da96792197b2c54b8
record_format Article
spelling doaj-687173ee8201457da96792197b2c54b82020-11-25T03:15:00ZengBMCJournal of Cheminformatics1758-29462019-08-0111111110.1186/s13321-019-0373-4DLIGAND2: an improved knowledge-based energy function for protein–ligand interactions using the distance-scaled, finite, ideal-gas reference statePin Chen0Yaobin Ke1Yutong Lu2Yunfei Du3Jiahui Li4Hui Yan5Huiying Zhao6Yaoqi Zhou7Yuedong Yang8National Supercomputer Center in Guangzhou, School of Data and Computer Science, Sun Yat-sen UniversityNational Supercomputer Center in Guangzhou, School of Data and Computer Science, Sun Yat-sen UniversityNational Supercomputer Center in Guangzhou, School of Data and Computer Science, Sun Yat-sen UniversityNational Supercomputer Center in Guangzhou, School of Data and Computer Science, Sun Yat-sen UniversityNational Supercomputer Center in Guangzhou, School of Data and Computer Science, Sun Yat-sen UniversityNational Supercomputer Center in Guangzhou, School of Data and Computer Science, Sun Yat-sen UniversitySun Yat-sen Memorial Hospital, Sun Yat-sen UniversityInstitute for Glycomics and School of Information and Communication Technology, Griffith UniversityNational Supercomputer Center in Guangzhou, School of Data and Computer Science, Sun Yat-sen UniversityAbstract Performance of structure-based molecular docking largely depends on the accuracy of scoring functions. One important type of scoring functions are knowledge-based potentials derived from known three-dimensional structures of proteins and/or protein–ligand complex structures. This study seeks to improve a knowledge-based protein–ligand potential based on a distance-scale finite ideal-gas reference (DFIRE) state (DLIGAND) by expanding the representation of protein atoms from 13 mol2 atom types to 167 residue-specific atom types, and employing a recently updated dataset containing 12,450 monomer protein chains for training. We found that the updated version DLIGAND2 has a consistent improvement over DLIGAND in predicting binding affinities for either native complex structures or docking-generated poses. More importantly, DLIGAND2 has a 52% increase over DLIGAND in enrichment factors in top 1% predictions based on the DUD-E decoy set, and consistently improves over Autodock Vina and other statistical energy functions in all three benchmark tests. We further found that DLIGAND2 outperforms empirical and machine-learning methods compared for virtual screening on new targets that are not homologous to the DUD-E training set. Given the best performance as a parameter-free statistical potential and among the best in all performance measures, DLIGAND2 should be useful for re-assessing the poses generated by docking software, or acting as one term in other scoring functions. The program is available at https://github.com/sysu-yanglab/DLIGAND2.http://link.springer.com/article/10.1186/s13321-019-0373-4DockingProtein–ligand interactionKnowledge-based energy function
collection DOAJ
language English
format Article
sources DOAJ
author Pin Chen
Yaobin Ke
Yutong Lu
Yunfei Du
Jiahui Li
Hui Yan
Huiying Zhao
Yaoqi Zhou
Yuedong Yang
spellingShingle Pin Chen
Yaobin Ke
Yutong Lu
Yunfei Du
Jiahui Li
Hui Yan
Huiying Zhao
Yaoqi Zhou
Yuedong Yang
DLIGAND2: an improved knowledge-based energy function for protein–ligand interactions using the distance-scaled, finite, ideal-gas reference state
Journal of Cheminformatics
Docking
Protein–ligand interaction
Knowledge-based energy function
author_facet Pin Chen
Yaobin Ke
Yutong Lu
Yunfei Du
Jiahui Li
Hui Yan
Huiying Zhao
Yaoqi Zhou
Yuedong Yang
author_sort Pin Chen
title DLIGAND2: an improved knowledge-based energy function for protein–ligand interactions using the distance-scaled, finite, ideal-gas reference state
title_short DLIGAND2: an improved knowledge-based energy function for protein–ligand interactions using the distance-scaled, finite, ideal-gas reference state
title_full DLIGAND2: an improved knowledge-based energy function for protein–ligand interactions using the distance-scaled, finite, ideal-gas reference state
title_fullStr DLIGAND2: an improved knowledge-based energy function for protein–ligand interactions using the distance-scaled, finite, ideal-gas reference state
title_full_unstemmed DLIGAND2: an improved knowledge-based energy function for protein–ligand interactions using the distance-scaled, finite, ideal-gas reference state
title_sort dligand2: an improved knowledge-based energy function for protein–ligand interactions using the distance-scaled, finite, ideal-gas reference state
publisher BMC
series Journal of Cheminformatics
issn 1758-2946
publishDate 2019-08-01
description Abstract Performance of structure-based molecular docking largely depends on the accuracy of scoring functions. One important type of scoring functions are knowledge-based potentials derived from known three-dimensional structures of proteins and/or protein–ligand complex structures. This study seeks to improve a knowledge-based protein–ligand potential based on a distance-scale finite ideal-gas reference (DFIRE) state (DLIGAND) by expanding the representation of protein atoms from 13 mol2 atom types to 167 residue-specific atom types, and employing a recently updated dataset containing 12,450 monomer protein chains for training. We found that the updated version DLIGAND2 has a consistent improvement over DLIGAND in predicting binding affinities for either native complex structures or docking-generated poses. More importantly, DLIGAND2 has a 52% increase over DLIGAND in enrichment factors in top 1% predictions based on the DUD-E decoy set, and consistently improves over Autodock Vina and other statistical energy functions in all three benchmark tests. We further found that DLIGAND2 outperforms empirical and machine-learning methods compared for virtual screening on new targets that are not homologous to the DUD-E training set. Given the best performance as a parameter-free statistical potential and among the best in all performance measures, DLIGAND2 should be useful for re-assessing the poses generated by docking software, or acting as one term in other scoring functions. The program is available at https://github.com/sysu-yanglab/DLIGAND2.
topic Docking
Protein–ligand interaction
Knowledge-based energy function
url http://link.springer.com/article/10.1186/s13321-019-0373-4
work_keys_str_mv AT pinchen dligand2animprovedknowledgebasedenergyfunctionforproteinligandinteractionsusingthedistancescaledfiniteidealgasreferencestate
AT yaobinke dligand2animprovedknowledgebasedenergyfunctionforproteinligandinteractionsusingthedistancescaledfiniteidealgasreferencestate
AT yutonglu dligand2animprovedknowledgebasedenergyfunctionforproteinligandinteractionsusingthedistancescaledfiniteidealgasreferencestate
AT yunfeidu dligand2animprovedknowledgebasedenergyfunctionforproteinligandinteractionsusingthedistancescaledfiniteidealgasreferencestate
AT jiahuili dligand2animprovedknowledgebasedenergyfunctionforproteinligandinteractionsusingthedistancescaledfiniteidealgasreferencestate
AT huiyan dligand2animprovedknowledgebasedenergyfunctionforproteinligandinteractionsusingthedistancescaledfiniteidealgasreferencestate
AT huiyingzhao dligand2animprovedknowledgebasedenergyfunctionforproteinligandinteractionsusingthedistancescaledfiniteidealgasreferencestate
AT yaoqizhou dligand2animprovedknowledgebasedenergyfunctionforproteinligandinteractionsusingthedistancescaledfiniteidealgasreferencestate
AT yuedongyang dligand2animprovedknowledgebasedenergyfunctionforproteinligandinteractionsusingthedistancescaledfiniteidealgasreferencestate
_version_ 1724641088984055808