How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques

A great variety of computational approaches support drug design processes, helping in selection of new potentially active compounds, and optimization of their physicochemical and ADMET properties. Machine learning is a group of methods that are able to evaluate in relatively short time enormous amou...

Full description

Bibliographic Details
Main Authors: Igor Sieradzki, Damian Leśniak, Sabina Podlewska
Format: Article
Language:English
Published: MDPI AG 2020-03-01
Series:Molecules
Subjects:
Online Access:https://www.mdpi.com/1420-3049/25/6/1452
id doaj-207406e1a4704cfc9ccc30b977a05113
record_format Article
spelling doaj-207406e1a4704cfc9ccc30b977a051132020-11-25T02:01:59ZengMDPI AGMolecules1420-30492020-03-01256145210.3390/molecules25061452molecules25061452How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning TechniquesIgor Sieradzki0Damian Leśniak1Sabina Podlewska2Faculty of Mathematics and Computer Science, Jagiellonian University, 6 S. Łojasiewicza Street, 30-348 Cracow, PolandFaculty of Mathematics and Computer Science, Jagiellonian University, 6 S. Łojasiewicza Street, 30-348 Cracow, PolandDepartment of Technology and Biotechnology of Drugs, Jagiellonian University, Medical College, 9 Medyczna Street, 30-688 Cracow, PolandA great variety of computational approaches support drug design processes, helping in selection of new potentially active compounds, and optimization of their physicochemical and ADMET properties. Machine learning is a group of methods that are able to evaluate in relatively short time enormous amounts of data. However, the quality of machine-learning-based prediction depends on the data supplied for model training. In this study, we used deep neural networks for the task of compound activity prediction and developed dropout-based approaches for estimating prediction uncertainty. Several types of analyses were performed: the relationships between the prediction error, similarity to the training set, prediction uncertainty, number and standard deviation of activity values were examined. It was tested whether incorporation of information about prediction uncertainty influences compounds ranking based on predicted activity and prediction uncertainty was used to search for the potential errors in the ChEMBL database. The obtained outcome indicates that incorporation of information about uncertainty of compound activity prediction can be of great help during virtual screening experiments.https://www.mdpi.com/1420-3049/25/6/1452machine learningprediction uncertaintydeep learningligandschembl database
collection DOAJ
language English
format Article
sources DOAJ
author Igor Sieradzki
Damian Leśniak
Sabina Podlewska
spellingShingle Igor Sieradzki
Damian Leśniak
Sabina Podlewska
How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
Molecules
machine learning
prediction uncertainty
deep learning
ligands
chembl database
author_facet Igor Sieradzki
Damian Leśniak
Sabina Podlewska
author_sort Igor Sieradzki
title How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
title_short How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
title_full How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
title_fullStr How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
title_full_unstemmed How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
title_sort how sure can we be about ml methods-based evaluation of compound activity: incorporation of information about prediction uncertainty using deep learning techniques
publisher MDPI AG
series Molecules
issn 1420-3049
publishDate 2020-03-01
description A great variety of computational approaches support drug design processes, helping in selection of new potentially active compounds, and optimization of their physicochemical and ADMET properties. Machine learning is a group of methods that are able to evaluate in relatively short time enormous amounts of data. However, the quality of machine-learning-based prediction depends on the data supplied for model training. In this study, we used deep neural networks for the task of compound activity prediction and developed dropout-based approaches for estimating prediction uncertainty. Several types of analyses were performed: the relationships between the prediction error, similarity to the training set, prediction uncertainty, number and standard deviation of activity values were examined. It was tested whether incorporation of information about prediction uncertainty influences compounds ranking based on predicted activity and prediction uncertainty was used to search for the potential errors in the ChEMBL database. The obtained outcome indicates that incorporation of information about uncertainty of compound activity prediction can be of great help during virtual screening experiments.
topic machine learning
prediction uncertainty
deep learning
ligands
chembl database
url https://www.mdpi.com/1420-3049/25/6/1452
work_keys_str_mv AT igorsieradzki howsurecanwebeaboutmlmethodsbasedevaluationofcompoundactivityincorporationofinformationaboutpredictionuncertaintyusingdeeplearningtechniques
AT damianlesniak howsurecanwebeaboutmlmethodsbasedevaluationofcompoundactivityincorporationofinformationaboutpredictionuncertaintyusingdeeplearningtechniques
AT sabinapodlewska howsurecanwebeaboutmlmethodsbasedevaluationofcompoundactivityincorporationofinformationaboutpredictionuncertaintyusingdeeplearningtechniques
_version_ 1724954656133611520