How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques
A great variety of computational approaches support drug design processes, helping in selection of new potentially active compounds, and optimization of their physicochemical and ADMET properties. Machine learning is a group of methods that are able to evaluate in relatively short time enormous amou...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-03-01
|
Series: | Molecules |
Subjects: | |
Online Access: | https://www.mdpi.com/1420-3049/25/6/1452 |
id |
doaj-207406e1a4704cfc9ccc30b977a05113 |
---|---|
record_format |
Article |
spelling |
doaj-207406e1a4704cfc9ccc30b977a051132020-11-25T02:01:59ZengMDPI AGMolecules1420-30492020-03-01256145210.3390/molecules25061452molecules25061452How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning TechniquesIgor Sieradzki0Damian Leśniak1Sabina Podlewska2Faculty of Mathematics and Computer Science, Jagiellonian University, 6 S. Łojasiewicza Street, 30-348 Cracow, PolandFaculty of Mathematics and Computer Science, Jagiellonian University, 6 S. Łojasiewicza Street, 30-348 Cracow, PolandDepartment of Technology and Biotechnology of Drugs, Jagiellonian University, Medical College, 9 Medyczna Street, 30-688 Cracow, PolandA great variety of computational approaches support drug design processes, helping in selection of new potentially active compounds, and optimization of their physicochemical and ADMET properties. Machine learning is a group of methods that are able to evaluate in relatively short time enormous amounts of data. However, the quality of machine-learning-based prediction depends on the data supplied for model training. In this study, we used deep neural networks for the task of compound activity prediction and developed dropout-based approaches for estimating prediction uncertainty. Several types of analyses were performed: the relationships between the prediction error, similarity to the training set, prediction uncertainty, number and standard deviation of activity values were examined. It was tested whether incorporation of information about prediction uncertainty influences compounds ranking based on predicted activity and prediction uncertainty was used to search for the potential errors in the ChEMBL database. The obtained outcome indicates that incorporation of information about uncertainty of compound activity prediction can be of great help during virtual screening experiments.https://www.mdpi.com/1420-3049/25/6/1452machine learningprediction uncertaintydeep learningligandschembl database |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Igor Sieradzki Damian Leśniak Sabina Podlewska |
spellingShingle |
Igor Sieradzki Damian Leśniak Sabina Podlewska How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques Molecules machine learning prediction uncertainty deep learning ligands chembl database |
author_facet |
Igor Sieradzki Damian Leśniak Sabina Podlewska |
author_sort |
Igor Sieradzki |
title |
How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques |
title_short |
How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques |
title_full |
How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques |
title_fullStr |
How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques |
title_full_unstemmed |
How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques |
title_sort |
how sure can we be about ml methods-based evaluation of compound activity: incorporation of information about prediction uncertainty using deep learning techniques |
publisher |
MDPI AG |
series |
Molecules |
issn |
1420-3049 |
publishDate |
2020-03-01 |
description |
A great variety of computational approaches support drug design processes, helping in selection of new potentially active compounds, and optimization of their physicochemical and ADMET properties. Machine learning is a group of methods that are able to evaluate in relatively short time enormous amounts of data. However, the quality of machine-learning-based prediction depends on the data supplied for model training. In this study, we used deep neural networks for the task of compound activity prediction and developed dropout-based approaches for estimating prediction uncertainty. Several types of analyses were performed: the relationships between the prediction error, similarity to the training set, prediction uncertainty, number and standard deviation of activity values were examined. It was tested whether incorporation of information about prediction uncertainty influences compounds ranking based on predicted activity and prediction uncertainty was used to search for the potential errors in the ChEMBL database. The obtained outcome indicates that incorporation of information about uncertainty of compound activity prediction can be of great help during virtual screening experiments. |
topic |
machine learning prediction uncertainty deep learning ligands chembl database |
url |
https://www.mdpi.com/1420-3049/25/6/1452 |
work_keys_str_mv |
AT igorsieradzki howsurecanwebeaboutmlmethodsbasedevaluationofcompoundactivityincorporationofinformationaboutpredictionuncertaintyusingdeeplearningtechniques AT damianlesniak howsurecanwebeaboutmlmethodsbasedevaluationofcompoundactivityincorporationofinformationaboutpredictionuncertaintyusingdeeplearningtechniques AT sabinapodlewska howsurecanwebeaboutmlmethodsbasedevaluationofcompoundactivityincorporationofinformationaboutpredictionuncertaintyusingdeeplearningtechniques |
_version_ |
1724954656133611520 |