Unified processing framework of high-dimensional and overly imbalanced chemical datasets for virtual screening

Virtual screening in drug discovery involves processing large datasets containing unknown molecules in order to find the ones that are likely to have the desired effects on a biological target, typically a protein receptor or an enzyme. Molecules are thereby classified into active or non-active in r...

Full description

Bibliographic Details
Main Author: Rafati-Afshar, Amir Ali
Published: Bournemouth University 2017
Subjects:
Online Access:https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.715386
id ndltd-bl.uk-oai-ethos.bl.uk-715386
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-7153862018-10-03T03:26:29ZUnified processing framework of high-dimensional and overly imbalanced chemical datasets for virtual screeningRafati-Afshar, Amir Ali2017Virtual screening in drug discovery involves processing large datasets containing unknown molecules in order to find the ones that are likely to have the desired effects on a biological target, typically a protein receptor or an enzyme. Molecules are thereby classified into active or non-active in relation to the target. Misclassification of molecules in cases such as drug discovery and medical diagnosis is costly, both in time and finances. In the process of discovering a drug, it is mainly the inactive molecules classified as active towards the biological target i.e. false positives that cause a delay in the progress and high late-stage attrition. However, despite the pool of techniques available, the selection of the suitable approach in each situation is still a major challenge. This PhD thesis is designed to develop a pioneering framework which enables the analysis of the virtual screening of chemical compounds datasets in a wide range of settings in a unified fashion. The proposed method provides a better understanding of the dynamics of innovatively combining data processing and classification methods in order to screen massive, potentially high dimensional and overly imbalanced datasets more efficiently.615.1Bournemouth Universityhttps://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.715386http://eprints.bournemouth.ac.uk/29248/Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 615.1
spellingShingle 615.1
Rafati-Afshar, Amir Ali
Unified processing framework of high-dimensional and overly imbalanced chemical datasets for virtual screening
description Virtual screening in drug discovery involves processing large datasets containing unknown molecules in order to find the ones that are likely to have the desired effects on a biological target, typically a protein receptor or an enzyme. Molecules are thereby classified into active or non-active in relation to the target. Misclassification of molecules in cases such as drug discovery and medical diagnosis is costly, both in time and finances. In the process of discovering a drug, it is mainly the inactive molecules classified as active towards the biological target i.e. false positives that cause a delay in the progress and high late-stage attrition. However, despite the pool of techniques available, the selection of the suitable approach in each situation is still a major challenge. This PhD thesis is designed to develop a pioneering framework which enables the analysis of the virtual screening of chemical compounds datasets in a wide range of settings in a unified fashion. The proposed method provides a better understanding of the dynamics of innovatively combining data processing and classification methods in order to screen massive, potentially high dimensional and overly imbalanced datasets more efficiently.
author Rafati-Afshar, Amir Ali
author_facet Rafati-Afshar, Amir Ali
author_sort Rafati-Afshar, Amir Ali
title Unified processing framework of high-dimensional and overly imbalanced chemical datasets for virtual screening
title_short Unified processing framework of high-dimensional and overly imbalanced chemical datasets for virtual screening
title_full Unified processing framework of high-dimensional and overly imbalanced chemical datasets for virtual screening
title_fullStr Unified processing framework of high-dimensional and overly imbalanced chemical datasets for virtual screening
title_full_unstemmed Unified processing framework of high-dimensional and overly imbalanced chemical datasets for virtual screening
title_sort unified processing framework of high-dimensional and overly imbalanced chemical datasets for virtual screening
publisher Bournemouth University
publishDate 2017
url https://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.715386
work_keys_str_mv AT rafatiafsharamirali unifiedprocessingframeworkofhighdimensionalandoverlyimbalancedchemicaldatasetsforvirtualscreening
_version_ 1718758548964376576