Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems

The goal of this master’s thesis is to identify and evaluate data mining algorithms which are commonly implemented in modern Medical Decision Support Systems (MDSS). They are used in various healthcare units all over the world. These institutions store large amounts of medical data. This data may co...

Full description

Bibliographic Details
Main Author: Aftarczuk, Kamila
Format: Others
Language:English
Published: Blekinge Tekniska Högskola, Avdelningen för programvarusystem 2007
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:bth-6194
id ndltd-UPSALLA1-oai-DiVA.org-bth-6194
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-bth-61942018-01-12T05:13:41ZEvaluation of selected data mining algorithms implemented in Medical Decision Support SystemsengAftarczuk, KamilaBlekinge Tekniska Högskola, Avdelningen för programvarusystem2007Naïve BayesMultilayer PerceptronC4.5medical data miningmedical decision supportComputer SciencesDatavetenskap (datalogi)Software EngineeringProgramvaruteknikThe goal of this master’s thesis is to identify and evaluate data mining algorithms which are commonly implemented in modern Medical Decision Support Systems (MDSS). They are used in various healthcare units all over the world. These institutions store large amounts of medical data. This data may contain relevant medical information hidden in various patterns buried among the records. Within the research several popular MDSS’s are analyzed in order to determine the most common data mining algorithms utilized by them. Three algorithms have been identified: Naïve Bayes, Multilayer Perceptron and C4.5. Prior to the very analyses the algorithms are calibrated. Several testing configurations are tested in order to determine the best setting for the algorithms. Afterwards, an ultimate comparison of the algorithms orders them with respect to their performance. The evaluation is based on a set of performance metrics. The analyses are conducted in WEKA on five UCI medical datasets: breast cancer, hepatitis, heart disease, dermatology disease, diabetes. The analyses have shown that it is very difficult to name a single data mining algorithm to be the most suitable for the medical data. The results gained for the algorithms were very similar. However, the final evaluation of the outcomes allowed singling out the Naïve Bayes to be the best classifier for the given domain. It was followed by the Multilayer Perceptron and the C4.5. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:bth-6194Local oai:bth.se:arkivex06EE332670EA55D3C125736E00417C3Aapplication/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Naïve Bayes
Multilayer Perceptron
C4.5
medical data mining
medical decision support
Computer Sciences
Datavetenskap (datalogi)
Software Engineering
Programvaruteknik
spellingShingle Naïve Bayes
Multilayer Perceptron
C4.5
medical data mining
medical decision support
Computer Sciences
Datavetenskap (datalogi)
Software Engineering
Programvaruteknik
Aftarczuk, Kamila
Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems
description The goal of this master’s thesis is to identify and evaluate data mining algorithms which are commonly implemented in modern Medical Decision Support Systems (MDSS). They are used in various healthcare units all over the world. These institutions store large amounts of medical data. This data may contain relevant medical information hidden in various patterns buried among the records. Within the research several popular MDSS’s are analyzed in order to determine the most common data mining algorithms utilized by them. Three algorithms have been identified: Naïve Bayes, Multilayer Perceptron and C4.5. Prior to the very analyses the algorithms are calibrated. Several testing configurations are tested in order to determine the best setting for the algorithms. Afterwards, an ultimate comparison of the algorithms orders them with respect to their performance. The evaluation is based on a set of performance metrics. The analyses are conducted in WEKA on five UCI medical datasets: breast cancer, hepatitis, heart disease, dermatology disease, diabetes. The analyses have shown that it is very difficult to name a single data mining algorithm to be the most suitable for the medical data. The results gained for the algorithms were very similar. However, the final evaluation of the outcomes allowed singling out the Naïve Bayes to be the best classifier for the given domain. It was followed by the Multilayer Perceptron and the C4.5.
author Aftarczuk, Kamila
author_facet Aftarczuk, Kamila
author_sort Aftarczuk, Kamila
title Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems
title_short Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems
title_full Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems
title_fullStr Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems
title_full_unstemmed Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems
title_sort evaluation of selected data mining algorithms implemented in medical decision support systems
publisher Blekinge Tekniska Högskola, Avdelningen för programvarusystem
publishDate 2007
url http://urn.kb.se/resolve?urn=urn:nbn:se:bth-6194
work_keys_str_mv AT aftarczukkamila evaluationofselecteddataminingalgorithmsimplementedinmedicaldecisionsupportsystems
_version_ 1718606185010036736