Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems
The goal of this master’s thesis is to identify and evaluate data mining algorithms which are commonly implemented in modern Medical Decision Support Systems (MDSS). They are used in various healthcare units all over the world. These institutions store large amounts of medical data. This data may co...
Main Author: | |
---|---|
Format: | Others |
Language: | English |
Published: |
Blekinge Tekniska Högskola, Avdelningen för programvarusystem
2007
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:bth-6194 |
id |
ndltd-UPSALLA1-oai-DiVA.org-bth-6194 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-bth-61942018-01-12T05:13:41ZEvaluation of selected data mining algorithms implemented in Medical Decision Support SystemsengAftarczuk, KamilaBlekinge Tekniska Högskola, Avdelningen för programvarusystem2007Naïve BayesMultilayer PerceptronC4.5medical data miningmedical decision supportComputer SciencesDatavetenskap (datalogi)Software EngineeringProgramvaruteknikThe goal of this master’s thesis is to identify and evaluate data mining algorithms which are commonly implemented in modern Medical Decision Support Systems (MDSS). They are used in various healthcare units all over the world. These institutions store large amounts of medical data. This data may contain relevant medical information hidden in various patterns buried among the records. Within the research several popular MDSS’s are analyzed in order to determine the most common data mining algorithms utilized by them. Three algorithms have been identified: Naïve Bayes, Multilayer Perceptron and C4.5. Prior to the very analyses the algorithms are calibrated. Several testing configurations are tested in order to determine the best setting for the algorithms. Afterwards, an ultimate comparison of the algorithms orders them with respect to their performance. The evaluation is based on a set of performance metrics. The analyses are conducted in WEKA on five UCI medical datasets: breast cancer, hepatitis, heart disease, dermatology disease, diabetes. The analyses have shown that it is very difficult to name a single data mining algorithm to be the most suitable for the medical data. The results gained for the algorithms were very similar. However, the final evaluation of the outcomes allowed singling out the Naïve Bayes to be the best classifier for the given domain. It was followed by the Multilayer Perceptron and the C4.5. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:bth-6194Local oai:bth.se:arkivex06EE332670EA55D3C125736E00417C3Aapplication/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
Naïve Bayes Multilayer Perceptron C4.5 medical data mining medical decision support Computer Sciences Datavetenskap (datalogi) Software Engineering Programvaruteknik |
spellingShingle |
Naïve Bayes Multilayer Perceptron C4.5 medical data mining medical decision support Computer Sciences Datavetenskap (datalogi) Software Engineering Programvaruteknik Aftarczuk, Kamila Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems |
description |
The goal of this master’s thesis is to identify and evaluate data mining algorithms which are commonly implemented in modern Medical Decision Support Systems (MDSS). They are used in various healthcare units all over the world. These institutions store large amounts of medical data. This data may contain relevant medical information hidden in various patterns buried among the records. Within the research several popular MDSS’s are analyzed in order to determine the most common data mining algorithms utilized by them. Three algorithms have been identified: Naïve Bayes, Multilayer Perceptron and C4.5. Prior to the very analyses the algorithms are calibrated. Several testing configurations are tested in order to determine the best setting for the algorithms. Afterwards, an ultimate comparison of the algorithms orders them with respect to their performance. The evaluation is based on a set of performance metrics. The analyses are conducted in WEKA on five UCI medical datasets: breast cancer, hepatitis, heart disease, dermatology disease, diabetes. The analyses have shown that it is very difficult to name a single data mining algorithm to be the most suitable for the medical data. The results gained for the algorithms were very similar. However, the final evaluation of the outcomes allowed singling out the Naïve Bayes to be the best classifier for the given domain. It was followed by the Multilayer Perceptron and the C4.5. |
author |
Aftarczuk, Kamila |
author_facet |
Aftarczuk, Kamila |
author_sort |
Aftarczuk, Kamila |
title |
Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems |
title_short |
Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems |
title_full |
Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems |
title_fullStr |
Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems |
title_full_unstemmed |
Evaluation of selected data mining algorithms implemented in Medical Decision Support Systems |
title_sort |
evaluation of selected data mining algorithms implemented in medical decision support systems |
publisher |
Blekinge Tekniska Högskola, Avdelningen för programvarusystem |
publishDate |
2007 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:se:bth-6194 |
work_keys_str_mv |
AT aftarczukkamila evaluationofselecteddataminingalgorithmsimplementedinmedicaldecisionsupportsystems |
_version_ |
1718606185010036736 |