Dental implants success prediction by classifier ensemble on imbalanced data
Background: In recent years, increasing advances in data storage have led to an increase in the volume of medical data. Data mining has a significant role in different aspects of medical sciences, including extracting knowledge from the volume of data, diagnosing diseases, determining the effectiven...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2021-01-01
|
Series: | Computer Methods and Programs in Biomedicine Update |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2666990021000203 |
id |
doaj-d0acbca88e7f4530aba19c5e806e8082 |
---|---|
record_format |
Article |
spelling |
doaj-d0acbca88e7f4530aba19c5e806e80822021-07-15T04:28:47ZengElsevierComputer Methods and Programs in Biomedicine Update2666-99002021-01-011100021Dental implants success prediction by classifier ensemble on imbalanced dataMostafa Sabzekar0Motahare Namakin1Hanie Alipoor Shahr Babaki2Arash Deldari3Vahide Babaiyan4Department of Computer Engineering, Birjand University of Technology, Birjand, Iran; Corresponding author.Department of Computer Engineering, Ferdowsi University of Mashhad, IranDepartment of Computer Engineering, Islamic Azad University, Birjand, IranDepartment of Computer Engineering, University of Torbat Heydarieh, IranDepartment of Computer Engineering, Birjand University of Technology, Birjand, IranBackground: In recent years, increasing advances in data storage have led to an increase in the volume of medical data. Data mining has a significant role in different aspects of medical sciences, including extracting knowledge from the volume of data, diagnosing diseases, determining the effectiveness of drugs, identifying treatment methods, and predicting the success of treatment. The dental implant is one of the most common methods for tooth root replacement used in prosthetic dentistry. Objective: Accurate prediction of implant success or failure before the surgery is an interesting and challenging topic for dentists. However, imbalanced data is one of the main problems in any medical data mining application. This problem occurs when samples of one or more classes of data are scarce or difficult to collect. This challenge poses severe problems in extracting knowledge from data. Methods: In this paper, we present a new approach for predicting the success or failure of dental implants in the presence of imbalanced data. To this aim, first, the imbalanced data is clustered, and then each cluster is balanced by SMOTE algorithm. In the next step, the balanced data are classified by an ensemble of four well-known classifiers, including the decision tree, the support vector machine, the k-nearest neighbor, and the Naïve Bayes. To improve the classification accuracy, an optimal weight is determined for each classifier using the genetic algorithm. Thus, each classifier will have a different effect on determining the final decision. Results: : The proposed method was applied to a dataset consists of 224 patients who underwent implant and bone graft treatments at the Faculty of Dentistry of the University of Tehran, Iran. Different criteria were used to evaluate the proposed method, including accuracy, sensitivity, specificity, AUC, and G-mean. The evaluation results show that the proposed ensemble algorithm provides a more accurate implant success prediction on imbalanced data compared to using the single-use of each classifier and also state-of-the-art researches. Conclusions: This paper presents a novel ensemble method algorithm to predict the success or failure of dental implants for imbalanced data. The obtained results reported that the use of the hybrid algorithm increases the accuracy parameter by about 5.5%, the sensitivity parameter by about 0.3%, and the specificity parameter with a significant amount of about 25%, in comparison to the best results achieved by single classifiers.http://www.sciencedirect.com/science/article/pii/S2666990021000203Imbalanced dataDental implantsSMOTE algorithmClassifier ensembleGenetic algorithm |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mostafa Sabzekar Motahare Namakin Hanie Alipoor Shahr Babaki Arash Deldari Vahide Babaiyan |
spellingShingle |
Mostafa Sabzekar Motahare Namakin Hanie Alipoor Shahr Babaki Arash Deldari Vahide Babaiyan Dental implants success prediction by classifier ensemble on imbalanced data Computer Methods and Programs in Biomedicine Update Imbalanced data Dental implants SMOTE algorithm Classifier ensemble Genetic algorithm |
author_facet |
Mostafa Sabzekar Motahare Namakin Hanie Alipoor Shahr Babaki Arash Deldari Vahide Babaiyan |
author_sort |
Mostafa Sabzekar |
title |
Dental implants success prediction by classifier ensemble on imbalanced data |
title_short |
Dental implants success prediction by classifier ensemble on imbalanced data |
title_full |
Dental implants success prediction by classifier ensemble on imbalanced data |
title_fullStr |
Dental implants success prediction by classifier ensemble on imbalanced data |
title_full_unstemmed |
Dental implants success prediction by classifier ensemble on imbalanced data |
title_sort |
dental implants success prediction by classifier ensemble on imbalanced data |
publisher |
Elsevier |
series |
Computer Methods and Programs in Biomedicine Update |
issn |
2666-9900 |
publishDate |
2021-01-01 |
description |
Background: In recent years, increasing advances in data storage have led to an increase in the volume of medical data. Data mining has a significant role in different aspects of medical sciences, including extracting knowledge from the volume of data, diagnosing diseases, determining the effectiveness of drugs, identifying treatment methods, and predicting the success of treatment. The dental implant is one of the most common methods for tooth root replacement used in prosthetic dentistry. Objective: Accurate prediction of implant success or failure before the surgery is an interesting and challenging topic for dentists. However, imbalanced data is one of the main problems in any medical data mining application. This problem occurs when samples of one or more classes of data are scarce or difficult to collect. This challenge poses severe problems in extracting knowledge from data. Methods: In this paper, we present a new approach for predicting the success or failure of dental implants in the presence of imbalanced data. To this aim, first, the imbalanced data is clustered, and then each cluster is balanced by SMOTE algorithm. In the next step, the balanced data are classified by an ensemble of four well-known classifiers, including the decision tree, the support vector machine, the k-nearest neighbor, and the Naïve Bayes. To improve the classification accuracy, an optimal weight is determined for each classifier using the genetic algorithm. Thus, each classifier will have a different effect on determining the final decision. Results: : The proposed method was applied to a dataset consists of 224 patients who underwent implant and bone graft treatments at the Faculty of Dentistry of the University of Tehran, Iran. Different criteria were used to evaluate the proposed method, including accuracy, sensitivity, specificity, AUC, and G-mean. The evaluation results show that the proposed ensemble algorithm provides a more accurate implant success prediction on imbalanced data compared to using the single-use of each classifier and also state-of-the-art researches. Conclusions: This paper presents a novel ensemble method algorithm to predict the success or failure of dental implants for imbalanced data. The obtained results reported that the use of the hybrid algorithm increases the accuracy parameter by about 5.5%, the sensitivity parameter by about 0.3%, and the specificity parameter with a significant amount of about 25%, in comparison to the best results achieved by single classifiers. |
topic |
Imbalanced data Dental implants SMOTE algorithm Classifier ensemble Genetic algorithm |
url |
http://www.sciencedirect.com/science/article/pii/S2666990021000203 |
work_keys_str_mv |
AT mostafasabzekar dentalimplantssuccesspredictionbyclassifierensembleonimbalanceddata AT motaharenamakin dentalimplantssuccesspredictionbyclassifierensembleonimbalanceddata AT haniealipoorshahrbabaki dentalimplantssuccesspredictionbyclassifierensembleonimbalanceddata AT arashdeldari dentalimplantssuccesspredictionbyclassifierensembleonimbalanceddata AT vahidebabaiyan dentalimplantssuccesspredictionbyclassifierensembleonimbalanceddata |
_version_ |
1721301930121101312 |