Dental implants success prediction by classifier ensemble on imbalanced data

Background: In recent years, increasing advances in data storage have led to an increase in the volume of medical data. Data mining has a significant role in different aspects of medical sciences, including extracting knowledge from the volume of data, diagnosing diseases, determining the effectiven...

Full description

Bibliographic Details
Main Authors: Mostafa Sabzekar, Motahare Namakin, Hanie Alipoor Shahr Babaki, Arash Deldari, Vahide Babaiyan
Format: Article
Language:English
Published: Elsevier 2021-01-01
Series:Computer Methods and Programs in Biomedicine Update
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666990021000203
id doaj-d0acbca88e7f4530aba19c5e806e8082
record_format Article
spelling doaj-d0acbca88e7f4530aba19c5e806e80822021-07-15T04:28:47ZengElsevierComputer Methods and Programs in Biomedicine Update2666-99002021-01-011100021Dental implants success prediction by classifier ensemble on imbalanced dataMostafa Sabzekar0Motahare Namakin1Hanie Alipoor Shahr Babaki2Arash Deldari3Vahide Babaiyan4Department of Computer Engineering, Birjand University of Technology, Birjand, Iran; Corresponding author.Department of Computer Engineering, Ferdowsi University of Mashhad, IranDepartment of Computer Engineering, Islamic Azad University, Birjand, IranDepartment of Computer Engineering, University of Torbat Heydarieh, IranDepartment of Computer Engineering, Birjand University of Technology, Birjand, IranBackground: In recent years, increasing advances in data storage have led to an increase in the volume of medical data. Data mining has a significant role in different aspects of medical sciences, including extracting knowledge from the volume of data, diagnosing diseases, determining the effectiveness of drugs, identifying treatment methods, and predicting the success of treatment. The dental implant is one of the most common methods for tooth root replacement used in prosthetic dentistry. Objective: Accurate prediction of implant success or failure before the surgery is an interesting and challenging topic for dentists. However, imbalanced data is one of the main problems in any medical data mining application. This problem occurs when samples of one or more classes of data are scarce or difficult to collect. This challenge poses severe problems in extracting knowledge from data. Methods: In this paper, we present a new approach for predicting the success or failure of dental implants in the presence of imbalanced data. To this aim, first, the imbalanced data is clustered, and then each cluster is balanced by SMOTE algorithm. In the next step, the balanced data are classified by an ensemble of four well-known classifiers, including the decision tree, the support vector machine, the k-nearest neighbor, and the Naïve Bayes. To improve the classification accuracy, an optimal weight is determined for each classifier using the genetic algorithm. Thus, each classifier will have a different effect on determining the final decision. Results: : The proposed method was applied to a dataset consists of 224 patients who underwent implant and bone graft treatments at the Faculty of Dentistry of the University of Tehran, Iran. Different criteria were used to evaluate the proposed method, including accuracy, sensitivity, specificity, AUC, and G-mean. The evaluation results show that the proposed ensemble algorithm provides a more accurate implant success prediction on imbalanced data compared to using the single-use of each classifier and also state-of-the-art researches. Conclusions: This paper presents a novel ensemble method algorithm to predict the success or failure of dental implants for imbalanced data. The obtained results reported that the use of the hybrid algorithm increases the accuracy parameter by about 5.5%, the sensitivity parameter by about 0.3%, and the specificity parameter with a significant amount of about 25%, in comparison to the best results achieved by single classifiers.http://www.sciencedirect.com/science/article/pii/S2666990021000203Imbalanced dataDental implantsSMOTE algorithmClassifier ensembleGenetic algorithm
collection DOAJ
language English
format Article
sources DOAJ
author Mostafa Sabzekar
Motahare Namakin
Hanie Alipoor Shahr Babaki
Arash Deldari
Vahide Babaiyan
spellingShingle Mostafa Sabzekar
Motahare Namakin
Hanie Alipoor Shahr Babaki
Arash Deldari
Vahide Babaiyan
Dental implants success prediction by classifier ensemble on imbalanced data
Computer Methods and Programs in Biomedicine Update
Imbalanced data
Dental implants
SMOTE algorithm
Classifier ensemble
Genetic algorithm
author_facet Mostafa Sabzekar
Motahare Namakin
Hanie Alipoor Shahr Babaki
Arash Deldari
Vahide Babaiyan
author_sort Mostafa Sabzekar
title Dental implants success prediction by classifier ensemble on imbalanced data
title_short Dental implants success prediction by classifier ensemble on imbalanced data
title_full Dental implants success prediction by classifier ensemble on imbalanced data
title_fullStr Dental implants success prediction by classifier ensemble on imbalanced data
title_full_unstemmed Dental implants success prediction by classifier ensemble on imbalanced data
title_sort dental implants success prediction by classifier ensemble on imbalanced data
publisher Elsevier
series Computer Methods and Programs in Biomedicine Update
issn 2666-9900
publishDate 2021-01-01
description Background: In recent years, increasing advances in data storage have led to an increase in the volume of medical data. Data mining has a significant role in different aspects of medical sciences, including extracting knowledge from the volume of data, diagnosing diseases, determining the effectiveness of drugs, identifying treatment methods, and predicting the success of treatment. The dental implant is one of the most common methods for tooth root replacement used in prosthetic dentistry. Objective: Accurate prediction of implant success or failure before the surgery is an interesting and challenging topic for dentists. However, imbalanced data is one of the main problems in any medical data mining application. This problem occurs when samples of one or more classes of data are scarce or difficult to collect. This challenge poses severe problems in extracting knowledge from data. Methods: In this paper, we present a new approach for predicting the success or failure of dental implants in the presence of imbalanced data. To this aim, first, the imbalanced data is clustered, and then each cluster is balanced by SMOTE algorithm. In the next step, the balanced data are classified by an ensemble of four well-known classifiers, including the decision tree, the support vector machine, the k-nearest neighbor, and the Naïve Bayes. To improve the classification accuracy, an optimal weight is determined for each classifier using the genetic algorithm. Thus, each classifier will have a different effect on determining the final decision. Results: : The proposed method was applied to a dataset consists of 224 patients who underwent implant and bone graft treatments at the Faculty of Dentistry of the University of Tehran, Iran. Different criteria were used to evaluate the proposed method, including accuracy, sensitivity, specificity, AUC, and G-mean. The evaluation results show that the proposed ensemble algorithm provides a more accurate implant success prediction on imbalanced data compared to using the single-use of each classifier and also state-of-the-art researches. Conclusions: This paper presents a novel ensemble method algorithm to predict the success or failure of dental implants for imbalanced data. The obtained results reported that the use of the hybrid algorithm increases the accuracy parameter by about 5.5%, the sensitivity parameter by about 0.3%, and the specificity parameter with a significant amount of about 25%, in comparison to the best results achieved by single classifiers.
topic Imbalanced data
Dental implants
SMOTE algorithm
Classifier ensemble
Genetic algorithm
url http://www.sciencedirect.com/science/article/pii/S2666990021000203
work_keys_str_mv AT mostafasabzekar dentalimplantssuccesspredictionbyclassifierensembleonimbalanceddata
AT motaharenamakin dentalimplantssuccesspredictionbyclassifierensembleonimbalanceddata
AT haniealipoorshahrbabaki dentalimplantssuccesspredictionbyclassifierensembleonimbalanceddata
AT arashdeldari dentalimplantssuccesspredictionbyclassifierensembleonimbalanceddata
AT vahidebabaiyan dentalimplantssuccesspredictionbyclassifierensembleonimbalanceddata
_version_ 1721301930121101312