Assessing Diagnostic Accuracy of Doctors Without a Gold Standard using Bayesian Networks and K-modes Clustering Algorithm


Bibliographic Details
Main Authors: Parisa Niloofar, Parastoo Niloofar, Mehdi Yaseri
Format: Article
Language: English
Published: Tehran University of Medical Sciences 2019-03-01
Series: Journal of Biostatistics and Epidemiology
Online Access: https://jbe.tums.ac.ir/index.php/jbe/article/view/226
Description
Summary: Background & Aim: The diagnostic accuracy of a test is its ability to discriminate accurately between patients who have the target disease and those who do not. A common problem in assessing the diagnostic accuracy of doctors is that the true disease status is unknown, which the literature refers to as the “absence of a gold standard”. Methods & Material: In this article, a Naïve Bayesian network with a hidden class node and K-modes, a clustering algorithm for categorical data, are proposed for estimating the diagnostic accuracy of 5 physicians in diagnosing Diabetic Retinopathy. To assess and compare the efficiency of these models, a simulation study with two different scenarios is also conducted. Results: The simulation study indicates that, for the Naïve Bayesian network and a non-rare disease (prevalence 0.1 and 0.2), the coverage probability increases with the sample size. For high prevalence values, say 0.5, however, coverage probabilities are not as good as those for the non-rare disease. The efficiency of the K-modes algorithm decreases as the number of records increases, but it achieves better results when the number of records is small, prevalence is approximately 0.3, and sensitivities are high. Results for the real data set reveal that sensitivities for all physicians except one were higher than 85% and all specificities were higher than 90%. The estimated prevalence is 0.32. Conclusion: Through simulations and data analysis, we show that this new approach based on Naïve Bayesian networks provides a useful alternative to the traditional latent class modeling approaches used in this setting.
ISSN: 2383-4196, 2383-420X
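
Illustrative sketch: The summary describes a Naïve Bayesian network whose class node (true disease status) is never observed, with each physician's diagnosis as a child of that node. The record does not say how the network is fitted, so the following is only a minimal sketch under stated assumptions: binary diagnoses, ratings that are independent given the true status, and estimation by the EM algorithm (our choice for illustration, not necessarily the authors'). The function names `simulate` and `fit_hidden_class_naive_bayes`, the starting values, and the simulated parameter values are all hypothetical.

```python
import random


def simulate(n, prevalence, sens, spec, seed=0):
    """Simulate n patients rated by len(sens) raters (assumed setup).

    Ratings are drawn independently given the hidden true status.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        diseased = rng.random() < prevalence
        row = []
        for se, sp in zip(sens, spec):
            p_positive = se if diseased else 1.0 - sp
            row.append(1 if rng.random() < p_positive else 0)
        data.append(row)
    return data


def _clip(p, eps=1e-6):
    """Keep a probability strictly inside (0, 1) for numerical safety."""
    return min(max(p, eps), 1.0 - eps)


def fit_hidden_class_naive_bayes(data, n_iter=100):
    """EM estimates of prevalence, sensitivities and specificities.

    Starting values (0.5, 0.8, 0.8) are an assumption; they encode that a
    positive rating is more likely for a diseased patient.
    """
    n, k = len(data), len(data[0])
    prev, sens, spec = 0.5, [0.8] * k, [0.8] * k
    for _ in range(n_iter):
        # E-step: posterior probability that each patient is diseased.
        post = []
        for row in data:
            p1, p0 = prev, 1.0 - prev
            for j, y in enumerate(row):
                p1 *= sens[j] if y else 1.0 - sens[j]
                p0 *= 1.0 - spec[j] if y else spec[j]
            post.append(p1 / (p1 + p0))
        # M-step: posterior-weighted relative frequencies.
        w1 = max(sum(post), 1e-12)
        w0 = max(n - sum(post), 1e-12)
        prev = _clip(sum(post) / n)
        sens = [_clip(sum(p * r[j] for p, r in zip(post, data)) / w1)
                for j in range(k)]
        spec = [_clip(sum((1 - p) * (1 - r[j]) for p, r in zip(post, data)) / w0)
                for j in range(k)]
    return prev, sens, spec


if __name__ == "__main__":
    # Hypothetical values for illustration only; not the study's data.
    ratings = simulate(3000, 0.3, [0.9] * 5, [0.95] * 5, seed=1)
    prev, sens, spec = fit_hidden_class_naive_bayes(ratings)
    print(round(prev, 3), [round(s, 3) for s in sens], [round(s, 3) for s in spec])
```

Because the class labels are never observed, the fitted solution is identified only up to swapping the two classes; the starting values above are what steer the sketch toward the labelling in which a positive rating indicates disease.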