Impact of Image Context on Deep Learning for Classification of Teeth on Radiographs

<i>Objectives:</i> We aimed to assess the impact of image context information on the accuracy of deep learning models for tooth classification on panoramic dental radiographs. <i>Methods:</i> Our dataset contained 5008 panoramic radiographs with a mean number of 25.2 teeth pe...

Full description

Bibliographic Details
Main Authors: Joachim Krois, Lisa Schneider, Falk Schwendicke
Format: Article
Language:English
Published: MDPI AG 2021-04-01
Series:Journal of Clinical Medicine
Subjects:
Online Access:https://www.mdpi.com/2077-0383/10/8/1635
Description
Summary:<i>Objectives:</i> We aimed to assess the impact of image context information on the accuracy of deep learning models for tooth classification on panoramic dental radiographs. <i>Methods:</i> Our dataset contained 5008 panoramic radiographs with a mean number of 25.2 teeth per image. Teeth were segmented bounding-box-wise and classified by one expert; this was validated by another expert. Tooth segments were cropped allowing for different context; the baseline size was 100% of each box and was scaled up to capture 150%, 200%, 250% and 300% to increase context. On each of the five generated datasets, ResNet-34 classification models were trained using the Adam optimizer with a learning rate of 0.001 over 25 epochs with a batch size of 16. A total of 20% of the data was used for testing; in subgroup analyses, models were tested only on specific tooth types. Feature visualization using gradient-weighted class activation mapping (Grad-CAM) was employed to visualize salient areas. <i>Results:</i> F1-scores increased monotonically from 0.77 in the base-case (100%) to 0.93 on the largest segments (300%; <i>p</i> = 0.0083; Mann–Kendall-test). Gains in accuracy were limited between 200% and 300%. This behavior was found for all tooth types except canines, where accuracy was much higher even for smaller segments and increasing context yielded only minimal gains. With increasing context salient areas were more widely distributed over each segment; at maximum segment size, the models assessed minimum 3–4 teeth as well as the interdental or inter-arch space to come to a classification. <i>Conclusions:</i> Context matters; classification accuracy increased significantly with increasing context.
ISSN:2077-0383