Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypes

Background: Artificial intelligence has an emerging progress in diagnostic pathology. A large number of studies of applying deep learning models to histopathological images have been published in recent years. While many studies claim high accuracies, they may fall into the pitfalls of overfitting a...

Full description

Bibliographic Details
Main Authors: Haiming Tang, Nanfei Sun, Steven Shen
Format: Article
Language:English
Published: Wolters Kluwer Medknow Publications 2021-01-01
Series:Journal of Pathology Informatics
Subjects:
Online Access:http://www.jpathinformatics.org/article.asp?issn=2153-3539;year=2021;volume=12;issue=1;spage=30;epage=30;aulast=Tang
id doaj-1e1fa3d37e37487ba6e7d02f9449b21f
record_format Article
spelling doaj-1e1fa3d37e37487ba6e7d02f9449b21f2021-09-08T03:48:10ZengWolters Kluwer Medknow PublicationsJournal of Pathology Informatics2229-50892021-01-01121303010.4103/jpi.jpi_78_20Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypesHaiming TangNanfei SunSteven ShenBackground: Artificial intelligence has an emerging progress in diagnostic pathology. A large number of studies of applying deep learning models to histopathological images have been published in recent years. While many studies claim high accuracies, they may fall into the pitfalls of overfitting and lack of generalization due to the high variability of the histopathological images. Aims and Objects: Use the model training of osteosarcoma as an example to illustrate the pitfalls of overfitting and how the addition of model input variability can help improve model performance. Materials and Methods: We use the publicly available osteosarcoma dataset to retrain a previously published classification model for osteosarcoma. We partition the same set of images into the training and testing datasets differently than the original study: the test dataset consists of images from one patient while the training dataset consists images of all other patients. We also show the influence of training data variability on model performance by collecting a minimal dataset of 10 osteosarcoma subtypes as well as benign tissues and benign bone tumors of differentiation. Results: The performance of the re-trained model on the test set using the new partition schema declines dramatically, indicating a lack of model generalization and overfitting. We show the additions of more and moresubtypes into the training data step by step under the same model schema yield a series of coherent models with increasing performances. Conclusions: In conclusion, we bring forward data preprocessing and collection tactics for histopathological images of high variability to avoid the pitfalls of overfitting and build deep learning models of higher generalization abilities.http://www.jpathinformatics.org/article.asp?issn=2153-3539;year=2021;volume=12;issue=1;spage=30;epage=30;aulast=Tangartificial intelligencecomputer visiondeep learningdiagnostic pathologyosteosarcomaoverfitting
collection DOAJ
language English
format Article
sources DOAJ
author Haiming Tang
Nanfei Sun
Steven Shen
spellingShingle Haiming Tang
Nanfei Sun
Steven Shen
Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypes
Journal of Pathology Informatics
artificial intelligence
computer vision
deep learning
diagnostic pathology
osteosarcoma
overfitting
author_facet Haiming Tang
Nanfei Sun
Steven Shen
author_sort Haiming Tang
title Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypes
title_short Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypes
title_full Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypes
title_fullStr Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypes
title_full_unstemmed Improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: Experiments on osteosarcoma subtypes
title_sort improving generalization of deep learning models for diagnostic pathology by increasing variability in training data: experiments on osteosarcoma subtypes
publisher Wolters Kluwer Medknow Publications
series Journal of Pathology Informatics
issn 2229-5089
publishDate 2021-01-01
description Background: Artificial intelligence has an emerging progress in diagnostic pathology. A large number of studies of applying deep learning models to histopathological images have been published in recent years. While many studies claim high accuracies, they may fall into the pitfalls of overfitting and lack of generalization due to the high variability of the histopathological images. Aims and Objects: Use the model training of osteosarcoma as an example to illustrate the pitfalls of overfitting and how the addition of model input variability can help improve model performance. Materials and Methods: We use the publicly available osteosarcoma dataset to retrain a previously published classification model for osteosarcoma. We partition the same set of images into the training and testing datasets differently than the original study: the test dataset consists of images from one patient while the training dataset consists images of all other patients. We also show the influence of training data variability on model performance by collecting a minimal dataset of 10 osteosarcoma subtypes as well as benign tissues and benign bone tumors of differentiation. Results: The performance of the re-trained model on the test set using the new partition schema declines dramatically, indicating a lack of model generalization and overfitting. We show the additions of more and moresubtypes into the training data step by step under the same model schema yield a series of coherent models with increasing performances. Conclusions: In conclusion, we bring forward data preprocessing and collection tactics for histopathological images of high variability to avoid the pitfalls of overfitting and build deep learning models of higher generalization abilities.
topic artificial intelligence
computer vision
deep learning
diagnostic pathology
osteosarcoma
overfitting
url http://www.jpathinformatics.org/article.asp?issn=2153-3539;year=2021;volume=12;issue=1;spage=30;epage=30;aulast=Tang
work_keys_str_mv AT haimingtang improvinggeneralizationofdeeplearningmodelsfordiagnosticpathologybyincreasingvariabilityintrainingdataexperimentsonosteosarcomasubtypes
AT nanfeisun improvinggeneralizationofdeeplearningmodelsfordiagnosticpathologybyincreasingvariabilityintrainingdataexperimentsonosteosarcomasubtypes
AT stevenshen improvinggeneralizationofdeeplearningmodelsfordiagnosticpathologybyincreasingvariabilityintrainingdataexperimentsonosteosarcomasubtypes
_version_ 1717762887964950528