High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks: Application to invasive breast cancer detection.

Precise detection of invasive cancer on whole-slide images (WSI) is a critical first step in digital pathology tasks of diagnosis and grading. Convolutional neural network (CNN) is the most popular representation learning method for computer vision tasks, which have been successfully applied in digi...

Full description

Bibliographic Details
Main Authors: Angel Cruz-Roa, Hannah Gilmore, Ajay Basavanhally, Michael Feldman, Shridar Ganesan, Natalie Shih, John Tomaszewski, Anant Madabhushi, Fabio González
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0196828
id doaj-b29ce0449f0649e0a66508d97065af00
record_format Article
spelling doaj-b29ce0449f0649e0a66508d97065af002021-03-04T11:23:25ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-01135e019682810.1371/journal.pone.0196828High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks: Application to invasive breast cancer detection.Angel Cruz-RoaHannah GilmoreAjay BasavanhallyMichael FeldmanShridar GanesanNatalie ShihJohn TomaszewskiAnant MadabhushiFabio GonzálezPrecise detection of invasive cancer on whole-slide images (WSI) is a critical first step in digital pathology tasks of diagnosis and grading. Convolutional neural network (CNN) is the most popular representation learning method for computer vision tasks, which have been successfully applied in digital pathology, including tumor and mitosis detection. However, CNNs are typically only tenable with relatively small image sizes (200 × 200 pixels). Only recently, Fully convolutional networks (FCN) are able to deal with larger image sizes (500 × 500 pixels) for semantic segmentation. Hence, the direct application of CNNs to WSI is not computationally feasible because for a WSI, a CNN would require billions or trillions of parameters. To alleviate this issue, this paper presents a novel method, High-throughput Adaptive Sampling for whole-slide Histopathology Image analysis (HASHI), which involves: i) a new efficient adaptive sampling method based on probability gradient and quasi-Monte Carlo sampling, and, ii) a powerful representation learning classifier based on CNNs. We applied HASHI to automated detection of invasive breast cancer on WSI. HASHI was trained and validated using three different data cohorts involving near 500 cases and then independently tested on 195 studies from The Cancer Genome Atlas. The results show that (1) the adaptive sampling method is an effective strategy to deal with WSI without compromising prediction accuracy by obtaining comparative results of a dense sampling (∼6 million of samples in 24 hours) with far fewer samples (∼2,000 samples in 1 minute), and (2) on an independent test dataset, HASHI is effective and robust to data from multiple sites, scanners, and platforms, achieving an average Dice coefficient of 76%.https://doi.org/10.1371/journal.pone.0196828
collection DOAJ
language English
format Article
sources DOAJ
author Angel Cruz-Roa
Hannah Gilmore
Ajay Basavanhally
Michael Feldman
Shridar Ganesan
Natalie Shih
John Tomaszewski
Anant Madabhushi
Fabio González
spellingShingle Angel Cruz-Roa
Hannah Gilmore
Ajay Basavanhally
Michael Feldman
Shridar Ganesan
Natalie Shih
John Tomaszewski
Anant Madabhushi
Fabio González
High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks: Application to invasive breast cancer detection.
PLoS ONE
author_facet Angel Cruz-Roa
Hannah Gilmore
Ajay Basavanhally
Michael Feldman
Shridar Ganesan
Natalie Shih
John Tomaszewski
Anant Madabhushi
Fabio González
author_sort Angel Cruz-Roa
title High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks: Application to invasive breast cancer detection.
title_short High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks: Application to invasive breast cancer detection.
title_full High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks: Application to invasive breast cancer detection.
title_fullStr High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks: Application to invasive breast cancer detection.
title_full_unstemmed High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks: Application to invasive breast cancer detection.
title_sort high-throughput adaptive sampling for whole-slide histopathology image analysis (hashi) via convolutional neural networks: application to invasive breast cancer detection.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2018-01-01
description Precise detection of invasive cancer on whole-slide images (WSI) is a critical first step in digital pathology tasks of diagnosis and grading. Convolutional neural network (CNN) is the most popular representation learning method for computer vision tasks, which have been successfully applied in digital pathology, including tumor and mitosis detection. However, CNNs are typically only tenable with relatively small image sizes (200 × 200 pixels). Only recently, Fully convolutional networks (FCN) are able to deal with larger image sizes (500 × 500 pixels) for semantic segmentation. Hence, the direct application of CNNs to WSI is not computationally feasible because for a WSI, a CNN would require billions or trillions of parameters. To alleviate this issue, this paper presents a novel method, High-throughput Adaptive Sampling for whole-slide Histopathology Image analysis (HASHI), which involves: i) a new efficient adaptive sampling method based on probability gradient and quasi-Monte Carlo sampling, and, ii) a powerful representation learning classifier based on CNNs. We applied HASHI to automated detection of invasive breast cancer on WSI. HASHI was trained and validated using three different data cohorts involving near 500 cases and then independently tested on 195 studies from The Cancer Genome Atlas. The results show that (1) the adaptive sampling method is an effective strategy to deal with WSI without compromising prediction accuracy by obtaining comparative results of a dense sampling (∼6 million of samples in 24 hours) with far fewer samples (∼2,000 samples in 1 minute), and (2) on an independent test dataset, HASHI is effective and robust to data from multiple sites, scanners, and platforms, achieving an average Dice coefficient of 76%.
url https://doi.org/10.1371/journal.pone.0196828
work_keys_str_mv AT angelcruzroa highthroughputadaptivesamplingforwholeslidehistopathologyimageanalysishashiviaconvolutionalneuralnetworksapplicationtoinvasivebreastcancerdetection
AT hannahgilmore highthroughputadaptivesamplingforwholeslidehistopathologyimageanalysishashiviaconvolutionalneuralnetworksapplicationtoinvasivebreastcancerdetection
AT ajaybasavanhally highthroughputadaptivesamplingforwholeslidehistopathologyimageanalysishashiviaconvolutionalneuralnetworksapplicationtoinvasivebreastcancerdetection
AT michaelfeldman highthroughputadaptivesamplingforwholeslidehistopathologyimageanalysishashiviaconvolutionalneuralnetworksapplicationtoinvasivebreastcancerdetection
AT shridarganesan highthroughputadaptivesamplingforwholeslidehistopathologyimageanalysishashiviaconvolutionalneuralnetworksapplicationtoinvasivebreastcancerdetection
AT natalieshih highthroughputadaptivesamplingforwholeslidehistopathologyimageanalysishashiviaconvolutionalneuralnetworksapplicationtoinvasivebreastcancerdetection
AT johntomaszewski highthroughputadaptivesamplingforwholeslidehistopathologyimageanalysishashiviaconvolutionalneuralnetworksapplicationtoinvasivebreastcancerdetection
AT anantmadabhushi highthroughputadaptivesamplingforwholeslidehistopathologyimageanalysishashiviaconvolutionalneuralnetworksapplicationtoinvasivebreastcancerdetection
AT fabiogonzalez highthroughputadaptivesamplingforwholeslidehistopathologyimageanalysishashiviaconvolutionalneuralnetworksapplicationtoinvasivebreastcancerdetection
_version_ 1714803713049100288