TY - JOUR
T1 - High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks
T2 - Application to invasive breast cancer detection
AU - Cruz-Roa, Angel
AU - Gilmore, Hannah
AU - Basavanhally, Ajay
AU - Feldman, Michael
AU - Ganesan, Shridar
AU - Shih, Natalie
AU - Tomaszewski, John
AU - Madabhushi, Anant
AU - González, Fabio
N1 - Funding Information:
Research reported in this publication was funded by doctoral fellowship grant from the Administrative Department of Science, Technology and Innovation - Colciencias (528/2011), Universidad Nacional de Colombia, projects C03-F02-35-2015 and C05-F02-039-2016 from Universidad de los Llanos, the National Cancer Institute of the National Institutes of Health under award numbers 1U24CA199374-01; R01CA202752-01A1; R01CA208236-01A1; R21CA179327-01; R21CA195152-01 the National Institute of Diabetes and Digestive and Kidney Diseases under award number R01DK098503-02, National Center for Research Resources under award number 1 C06 RR12463-01; the United States Department of Defense Prostate Cancer Synergistic Idea Development Award (PC120857); the United States Department of Defense Lung Cancer Idea Development New Investigator Award (LC130463); the United States Department of Defense Prostate Cancer Idea Development Award; the United States Department of Defense Peer Reviewed Cancer Research Program W81XWH-16-1-0329; the Case Comprehensive Cancer Center Pilot Grant; VelaSano Grant from the Cleveland Clinic; the Wallace H. Coulter Foundation Program in the Department of Biomedical Engineering at Case Western Reserve University. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Publisher Copyright:
© 2018 Cruz-Roa et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2018/5
Y1 - 2018/5
N2 - Precise detection of invasive cancer on whole-slide images (WSI) is a critical first step in digital pathology tasks of diagnosis and grading. Convolutional neural network (CNN) is the most popular representation learning method for computer vision tasks, which have been successfully applied in digital pathology, including tumor and mitosis detection. However, CNNs are typically only tenable with relatively small image sizes (200 × 200 pixels). Only recently, Fully convolutional networks (FCN) are able to deal with larger image sizes (500 × 500 pixels) for semantic segmentation. Hence, the direct application of CNNs to WSI is not computationally feasible because for a WSI, a CNN would require billions or trillions of parameters. To alleviate this issue, this paper presents a novel method, High-throughput Adaptive Sampling for whole-slide Histopathology Image analysis (HASHI), which involves: i) a new efficient adaptive sampling method based on probability gradient and quasi-Monte Carlo sampling, and, ii) a powerful representation learning classifier based on CNNs. We applied HASHI to automated detection of invasive breast cancer on WSI. HASHI was trained and validated using three different data cohorts involving near 500 cases and then independently tested on 195 studies from The Cancer Genome Atlas. The results show that (1) the adaptive sampling method is an effective strategy to deal with WSI without compromising prediction accuracy by obtaining comparative results of a dense sampling (*6 million of samples in 24 hours) with far fewer samples (*2,000 samples in 1 minute), and (2) on an independent test dataset, HASHI is effective androbust to data from multiple sites, scanners, and platforms, achieving an average Dice coefficient of 76%.
AB - Precise detection of invasive cancer on whole-slide images (WSI) is a critical first step in digital pathology tasks of diagnosis and grading. Convolutional neural network (CNN) is the most popular representation learning method for computer vision tasks, which have been successfully applied in digital pathology, including tumor and mitosis detection. However, CNNs are typically only tenable with relatively small image sizes (200 × 200 pixels). Only recently, Fully convolutional networks (FCN) are able to deal with larger image sizes (500 × 500 pixels) for semantic segmentation. Hence, the direct application of CNNs to WSI is not computationally feasible because for a WSI, a CNN would require billions or trillions of parameters. To alleviate this issue, this paper presents a novel method, High-throughput Adaptive Sampling for whole-slide Histopathology Image analysis (HASHI), which involves: i) a new efficient adaptive sampling method based on probability gradient and quasi-Monte Carlo sampling, and, ii) a powerful representation learning classifier based on CNNs. We applied HASHI to automated detection of invasive breast cancer on WSI. HASHI was trained and validated using three different data cohorts involving near 500 cases and then independently tested on 195 studies from The Cancer Genome Atlas. The results show that (1) the adaptive sampling method is an effective strategy to deal with WSI without compromising prediction accuracy by obtaining comparative results of a dense sampling (*6 million of samples in 24 hours) with far fewer samples (*2,000 samples in 1 minute), and (2) on an independent test dataset, HASHI is effective androbust to data from multiple sites, scanners, and platforms, achieving an average Dice coefficient of 76%.
UR - http://www.scopus.com/inward/record.url?scp=85047379249&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85047379249&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0196828
DO - 10.1371/journal.pone.0196828
M3 - Article
C2 - 29795581
AN - SCOPUS:85047379249
SN - 1932-6203
VL - 13
JO - PloS one
JF - PloS one
IS - 5
M1 - e0196828
ER -