Deep Learning With Sampling in Colon Cancer Histology

被引:47
作者
Shapcott, Mary [1 ]
Hewitt, Katherine J. [2 ]
Rajpoot, Nasir [1 ,2 ]
机构
[1] Univ Warwick, Dept Comp Sci, Coventry, W Midlands, England
[2] Univ Hosp Coventry & Warwickshire, Cellular Pathol Dept, Coventry, W Midlands, England
关键词
sampling; histopathology; TCGA; morphology; colon cancer; deep learning; COLORECTAL-CANCER; CLASSIFICATION; PROGNOSIS; SUBTYPE;
D O I
10.3389/fbioe.2019.00052
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
This study applied a deep-learning cell identification algorithm to diagnostic images from the colon cancer repository at The Cancer Genome Atlas (TCGA). Within-image sampling improved performance without loss of accuracy. The features thus derived were associated with various clinical variables including metastasis, residual tumor, venous invasion, and lymphatic invasion. The deep-learning algorithm was trained using images from a locally available data set, then applied to the TCGA images by tiling them, and identifying cells in each patch defined by the tiling. In this application the average number of patches containing tissue in an image was similar to 900. Processing a random sample of patches greatly reduced computation costs. The cell identification algorithm was applied directly to each sampled patch, resulting in a list of cells. Each cell was labeled with its location and classification ("epithelial," "inflammatory," "fibroblast," or "other"). The number of cells of a given type in the patch was calculated, resulting in a patch profile containing four features. A morphological profile that applied to the entire image was obtained by averaging profiles over all patches. Two sampling policies were examined. The first policy was random sampling which samples patches with uniform weighting. The second policy was systematic random sampling which takes spatial dependencies into account. Compared with the processing of complete whole slide images there was a seven-fold improvement in performance when systematic random spatial sampling was used to select 100 tiles from the whole-slide image for processing, with very little loss of accuracy (similar to 4% on average). We found links between the predicted features and clinical variables in the TCGA colon cancer data set. Several significant associations were found: increased fibroblast numbers were associated with the presence of metastasis, venous invasion, lymphatic invasion and residual tumor while decreased numbers of inflammatory cells were associated with mucinous carcinomas. Regarding the four different types of cell, deep learning has generated morphological features that are indicators of cell density. The features are related to cellularity, the numbers, degree, or quality of cells present in a tumor. Cellularity has been reported to be related to patient survival and other diagnostic and prognostic indicators, indicating that the features calculated here may be of general usefulness.
引用
收藏
页数:9
相关论文
共 30 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2018, GEOSPATIAL ANAL COMP
[3]  
[Anonymous], 2018, DEF CIFAR10 MOD
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]   Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning [J].
Coudray, Nicolas ;
Ocampo, Paolo Santiago ;
Sakellaropoulos, Theodore ;
Narula, Navneet ;
Snuderl, Matija ;
Fenyo, David ;
Moreira, Andre L. ;
Razavian, Narges ;
Tsirigos, Aristotelis .
NATURE MEDICINE, 2018, 24 (10) :1559-+
[6]   Image analysis with deep learning to predict breast cancer grade, ER status, histologic subtype, and intrinsic subtype [J].
Couture, Heather D. ;
Williams, Lindsay A. ;
Geradts, Joseph ;
Nyante, Sarah J. ;
Butler, Ebonee N. ;
Marron, J. S. ;
Perou, Charles M. ;
Troester, Melissa A. ;
Niethammer, Marc .
NPJ BREAST CANCER, 2018, 4
[7]   High-throughput adaptive sampling for whole-slide histopathology image analysis (HASHI) via convolutional neural networks: Application to invasive breast cancer detection [J].
Cruz-Roa, Angel ;
Gilmore, Hannah ;
Basavanhally, Ajay ;
Feldman, Michael ;
Ganesan, Shridar ;
Shih, Natalie ;
Tomaszewski, John ;
Madabhushi, Anant ;
Gonzalez, Fabio .
PLOS ONE, 2018, 13 (05)
[8]   Tumour budding in colorectal cancer: what do we know and what can we do? [J].
De Smedt, Linde ;
Palmans, Sofie ;
Sagaert, Xavier .
VIRCHOWS ARCHIV, 2016, 468 (04) :397-408
[9]   Poor-prognosis colon cancer is defined by a molecularly distinct subtype and develops from serrated precursor lesions [J].
De Sousa E Melo, Felipe ;
Wang, Xin ;
Jansen, Marnix ;
Fessler, Evelyn ;
Trinh, Anne ;
de Rooij, Laura P. M. H. ;
de Jong, Joan H. ;
de Boer, Onno J. ;
van Leersum, Ronald ;
Bijlsma, Maarten F. ;
Rodermond, Hans ;
van der Heijden, Maartje ;
van Noesel, Carel J. M. ;
Tuynman, Jurriaan B. ;
Dekker, Evelien ;
Markowetz, Florian ;
Medema, Jan Paul ;
Vermeulen, Louis .
NATURE MEDICINE, 2013, 19 (05) :614-618
[10]  
Delmelle Eric., 2009, SAGE HDB SPATIAL ANA, P183, DOI DOI 10.4135/9780857020130.N10