Machine Learning Based Classification of Colorectal Cancer Tumour Tissue in Whole-Slide Images

被引:7
作者
Morkunas, Mindaugas [1 ,2 ]
Treigys, Povilas [1 ]
Bernataviciene, Jolita [1 ]
Laurinavicius, Arvydas [2 ]
Korvel, Grazina [1 ]
机构
[1] Vilnius Univ, Inst Data Sci & Digital Technol, Akad Str 4, LT-08663 Vilnius, Lithuania
[2] Vilnius Univ Hosp Santaros Klin, Natl Ctr Pathol, P Baublio Str 5, LT-08406 Vilnius, Lithuania
关键词
tumour; whole-slide image; machine learning; superpixel; ground truth; colour and texture features; convolutional neural network; EXPRESSION;
D O I
10.15388/Informatica.2018.158
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The recent introduction of whole-slide scanning systems enabled accumulation of high-quality pathology images into large collections, thus opening new perspectives in cancer research, as well as new analysis challenges. Automated identification of tumour tissue in the whole-slide image enables further use of developed grading systems that classify tumour cell abnormalities and predict tumour developments. In this article, we describe several possibilities to achieve epithelium-stroma classification of tumour tissues in digital pathology images by employing annotated superpixels to train machine learning algorithms. We emphasize that annotating superpixels rather than manually outlining tissue classes in raw images is less time consuming, and more effective way of producing ground truth for computational pathology pipelines. In our approach feature space for supervised learning is created from tissue class assigned superpixels by extracting colour and texture parameters, and applying dimensionality reduction methods. Alternatively, to train convolutional neural network, labelled superpixels are used to generate square image patches by moving fixed size window around each superpixel centroid. The proposed method simplifies the process of ground truth data collection and should minimize the time spent by a skilled expert to perform manual annotation of whole-slide images. We evaluate our method on a private data set of colorectal cancer images. Obtained results confirm that a method produces accurate reference data suitable for the use of different machine learning based classification algorithms.
引用
收藏
页码:75 / 90
页数:16
相关论文
共 30 条
[1]  
Abadi Martin, 2016, arXiv
[2]   SLIC Superpixels Compared to State-of-the-Art Superpixel Methods [J].
Achanta, Radhakrishna ;
Shaji, Appu ;
Smith, Kevin ;
Lucchi, Aurelien ;
Fua, Pascal ;
Suesstrunk, Sabine .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2274-2281
[3]   Image statistics and data mining of anal intraepithelial neoplasia [J].
Ahammer, H. ;
Kroepfl, J. M. ;
Hackl, Ch. ;
Sedivy, R. .
PATTERN RECOGNITION LETTERS, 2008, 29 (16) :2189-2196
[4]  
[Anonymous], 2002, COMP ALGORITHMS MAXI, DOI DOI 10.3115/1118853.1118871
[5]  
[Anonymous], 2010, Journal of Machine Learning Research, DOI DOI 10.5555/1756006.1859899
[6]   Automated Detection of DCIS in Whole-Slide H&E Stained Breast Histopathology Images [J].
Bejnordi, Babak Ehteshami ;
Balkenhol, Maschenka ;
Litjens, Geert ;
Holland, Roland ;
Bult, Peter ;
Karssemeijer, Nico ;
van der Laak, Jeroen A. W. M. .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (09) :2141-2150
[7]   Discrimination between tumour epithelium and stroma via perception-based features [J].
Bianconi, Francesco ;
Alvarez-Larran, Alberto ;
Fernandez, Antonio .
NEUROCOMPUTING, 2015, 154 :119-126
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]  
Bunyak F., 2015, IEEE INT C BIOINF BI, DOI [10.1109/BIBM.2015.7359726, DOI 10.1109/BIBM.2015.7359726]
[10]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411