1399 H&E-stained sentinel lymph node sections of breast cancer patients: the CAMELYON dataset

被引:224
作者
Litjens, Geert [1 ]
Bandi, Peter [2 ]
Bejnordi, Babak Ehteshami [1 ]
Geessink, Oscar [1 ]
Balkenhol, Maschenka [1 ]
Bult, Peter [1 ]
Halilovic, Altuna [1 ]
Hermsen, Meyke [1 ]
van de Loo, Rob [1 ]
Vogels, Rob [1 ]
Manson, Quirine F. [2 ]
Stathonikos, Nikolas [2 ]
Baidoshvili, Alexi [3 ]
van Diest, Paul [2 ]
Wauters, Carla [4 ]
van Dijk, Marcory [5 ]
van der Laak, Jeroen [1 ]
机构
[1] Radboud Univ Nijmegen, Med Ctr, Dept Pathol, Diagnost Image Anal Grp, Huispost 824,Geert Groottepl Zuid 10, NL-6525 GA Nijmegen, Netherlands
[2] Univ Med Ctr Huispost, Dept Pathol, H04-312,Heidelberglaan 100, NL-3584 CX Utrecht, Netherlands
[3] Lab Pathol East Netherlands LabPON, Postbus 516, NL-7550 AM Hengelo, Netherlands
[4] Canisius Wilhelmina Hosp, Dept Pathol, Postbus 9015, NL-6500 GS Nijmegen, Netherlands
[5] Rijnstate Hosp, Dept Pathol, Pathol DNA, Postbus 9555, NL-6800 TA Arnhem, Netherlands
来源
GIGASCIENCE | 2018年 / 7卷 / 06期
关键词
breast cancer; lymph node metastases; whole-slide images; grand challenge; sentinel node; RECOMMENDATIONS;
D O I
10.1093/gigascience/giy065
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The presence of lymph node metastases is one of the most important factors in breast cancer prognosis. The most common way to assess regional lymph node status is the sentinel lymph node procedure. The sentinel lymph node is the most likely lymph node to contain metastasized cancer cells and is excised, histopathologically processed, and examined by a pathologist. This tedious examination process is time-consuming and can lead to small metastases being missed. However, recent advances in whole-slide imaging and machine learning have opened an avenue for analysis of digitized lymph node sections with computer algorithms. For example, convolutional neural networks, a type of machine-learning algorithm, can be used to automatically detect cancer metastases in lymph nodes with high accuracy. To train machine-learning models, large, well-curated datasets are needed. Results: We released a dataset of 1,399 annotated whole-slide images (WSIs) of lymph nodes, both with and without metastases, in 3 terabytes of data in the context of the CAMELYON16 and CAMELYON17 Grand Challenges. Slides were collected from five medical centers to cover a broad range of image appearance and staining variations. Each WSI has a slide-level label indicating whether it contains no metastases, macro-metastases, micro-metastases, or isolated tumor cells. Furthermore, for 209 WSIs, detailed hand-drawn contours for all metastases are provided. Last, open-source software tools to visualize and interact with the data have been made available. Conclusions: A unique dataset of annotated, whole-slide digital histopathology images has been provided with high potential for re-use.
引用
收藏
页数:8
相关论文
共 26 条
  • [1] Amin MB., 2016, AJCC Cancer Staging Manual, V8th, DOI DOI 10.1007/978-3-319-40618-3
  • [2] [Anonymous], GIGASCIENCE DATABASE
  • [3] [Anonymous], AUTOMATE SLIDE ANAL
  • [4] [Anonymous], 2014, SEER CANC STAT REV 1
  • [5] Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer
    Bejnordi, Babak Ehteshami
    Veta, Mitko
    van Diest, Paul Johannes
    van Ginneken, Bram
    Karssemeijer, Nico
    Litjens, Geert
    van der Laak, Jeroen A. W. M.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2017, 318 (22): : 2199 - 2210
  • [6] Clinical outcome of patients with lymph node-negative breast carcinoma who have sentinel lymph node micrometastases detected by immunohistochemistry
    Chagpar, A
    Middleton, LP
    Sahin, AA
    Meric-Bernstam, F
    Kuerer, HM
    Feig, BW
    Ross, MI
    Ames, FC
    Singletary, SE
    Buchholz, TA
    Valero, V
    Hunt, KK
    [J]. CANCER, 2005, 103 (08) : 1581 - 1586
  • [7] A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES
    COHEN, J
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) : 37 - 46
  • [8] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [9] The use of morphological characteristics and texture analysis in the identification of tissue composition in prostatic neoplasia
    Diamond, J
    Anderson, NH
    Bartels, PH
    Montironi, R
    Hamilton, PW
    [J]. HUMAN PATHOLOGY, 2004, 35 (09) : 1121 - 1131
  • [10] The American Joint Committee on Cancer: the 7th Edition of the AJCC Cancer Staging Manual and the Future of TNM
    Edge, Stephen B.
    Compton, Carolyn C.
    [J]. ANNALS OF SURGICAL ONCOLOGY, 2010, 17 (06) : 1471 - 1474