Highdicom: a Python Library for Standardized Encoding of Image Annotations and Machine Learning Model Outputs in Pathology and Radiology

Cited by: 10
Authors
Bridge, Christopher P. [1 ,2 ]
Gorman, Chris [3 ]
Pieper, Steven [4 ]
Doyle, Sean W. [2 ]
Lennerz, Jochen K. [5 ,6 ]
Kalpathy-Cramer, Jayashree [1 ,2 ,7 ]
Clunie, David A. [8 ]
Fedorov, Andriy Y. [7 ,9 ]
Herrmann, Markus D. [3 ,6 ]
Affiliations
[1] Massachusetts Gen Hosp, Martinos Ctr Biomed Imaging, Boston, MA 02114 USA
[2] Mass Gen Brigham, MGH & BWH Ctr Clin Data Sci, Boston, MA USA
[3] Massachusetts Gen Hosp, Dept Pathol, Computat Pathol, Boston, MA 02114 USA
[4] Isomics Inc, Cambridge, MA USA
[5] Massachusetts Gen Hosp, Dept Pathol, Ctr Integrated Diagnost, Boston, MA 02114 USA
[6] Harvard Med Sch, Dept Pathol, Boston, MA 02115 USA
[7] Harvard Med Sch, Dept Radiol, Boston, MA 02115 USA
[8] PixelMed Publishing LLC, Bangor, PA USA
[9] Brigham & Womens Hosp, Dept Radiol, Surg Planning Lab, 75 Francis St, Boston, MA 02115 USA
Funding
National Institutes of Health (USA)
Keywords
DICOM; Python; Software; Machine learning; Segmentations; Structured reports; RESOURCE
DOI
10.1007/s10278-022-00683-y
Chinese Library Classification number
R8 [Special medicine]; R445 [Diagnostic imaging]
Discipline classification code
1002; 100207; 1009
Abstract
Machine learning (ML) is revolutionizing image-based diagnostics in pathology and radiology. ML models have shown promising results in research settings, but the lack of interoperability between ML systems and enterprise medical imaging systems has been a major barrier to clinical integration and evaluation. The DICOM® standard specifies information object definitions (IODs) and services for the representation and communication of digital images and related information, including image-derived annotations and analysis results. However, the complexity of the standard is an obstacle to its adoption in the ML community and creates a need for software libraries and tools that simplify working with datasets in DICOM format. Here we present the highdicom library, which provides a high-level application programming interface (API) for the Python programming language that abstracts low-level details of the standard and enables encoding and decoding of image-derived information in DICOM format in a few lines of Python code. The highdicom library leverages NumPy arrays for efficient data representation and ties into the extensive Python ecosystem for image processing and machine learning. Simultaneously, by simplifying creation and parsing of DICOM-compliant files, highdicom achieves interoperability with the medical imaging systems that hold the data used to train and run ML models, and that ultimately communicate and store model outputs for clinical use. We demonstrate through experiments with slide microscopy and computed tomography imaging that, by bridging these two ecosystems, highdicom enables developers and researchers to train and evaluate state-of-the-art ML models in pathology and radiology while remaining compliant with the DICOM standard and interoperable with clinical systems at all stages.
To promote standardization of ML research and streamline the ML model development and deployment process, we made the library available free and open-source at https://github.com/herrmannlab/highdicom.
Pages: 1719-1737
Page count: 19