A machine-learning expert-supporting system for diagnosis prediction of lymphoid neoplasms using a probabilistic decision-tree algorithm and immunohistochemistry profile database

被引:10
作者
Chong, Yosep [1 ,2 ]
Lee, Ji Young [1 ]
Kim, Yejin [3 ,4 ]
Choi, Jingyun [5 ]
Yu, Hwanjo [5 ]
Park, Gyeongsin [1 ]
Cho, Mee Yon [6 ]
Thakur, Nishant [1 ]
机构
[1] Catholic Univ Korea, Dept Hosp Pathol, Coll Med, Seoul, South Korea
[2] Catholic Univ Korea, Postech Catholic Biomed Engn Inst, Coll Med, Seoul, South Korea
[3] POSTECH, Dept Creat Informat Technol, Pohang, South Korea
[4] Univ Texas Hlth Sci Ctr Houston, Houston, TX 77030 USA
[5] POSTECH, Comp Sci & Engn, Pohang, South Korea
[6] Yonsei Univ, Dept Pathol, Wonju Coll Med, Wonju, South Korea
基金
新加坡国家研究基金会;
关键词
Database; Expert-supporting system; Machine learning; Immunohistochemistry; Probabilistic decision tree; CARCINOMAS; MARKER;
D O I
10.4132/jptm.2020.07.11
中图分类号
R36 [病理学];
学科分类号
100104 ;
摘要
Background: Immunohistochemistry (IHC) has played an essential role in the diagnosis of hematolymphoid neoplasms. However, IHC interpretations can be challenging in daily practice, and exponentially expanding volumes of IHC data are making the task increasingly difficult. We therefore developed a machine-learning expert-supporting system for diagnosing lymphoid neoplasms. Methods: A probabilistic decision-tree algorithm based on the Bayesian theorem was used to develop mobile application software for iOS and Android platforms. We tested the software with real data from 602 training and 392 validation cases of lymphoid neoplasms and compared the precision hit rates between the training and validation datasets. Results: IHC expression data for 150 lymphoid neoplasms and 584 antibodies was gathered. The precision hit rates of 94.7% in the training data and 95.7% in the validation data for lymphomas were not statistically significant. Results in most B-cell lymphomas were excellent, and generally equivalent performance was seen in T-cell lymphomas. The primary reasons for lack of precision were atypical IHC profiles for certain cases (e.g., CD15-negative Hodgkin lymphoma), a lack of disease-specific markers, and overlapping IHC profiles of similar diseases. Conclusions: Application of the machine-learning algorithm to diagnosis precision produced acceptable hit rates in training and validation datasets. Because of the lack of origin- or disease-specific markers in differential diagnosis, contextual information such as clinical and histological features should be taken into account to make proper use of this system in the pathologic decision-making process.
引用
收藏
页码:462 / 470
页数:9
相关论文
共 21 条
[1]   The diagnostic utility of the triple markers Napsin A, TTF-1, and PAX8 in differentiating between primary and metastatic lung carcinomas [J].
Abd El-Maqsoud, Nehad M. R. ;
Tawfiek, Ehab Rifat ;
Abdelmeged, Ayman ;
Rahman, Mohamed Fathy Abdel ;
Moustafa, Alaa A. E. .
TUMOR BIOLOGY, 2016, 37 (03) :3123-3134
[2]   Napsin A and thyroid transcription factor-1 expression in carcinomas of the lung, breast, pancreas, colon, kidney, thyroid, and malignant mesothelioma [J].
Bishop, Justin A. ;
Sharma, Rajni ;
Illei, Peter B. .
HUMAN PATHOLOGY, 2010, 41 (01) :20-25
[3]  
Buchwalow IB, 2010, IMMUNOHISTOCHEMISTRY: BASICS AND METHODS, P1, DOI 10.1007/978-3-642-04609-4
[4]  
Chu P, 2014, CAMBR ILLUS SURG PAT, P1
[5]   LABELING TECHNIQUES IN DIAGNOSIS OF VIRAL DISEASES [J].
COONS, AH .
BACTERIOLOGICAL REVIEWS, 1964, 28 (04) :397-&
[6]  
Dabbs D J., 2014, Diagnostic Immunohistochemistry: Theranostic and Genomic Applications, V4th
[7]  
de Matos LL, 2010, BIOMARK INSIGHTS, V5, P9
[8]  
DeYoung BR, 2000, SEMIN DIAGN PATHOL, V17, P184
[9]   Programmed death-1 (PD-1) is a marker of germinal center-associated T cells and angioimmunoblastic T-cell lymphoma [J].
Dorfman, DM ;
Brown, JA ;
Shahsafaei, A ;
Freeman, GJ .
AMERICAN JOURNAL OF SURGICAL PATHOLOGY, 2006, 30 (07) :802-810
[10]  
ELIAS JM, 2000, ANTIGEN RETRIEVAL TE, P7