A deep learning framework to classify breast density with noisy labels regularization

Cited by: 6

Authors:

Lopez-Almazan, Hector ^{[1]}

Perez-Benito, Francisco Javier ^{[1]}

Larroza, Andres ^{[1]}

Perez-Cortes, Juan-Carlos ^{[1]}

Pollan, Marina ^{[2,3]}

Perez-Gomez, Beatriz ^{[2,3]}

Trejo, Dolores Salas ^{[4,5]}

Casals, Maria ^{[4,5]}

Llobet, Rafael ^{[1]}

Affiliations:

[1] Univ Politecn Valencia, Inst Tecnol Informat, Camino Vera S-N, Valencia 46022, Spain

[2] Carlos III Inst Hlth, Natl Ctr Epidemiol, Monforte De Lemos 5, Madrid 28029, Spain

[3] Carlos III Inst Hlth, Consortium Biomed Res Epidemiol & Publ Hlth CIBER, Monforte De Lemos 5, Madrid 28029, Spain

[4] Gen Directorate Publ Hlth, Valencian Breast Canc Screening Program, Valencia, Spain

[5] FISABIO, Ctr Super Invest Salud Publ CSISP, Valencia, Spain

Source:

COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE | 2022 / Vol. 221

Keywords:

Breast density; Noisy labels; Deep learning; Dense tissue classification; Mammography; CLASSIFICATION; VARIABILITY; CANCER; MAMMOGRAPHY;

DOI:

10.1016/j.cmpb.2022.106885

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification codes:

081203 ; 0835 ;

Abstract:

Background and Objective: Breast density assessed from digital mammograms is a biomarker for higher risk of developing breast cancer. Experienced radiologists assess breast density using the Breast Imaging Reporting and Data System (BI-RADS) categories. Supervised learning algorithms have been developed for this task; however, their performance depends on the quality of the ground truth information, which is usually labeled by expert readers. These labels are noisy approximations of the ground truth, as there is often intra- and inter-reader variability among labels. Thus, it is crucial to provide a reliable method for assigning digital mammograms to BI-RADS categories. This paper presents RegL (Labels Regularizer), a methodology that includes different image pre-processing steps to allow both a correct breast segmentation and the enhancement of image quality through an intensity adjustment, thus allowing the use of deep learning to classify the mammograms into BI-RADS categories. The Confusion Matrix (CM) CNN used implements an architecture that models each radiologist's noisy label. The final methodology pipeline was determined after comparing the performance of image pre-processing steps combined with different DL architectures.
Methods: A multi-center study composed of 1395 women whose mammograms were classified into the four BI-RADS categories by three experienced radiologists is presented. A total of 892 mammograms were used as the training corpus, 224 formed the validation corpus, and 279 the test corpus.
Results: The combination of five networks implementing the RegL methodology achieved the best results among all the models in the test set. The ensemble model obtained an accuracy of 0.85 and a kappa index of 0.71.
Conclusions: The proposed methodology has a similar performance to the experienced radiologists in the classification of digital mammograms into BI-RADS categories. This suggests that the pre-processing steps and the modelling of each radiologist's label allow for a better estimation of the unknown ground truth labels.
© 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
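
The abstract's idea of modelling each radiologist's noisy label with a confusion matrix can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the example matrix ADJACENT_READER, and the trace penalty (a regularizer used in some confusion-matrix approaches to noisy labels) are assumptions made for illustration only. The network's estimate of the true BI-RADS category is pushed through a per-reader confusion matrix to predict what that reader would have labeled, and the loss compares that prediction with the reader's actual label.

```python
import math

NUM_CLASSES = 4  # the four BI-RADS density categories

# Hypothetical reader who sometimes confuses adjacent categories.
# Row i is the distribution of this reader's label when the true class is i.
ADJACENT_READER = [
    [0.8, 0.2, 0.0, 0.0],
    [0.1, 0.8, 0.1, 0.0],
    [0.0, 0.1, 0.8, 0.1],
    [0.0, 0.0, 0.2, 0.8],
]


def softmax(logits):
    """Turn raw network outputs into class probabilities."""
    top = max(logits)
    exps = [math.exp(v - top) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def noisy_label_distribution(true_probs, confusion):
    """p(reader says j) = sum_i p(true class = i) * confusion[i][j]."""
    n = len(true_probs)
    return [
        sum(true_probs[i] * confusion[i][j] for i in range(n))
        for j in range(n)
    ]


def regularized_loss(true_probs, confusions, reader_labels, trace_weight=0.01):
    """Cross-entropy against each reader's label plus a trace penalty.

    The trace penalty is an assumed regularizer, not taken from the abstract.
    """
    loss = 0.0
    for confusion, label in zip(confusions, reader_labels):
        noisy = noisy_label_distribution(true_probs, confusion)
        loss -= math.log(noisy[label] + 1e-12)
        loss += trace_weight * sum(confusion[i][i] for i in range(len(confusion)))
    return loss
```

With an identity confusion matrix the reader is treated as noise-free and the predicted reader distribution equals the network's own output. In a trainable version the matrices would be learned jointly with the network rather than fixed as here.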


Pages: 11

