Mix-supervised multiset learning for cancer prognosis analysis with high-censoring survival data

被引:2
作者
Du, Denghui [1 ,2 ,3 ]
Feng, Qianjin [1 ,2 ,3 ]
Chen, Wufan [1 ,2 ,3 ]
Ning, Zhenyuan [1 ,2 ,3 ]
Zhang, Yu [1 ,2 ,3 ]
机构
[1] Southern Med Univ, Sch Biomed Engn, Guangzhou 510515, Peoples R China
[2] Southern Med Univ, Guangdong Prov Key Lab Med Image Proc, Guangzhou 510515, Peoples R China
[3] Southern Med Univ, Guangdong Prov Engn Lab Med Imaging & Diagnost Tec, Guangzhou 510515, Peoples R China
基金
中国国家自然科学基金;
关键词
High-censoring data; Multiset representation learning; Mix-supervised; Prognosis analysis; MULTIMODAL DATA INTEGRATION; BREAST-CANCER; REGRESSION; DIAGNOSIS; MODELS; IMAGES;
D O I
10.1016/j.eswa.2023.122430
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High censoring phenomenon usually occurs in cancer prognosis analysis, which, however, would introduce bias for model construction and limit generalization performance. In this paper, we first explore and identify an appropriate censoring range for cancer prognosis evaluation, upon which we present a mix-supervised multiset learning framework to cope with high-censoring data. Specifically, we construct multiple subsets with the specified censoring proportion, followed by a multiset representation learning method to learn subset-specific representations, which equips with adversary integrality preservation and dependency limitation constraints to ensure the unbiasedness of subsets and eliminate the redundancy among subsets, respectively. Furthermore, a mix-supervised multiset fusion model is proposed to estimate the relative survival risk, in which teacher model can make full use of the survival time of uncensored samples and the prognosis-related attributes of censored ones to generate reliable pseudo-labels and latent-space for student model. We evaluate the proposed method on three public datasets, and extensive experimental results demonstrate its superiority.
引用
收藏
页数:15
相关论文
共 71 条
  • [1] NONPARAMETRIC INFERENCE FOR A FAMILY OF COUNTING PROCESSES
    AALEN, O
    [J]. ANNALS OF STATISTICS, 1978, 6 (04) : 701 - 726
  • [2] Principal component analysis
    Abdi, Herve
    Williams, Lynne J.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04): : 433 - 459
  • [3] On the robustness of right andmiddle censoring schemes in parametric survival models
    Abuzaid, A. H.
    Abu El-Qumsan, M. K.
    El-Habil, A. M.
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (03) : 1771 - 1780
  • [4] Logistic Regression Confined by Cardinality-Constrained Sample and Feature Selection
    Adeli, Ehsan
    Li, Xiaorui
    Kwon, Dongjin
    Zhang, Yong
    Pohl, Kilian M.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (07) : 1713 - 1728
  • [5] Radiomic MRI Phenotyping of Glioblastoma: Improving Survival Preciction
    Bae, Sohi
    Choi, Yoon Seong
    Ahn, Sung Soo
    Chang, Jong Hee
    Kang, Seok-Gu
    Kim, Eui Hyun
    Kim, Se Hoon
    Lee, Seung-Koo
    [J]. RADIOLOGY, 2018, 289 (03) : 797 - 806
  • [6] Balaji Y, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1995
  • [7] Generating survival times to simulate Cox proportional hazards models
    Bender, R
    Augustin, T
    Blettner, M
    [J]. STATISTICS IN MEDICINE, 2005, 24 (11) : 1713 - 1723
  • [8] Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer
    Boehm, Kevin M.
    Aherne, Emily A.
    Ellenson, Lora
    Nikolovski, Ines
    Alghamdi, Mohammed
    Vazquez-Garcia, Ignacio
    Zamarin, Dmitriy
    Roche, Kara Long
    Liu, Ying
    Patel, Druv
    Aukerman, Andrew
    Pasha, Arfath
    Rose, Doori
    Selenica, Pier
    Causa Andrieu, Pamela I.
    Fong, Chris
    Capanu, Marinela
    Reis-Filho, Jorge S.
    Vanguri, Rami
    Veeraraghavan, Harini
    Gangai, Natalie
    Sosa, Ramon
    Leung, Samantha
    McPherson, Andrew
    Gao, JianJiong
    Lakhman, Yulia
    Shah, Sohrab P.
    [J]. NATURE CANCER, 2022, 3 (06) : 723 - +
  • [9] A SINGULAR VALUE THRESHOLDING ALGORITHM FOR MATRIX COMPLETION
    Cai, Jian-Feng
    Candes, Emmanuel J.
    Shen, Zuowei
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2010, 20 (04) : 1956 - 1982
  • [10] A new semi-supervised learning model combined with Cox and SP-AFT models in cancer survival analysis
    Chai, Hua
    Li, Zi-na
    Meng, De-yu
    Xia, Liang-yong
    Liang, Yong
    [J]. SCIENTIFIC REPORTS, 2017, 7