Optimal subsampling proportional subdistribution hazards regression with rare events in big data

被引:0
作者
Li Erqian [1 ]
Tang Man-Lai [2 ]
Tian Maozai [3 ]
Yu Keming [4 ]
机构
[1] North China Univ Technol, Coll Sci, Beijing, Peoples R China
[2] Univ Hertfordshire, Ctr Data Innovat Res, Dept Phys Astron & Math, Sch Phys Engn & Comp Sci, Hatfield, Herts, England
[3] Renmin Univ China, Ctr Appl Stat, Sch Stat, Beijing, Peoples R China
[4] Brunel Univ London, Dept Math Sci, London, England
基金
中国国家自然科学基金;
关键词
Big data; Competing risks data; Optimal subsampling; PSEUDO-OBSERVATIONS; COMPETING RISKS; MODEL;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The proportional subdistribution hazards (PSH) model has been widely employed for analyzing competing risks data which have mutually exclusive events with multiple causes and commonly occur in clinical research. With the rapid development of healthcare industry, massively sized survival data sets are becoming increasingly prevalent and classical PSH models are computationally intensive with large data sets. In this article, we propose the optimal subsampling estimators and two-step algorithm for the FineGray model. Asymptotic properties of the proposed estimators are established and an extensive simulation study is conducted to demonstrate the efficiency of the estimators. Our proposed methodology is then illustrated with the large dataset from the SEER (Surveillance, Epidemiology, and End Results) database.
引用
收藏
页码:361 / 377
页数:17
相关论文
共 39 条
[2]  
Ai M., 2021, Journal of Complexity, V62, P101
[3]   OPTIMAL SUBSAMPLING ALGORITHMS FOR BIG DATA REGRESSIONS [J].
Ai, Mingyao ;
Yu, Jun ;
Zhang, Huiming ;
Wang, HaiYing .
STATISTICA SINICA, 2021, 31 (02) :749-772
[4]  
Akhtar-Danesh N., 2008, FALL N AM STAT US GR, V4
[5]   Pseudo-observations in survival analysis [J].
Andersen, Per Kragh ;
Perme, Maja Pohar .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2010, 19 (01) :71-99
[6]   COX REGRESSION-MODEL FOR COUNTING-PROCESSES - A LARGE SAMPLE STUDY [J].
ANDERSEN, PK ;
GILL, RD .
ANNALS OF STATISTICS, 1982, 10 (04) :1100-1120
[7]   Pseudo-observations for competing risks with covariate dependent censoring [J].
Binder, Nadine ;
Gerds, Thomas A. ;
Andersen, Per Kragh .
LIFETIME DATA ANALYSIS, 2014, 20 (02) :303-315
[8]  
BOOS D. D., 2013, Essential Statistical In- i9Vs0yo+oEWAIwxIQ27gBGdsonf5A1wbnJdNWQDNzau ference: Theory and Methods
[9]   Links between Infections, Lung Cancer, and the Immune System [J].
Budisan, Liviuta ;
Zanoaga, Oana ;
Braicu, Cornelia ;
Pirlog, Radu ;
Covaliu, Bogdan ;
Esanu, Victor ;
Korban, Schuyler S. ;
Berindan-Neagoe, Ioana .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (17)
[10]   The non-breast-cancer death rate among breast cancer patients [J].
Bush, Devon ;
Smith, Barbara ;
Younger, Jerry ;
Michaelson, James S. .
BREAST CANCER RESEARCH AND TREATMENT, 2011, 127 (01) :243-249