A prediction model for advanced colorectal neoplasia in an asymptomatic screening population

被引:25
作者
Hong, Sung Noh [1 ]
Son, Hee Jung [1 ,2 ]
Choi, Sun Kyu [3 ]
Chang, Dong Kyung [1 ]
Kim, Young-Ho [1 ]
Jung, Sin-Ho [3 ,4 ]
Rhee, Poong-Lyul [1 ]
机构
[1] Sungkyunkwan Univ, Dept Med, Samsung Med Ctr, Sch Med, Seoul, South Korea
[2] Samsung Med Ctr, Ctr Hlth Promot, Seoul, South Korea
[3] Samsung Med Ctr, Samsung Canc Res Inst, Biostat & Bioinformat Ctr, Seoul, South Korea
[4] Duke Univ, Dept Biostat & Bioinformat, Durham, NC USA
来源
PLOS ONE | 2017年 / 12卷 / 08期
关键词
MISSING DATA; RISK; CANCER; COLONOSCOPY; VALIDATION; GUIDELINES; SCORE; INDEX; TOOL; MEN;
D O I
10.1371/journal.pone.0181040
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background An electronic medical record (EMR) database of a large unselected population who received screening colonoscopies may minimize sampling error and represent real-world estimates of risk for screening target lesions of advanced colorectal neoplasia (CRN). Our aim was to develop and validate a prediction model for assessing the probability of advanced CRN using a clinical data warehouse. Methods A total of 49,450 screenees underwent their first colonoscopy as part of a health check-up from 2002 to 2012 at Samsung Medical Center, and the dataset was constructed by means of natural language processing from the computerized EMR system. The screenees were randomized into training and validation sets. The prediction model was developed using logistic regression. The model performance was validated and compared with existing models using area under receiver operating curve (AUC) analysis. Results In the training set, age, gender, smoking duration, drinking frequency, and aspirin use were identified as independent predictors for advanced CRN (adjusted P <.01). The developed model had good discrimination (AUC = 0.726) and was internally validated (AUC = 0.713). The high-risk group had a 3.7-fold increased risk of advanced CRN compared to the low-risk group (1.1% vs. 4.0%, P <.001). The discrimination performance of the present model for high-risk patients with advanced CRN was better than that of the Asia-Pacific Colorectal Screening score (AUC = 0.678, P <.001) and Schroy's CAN index (AUC = 0.672, P <.001). Conclusion The present 5-item risk model can be calculated readily using a simple questionnaire and can identify the low- and high-risk groups of advanced CRN at the first screening colonoscopy. This model may increase colorectal cancer risk awareness and assist healthcare providers in encouraging the high-risk group to undergo a colonoscopy.
引用
收藏
页数:19
相关论文
共 28 条
  • [1] Natural language processing for the development of a clinical registry: a validation study in intraductal papillary mucinous neoplasms
    Al-Haddad, Mohammad A.
    Friedlin, Jeff
    Kesterson, Joe
    Waters, Joshua A.
    Aguilar-Saavedra, Juan R.
    Schmidt, C. Max
    [J]. HPB, 2010, 12 (10) : 688 - 695
  • [2] [Anonymous], 2002, GASTROINTESTINAL E S, V58, pS3
  • [3] Derivation and Validation of a Prediction Rule for Estimating Advanced Colorectal Neoplasm Risk in Average-Risk Chinese
    Cai, Quan-Cai
    Yu, En-Da
    Xiao, Yi
    Bai, Wen-Yuan
    Chen, Xing
    He, Li-Ping
    Yang, Yu-Xiu
    Zhou, Ping-Hong
    Jiang, Xue-Liang
    Xu, Hui-Min
    Fan, Hong
    Ge, Zhi-Zheng
    Lv, Nong-Hua
    Huang, Zhi-Gang
    Li, You-Ming
    Ma, Shu-Ren
    Chen, Jie
    Li, Yan-Qing
    Xu, Jian-Ming
    Xiang, Ping
    Yang, Li
    Lin, Fu-Lin
    Li, Zhao-Shen
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2012, 175 (06) : 584 - 593
  • [4] Harvard report on cancer prevention volume 4: Harvard Cancer Risk Index
    Colditz, GA
    Atwood, KA
    Emmons, K
    Monson, RR
    Willett, WC
    Trichopoulos, D
    Hunter, DJ
    [J]. CANCER CAUSES & CONTROL, 2000, 11 (06) : 477 - 488
  • [5] Development of a risk score for colorectal cancer in men
    Driver, Jane A.
    Gaziano, J. Michael
    Gelber, Rebecca P.
    Lee, I-Min
    Buring, Julie E.
    Kurth, Tobias
    [J]. AMERICAN JOURNAL OF MEDICINE, 2007, 120 (03) : 257 - 263
  • [6] Cancer incidence and mortality worldwide: Sources, methods and major patterns in GLOBOCAN 2012
    Ferlay, Jacques
    Soerjomataram, Isabelle
    Dikshit, Rajesh
    Eser, Sultan
    Mathers, Colin
    Rebelo, Marise
    Parkin, Donald Maxwell
    Forman, David
    Bray, Freddie
    [J]. INTERNATIONAL JOURNAL OF CANCER, 2015, 136 (05) : E359 - E386
  • [7] Handling missing data in self-report measures
    Fox-Wasylyshyn, SM
    El-Masri, MM
    [J]. RESEARCH IN NURSING & HEALTH, 2005, 28 (06) : 488 - 495
  • [8] Colorectal Cancer Risk Prediction Tool for White Men and Women Without Known Susceptibility
    Freedman, Andrew N.
    Slattery, Martha L.
    Ballard-Barbash, Rachel
    Willis, Gordon
    Cann, Bette J.
    Pee, David
    Gail, Mitchell H.
    Pfeiffer, Ruth M.
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2009, 27 (05) : 686 - 693
  • [9] Friedlin Jeff, 2010, AMIA Annu Symp Proc, V2010, P237
  • [10] Missing Data Analysis: Making It Work in the Real World
    Graham, John W.
    [J]. ANNUAL REVIEW OF PSYCHOLOGY, 2009, 60 : 549 - 576