Towards identifying intervention arms in randomized controlled trials: Extracting coordinating constructions

被引:18
作者
Chung, Grace Yuet-Chee [1 ]
机构
[1] Univ New S Wales, Ctr Hlth Informat, Sydney, NSW 2052, Australia
基金
澳大利亚研究理事会;
关键词
Information extraction; Biomedical text mining; Biomedical natural langauge processing; Medical informatics; INFORMATION;
D O I
10.1016/j.jbi.2008.12.011
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background: Large numbers of reports of randomized controlled trials (RCTs) are published each year, and it is becoming increasingly difficult for clinicians practicing evidence-based medicine to find answers to clinical questions. The automatic machine extraction of RCT experimental details, including design methodology and outcomes, could help clinicians and reviewers locate relevant studies more rapidly and easily. Aim: This paper investigates how the comparison of interventions is documented in the abstracts of published RCTs. The ultimate goal is to use automated text mining to locate each intervention arm of a trial. This preliminary work aims to identify coordinating constructions, which are prevalent in the expression of intervention comparisons. Methods and results: An analysis of the types of constructs that describe the allocation of intervention arms is conducted, revealing that the compared interventions are predominantly embedded in coordinating constructions. A method is developed for identifying the descriptions of the assignment of treatment arms in clinical trials, using a full sentence parser to locate coordinating constructions and a statistical classifier for labeling positive examples. Predicate-argument structures are used along with other linguistic features with a maximum entropy classifier. An F-score of 0.78 is obtained for labeling relevant coordinating constructions in an independent test set. Conclusions: The intervention arms of a randomized controlled trials can be identified by machine extraction incorporating syntactic features derived from full sentence parsing. (C) 2008 Elsevier Inc. All rights reserved.
引用
收藏
页码:790 / 800
页数:11
相关论文
共 55 条
[1]  
*AD HOC WORK GROUP, 1987, ANN INTERN MED, V106, P595
[2]  
Agarwal Rajeev., 1992, Proceedings of the 30th annual meeting on Association for Computational Linguistics, P15
[3]   Endorsement of the CONSORT statement by high impact medical journals: survey of instructions for authors [J].
Altman, DG .
BMJ-BRITISH MEDICAL JOURNAL, 2005, 330 (7499) :1056-1057
[4]  
[Anonymous], 2005, ACP Journal Club
[5]  
[Anonymous], Evidence-Based Medicine, V5
[6]  
ARONSON AR, 2001, AMIA ANN S P, P17
[7]   Standards of reporting of randomized controlled trials in general surgery - Can we do better? [J].
Balasubramanian, Sabapathy P. ;
Wiener, Martin ;
Alshameeri, Zeiad ;
Tiruvoipati, Ravindranath ;
Elbourne, Diana ;
Reed, Malcolm W. .
ANNALS OF SURGERY, 2006, 244 (05) :663-667
[8]   Statistical models for text segmentation [J].
Beeferman, D ;
Berger, A ;
Lafferty, J .
MACHINE LEARNING, 1999, 34 (1-3) :177-210
[9]  
BOOTS I, 2004, 12 COCHR C BRIDG GAP
[10]  
Buyko E., 2007, PACLING 2007 P 10 C, P163