Combining classifiers for robust PICO element detection

Cited by: 88
Authors
Boudin, Florian [1]
Nie, Jian-Yun [1]
Bartlett, Joan C.
Grad, Roland
Pluye, Pierre
Dawes, Martin [2]
Affiliations
[1] Univ Montreal, DIRO, Montreal, PQ H3C 3J7, Canada
[2] McGill Univ, Dept Family Med, Montreal, PQ H2W 1S4, Canada
Keywords
ABSTRACTS; KNOWLEDGE
DOI
10.1186/1472-6947-10-29
CLC number
R-058
Abstract
Background: Formulating a clinical information need in terms of the four atomic parts which are Population/Problem, Intervention, Comparison and Outcome (known as PICO elements) facilitates searching for a precise answer within a large medical citation database. However, using PICO defined items in the information retrieval process requires a search engine to be able to detect and index PICO elements in the collection in order for the system to retrieve relevant documents.
Methods: In this study, we tested multiple supervised classification algorithms and their combinations for detecting PICO elements within medical abstracts. Using the structural descriptors that are embedded in some medical abstracts, we have automatically gathered large training/testing data sets for each PICO element.
Results: Combining multiple classifiers using a weighted linear combination of their prediction scores achieves promising results with an F-measure score of 86.3% for P, 67% for I and 56.6% for O.
Conclusions: Our experiments on the identification of PICO elements showed that the task is very challenging. Nevertheless, the performance achieved by our identification method is competitive with previously published results and shows that this task can be achieved with a high accuracy for the P element but lower ones for I and O elements.
Pages: 6