medpie: an information extraction package for medical message board posts

Cited by: 8
Authors
Benton, A. [1 ]
Holmes, J. H. [1 ]
Hill, S. [2 ]
Chung, A. [1 ]
Ungar, L. [3 ]
Affiliations
[1] Univ Penn, Sch Med, Ctr Clin Epidemiol & Biostat, Philadelphia, PA 19104 USA
[2] Univ Penn, Wharton Sch, Dept Operat & Informat Management, Philadelphia, PA 19104 USA
[3] Univ Penn, Sch Engn & Appl Sci, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
DOI
10.1093/bioinformatics/bts030
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
We have developed medpie, a software package for preparing medical message board corpora and extracting patient mentions and statistics for drugs, herbs and adverse effects experienced from them. The package is divided into web-crawling, HTML-cleaning, de-identification and information extraction modules. It also includes a sample controlled vocabulary of drugs, herbs and adverse effect terms.
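The four stages named in the abstract can be illustrated with a toy pipeline. This is only a minimal sketch under stated assumptions, not medpie's actual code or API: every function name, the regular expressions, and the small vocabulary below are hypothetical, the crawling stage is omitted (raw HTML posts are passed in directly), and real de-identification needs far more than masking e-mail addresses and phone numbers.

```python
import re
from collections import Counter
from html.parser import HTMLParser

# Hypothetical toy vocabulary (assumption); medpie ships its own sample
# controlled vocabulary of drugs, herbs and adverse effect terms.
VOCAB = {
    "drug": ["tamoxifen", "letrozole"],
    "herb": ["ginger"],
    "adverse_effect": ["hot flashes", "nausea"],
}

# Simplistic patterns (assumption) standing in for a de-identification module.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
PHONE = re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b")


class _TextExtractor(HTMLParser):
    """Collects text nodes and discards the tags around them."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def clean_html(raw):
    """HTML-cleaning step: strip tags and collapse whitespace."""
    parser = _TextExtractor()
    parser.feed(raw)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


def deidentify(text):
    """De-identification step: mask e-mail addresses and phone numbers."""
    return PHONE.sub("[PHONE]", EMAIL.sub("[EMAIL]", text))


def extract_mentions(text):
    """Extraction step: return the (category, term) pairs found in one post."""
    lowered = text.lower()
    found = set()
    for category, terms in VOCAB.items():
        for term in terms:
            if re.search(r"\b" + re.escape(term) + r"\b", lowered):
                found.add((category, term))
    return found


def count_mentions(raw_posts):
    """Count, for each vocabulary term, how many posts mention it."""
    counts = Counter()
    for raw in raw_posts:
        counts.update(extract_mentions(deidentify(clean_html(raw))))
    return counts


if __name__ == "__main__":
    posts = [
        "<p>Started <b>Tamoxifen</b> last month, now I get hot flashes. "
        "Email me at jane@example.com</p>",
        "<div>Ginger tea helped my nausea. Call 215-555-0100.</div>",
    ]
    for (category, term), n in sorted(count_mentions(posts).items()):
        print(category, term, n)
```

Counting posts rather than raw occurrences keeps one verbose poster from dominating the statistics; that is a design choice of this sketch, not something the abstract specifies.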
Pages: 743-744
Page count: 2
Related papers
6 in total
[1]   Identifying potential adverse effects using the web: A new approach to medical hypothesis generation [J].
Benton, Adrian ;
Ungar, Lyle ;
Hill, Shawndra ;
Hennessy, Sean ;
Mao, Jun ;
Chung, Annie ;
Leonard, Charles E. ;
Holmes, John H. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2011, 44 (06) :989-996
[2]   A system for de-identifying medical message board text [J].
Benton, Adrian ;
Hill, Shawndra ;
Ungar, Lyle ;
Chung, Annie ;
Leonard, Charles ;
Freeman, Cristin ;
Holmes, John H. .
BMC BIOINFORMATICS, 2011, 12
[3]  
Durant Kathleen T, 2010, Summit Transl Bioinform, V2010, P6
[4]   Extracting product comparisons from discussion boards [J].
Feldman, Ronen ;
Fresko, Moshe ;
Goldenberg, Jacob ;
Netzer, Oded ;
Ungar, Lyle .
ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, :469-+
[5]   Evaluating the state-of-the-art in automatic de-identification [J].
Uzuner, Oezlem ;
Luo, Yuan ;
Szolovits, Peter .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2007, 14 (05) :550-563
[6]  
Zeng Qing T, 2006, AMIA Annu Symp Proc, P1155