Large-scale extraction of drug-disease pairs from the medical literature
被引:17
作者:
Wang, Pengwei
论文数: 0引用数: 0
h-index: 0
机构:
South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R ChinaSouth China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
Wang, Pengwei
[1
]
Hao, Tianyong
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Foreign Studies, Cisco Sch Informat, Guangzhou, Guangdong, Peoples R ChinaSouth China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
Hao, Tianyong
[2
]
Yan, Jun
论文数: 0引用数: 0
h-index: 0
机构:
Microsoft Res Asia, Beijing, Peoples R ChinaSouth China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
Yan, Jun
[3
]
Jin, Lianwen
论文数: 0引用数: 0
h-index: 0
机构:
South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R ChinaSouth China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
Jin, Lianwen
[1
]
机构:
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
[2] Guangdong Univ Foreign Studies, Cisco Sch Informat, Guangzhou, Guangdong, Peoples R China
Automatic extraction of large-scale and accurate drug-disease pairs from the medical literature plays an important role for drug repurposing. However, many existing extraction methods are mainly in a supervised manner. It is costly and time-consuming to manually label drug-disease pairs datasets. There are many drug-disease pairs buried in free text. In this work, we first leverage a pattern-based method to automatically extract drug-disease pairs with treatment and inducement relationships from free text. Then, to reflect a drug-disease relation, a network embedding algorithm is proposed to calculate the degree of correlation of a drug-disease pair. In the experiments, we use the method to extract treatment and inducement drug-disease pairs from 27 million medical abstracts and titles available on PubMed. We extract 138,318 unique treatment pairs and 75,396 unique inducement pairs. Our algorithm achieves a precision of 0.912 and a recall of 0.898 in extracting the frequent treatment drug-disease pairs, and a precision of 0.923 and a recall of 0.833 in extracting the frequent inducement drug-disease pairs. Besides, our proposed information network embedding algorithm can efficiently reflect the degree of correlation of drug-disease pairs. Our algorithm can achieve a precision of 0.802, a recall of 0.783 in the fine-grained evaluation of extracting frequent pairs.
机构:
Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Comp Sci, Bunkyo Ku, Tokyo 1130033, JapanUniv Tokyo, Grad Sch Informat Sci & Technol, Dept Comp Sci, Bunkyo Ku, Tokyo 1130033, Japan
Miwa, Makoto
Saetre, Rune
论文数: 0引用数: 0
h-index: 0
机构:Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Comp Sci, Bunkyo Ku, Tokyo 1130033, Japan
Saetre, Rune
Miyao, Yusuke
论文数: 0引用数: 0
h-index: 0
机构:Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Comp Sci, Bunkyo Ku, Tokyo 1130033, Japan
Miyao, Yusuke
Tsujii, Jun'ichi
论文数: 0引用数: 0
h-index: 0
机构:
Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, EnglandUniv Tokyo, Grad Sch Informat Sci & Technol, Dept Comp Sci, Bunkyo Ku, Tokyo 1130033, Japan
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Training Program Biomed Informat, Stanford, CA 94305 USA
Lucile Packard Childrens Hosp, Palo Alto, CA 94304 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Sirota, Marina
Dudley, Joel T.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Training Program Biomed Informat, Stanford, CA 94305 USA
Lucile Packard Childrens Hosp, Palo Alto, CA 94304 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Dudley, Joel T.
Kim, Jeewon
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Stanford Comprehens Canc Ctr, Stanford, CA 94305 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Kim, Jeewon
Chiang, Annie P.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Training Program Biomed Informat, Stanford, CA 94305 USA
Lucile Packard Childrens Hosp, Palo Alto, CA 94304 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Chiang, Annie P.
Morgan, Alex A.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Training Program Biomed Informat, Stanford, CA 94305 USA
Lucile Packard Childrens Hosp, Palo Alto, CA 94304 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Morgan, Alex A.
Sweet-Cordero, Alejandro
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Canc Biol Program, Stanford, CA 94305 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Sweet-Cordero, Alejandro
Sage, Julien
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Canc Biol Program, Stanford, CA 94305 USA
Stanford Univ, Dept Genet, Stanford, CA 94305 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Sage, Julien
Butte, Atul J.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Lucile Packard Childrens Hosp, Palo Alto, CA 94304 USA
Stanford Univ, Sch Med, Canc Biol Program, Stanford, CA 94305 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
机构:
Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Comp Sci, Bunkyo Ku, Tokyo 1130033, JapanUniv Tokyo, Grad Sch Informat Sci & Technol, Dept Comp Sci, Bunkyo Ku, Tokyo 1130033, Japan
Miwa, Makoto
Saetre, Rune
论文数: 0引用数: 0
h-index: 0
机构:Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Comp Sci, Bunkyo Ku, Tokyo 1130033, Japan
Saetre, Rune
Miyao, Yusuke
论文数: 0引用数: 0
h-index: 0
机构:Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Comp Sci, Bunkyo Ku, Tokyo 1130033, Japan
Miyao, Yusuke
Tsujii, Jun'ichi
论文数: 0引用数: 0
h-index: 0
机构:
Univ Manchester, Sch Comp Sci, Manchester M13 9PL, Lancs, EnglandUniv Tokyo, Grad Sch Informat Sci & Technol, Dept Comp Sci, Bunkyo Ku, Tokyo 1130033, Japan
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Training Program Biomed Informat, Stanford, CA 94305 USA
Lucile Packard Childrens Hosp, Palo Alto, CA 94304 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Sirota, Marina
Dudley, Joel T.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Training Program Biomed Informat, Stanford, CA 94305 USA
Lucile Packard Childrens Hosp, Palo Alto, CA 94304 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Dudley, Joel T.
Kim, Jeewon
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Stanford Comprehens Canc Ctr, Stanford, CA 94305 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Kim, Jeewon
Chiang, Annie P.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Training Program Biomed Informat, Stanford, CA 94305 USA
Lucile Packard Childrens Hosp, Palo Alto, CA 94304 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Chiang, Annie P.
Morgan, Alex A.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Training Program Biomed Informat, Stanford, CA 94305 USA
Lucile Packard Childrens Hosp, Palo Alto, CA 94304 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Morgan, Alex A.
Sweet-Cordero, Alejandro
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Canc Biol Program, Stanford, CA 94305 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Sweet-Cordero, Alejandro
Sage, Julien
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Stanford Univ, Sch Med, Canc Biol Program, Stanford, CA 94305 USA
Stanford Univ, Dept Genet, Stanford, CA 94305 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Sage, Julien
Butte, Atul J.
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA
Lucile Packard Childrens Hosp, Palo Alto, CA 94304 USA
Stanford Univ, Sch Med, Canc Biol Program, Stanford, CA 94305 USAStanford Univ, Sch Med, Dept Pediat, Div Syst Med, Stanford, CA 94305 USA