共 32 条
@Minter: automated text-mining of microbial interactions
被引:25
作者:
Lim, Kun Ming Kenneth
[1
,2
]
Li, Chenhao
[1
,3
]
Chng, Kern Rei
[1
]
Nagarajan, Niranjan
[1
,3
]
机构:
[1] Genome Inst Singapore, Computat & Syst Biol, Singapore 138672, Singapore
[2] Natl Univ Singapore, Fac Sci, Computat Biol Program, Singapore, Singapore
[3] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
关键词:
DISEASE;
D O I:
10.1093/bioinformatics/btw357
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
Motivation: Microbial consortia are frequently defined by numerous interactions within the community that are key to understanding their function. While microbial interactions have been extensively studied experimentally, information regarding them is dispersed in the scientific literature. As manual collation is an infeasible option, automated data processing tools are needed to make this information easily accessible. Results: We present @Minter, an automated information extraction system based on Support Vector Machines to analyze paper abstracts and infer microbial interactions. @Minter was trained and tested on a manually curated gold standard dataset of 735 species interactions and 3917 annotated abstracts, constructed as part of this study. Cross-validation analysis showed that @Minter was able to detect abstracts pertaining to one or more microbial interactions with high specificity (specificity=95%, AUC = 0.97). Despite challenges in identifying specific microbial interactions in an abstract (interaction level recall = 95%, precision = 25%), @Minter was shown to reduce annotator workload 13-fold compared to alternate approaches. Applying @Minter to 175 bacterial species abundant on human skin, we identified a network of 357 literature-reported microbial interactions, demonstrating its utility for the study of microbial communities.
引用
收藏
页码:2981 / 2987
页数:7
相关论文