共 50 条
A Hybrid Protein-Protein Interaction Triple Extraction Method for Biomedical Literature
被引:0
作者:
Zhao, Zhehuan
[1
]
Yang, Zhihao
[1
]
Sun, Cong
[1
]
Wang, Lei
[2
]
Lin, Hongfei
[1
]
机构:
[1] Dalian Univ Technol, Coll Comp Sci & Technol, Dalian 116024, Peoples R China
[2] Beijing Inst, Hlth Adm & Med Informat, Beijing 100850, Peoples R China
来源:
2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM)
|
2017年
关键词:
protein protein interaction triple extraction;
interaction word extraction;
protein named entity recognition;
MOLECULAR INTERACTION DATABASE;
INFORMATION;
UMLS;
TEXT;
D O I:
暂无
中图分类号:
TP39 [计算机的应用];
学科分类号:
081203 ;
0835 ;
摘要:
Protein-protein interaction extraction research can be widely applied to the field of life science research. However, most of the machine learning based methods focus on binary PPI relation extraction, which loses rich relationship type information that is critical to the PPIs study. The rule based open information extraction methods can extract the PPI triple (i.e. "protein1, interaction word, protein2"), but suffers from low recall rate problem. In this paper, we propose a hybrid protein-protein interaction triple extraction method. In this method, firstly, machine learning techniques are used to recognize protein entities and extract relational protein pairs. Then, the syntactic patterns and a dictionary are employed to find out corresponding interaction words that represent the relationships between two proteins. This method obtains an F-score of 40.18% on the AImed corpus, which is much higher than the result achieved by the rule based Stanford open information extraction method.
引用
收藏
页码:1515 / 1521
页数:7
相关论文
共 50 条