An application of convolutional neural networks with salient features for relation classification

被引:11
作者
Dashdorj, Zolzaya [1 ]
Song, Min [1 ]
机构
[1] Yonsei Univ, Dept Lib & Informat Sci, 50 Yonsei Ro, Seoul 120749, South Korea
基金
新加坡国家研究基金会;
关键词
Convolutional neural networks; Biomedical data analysis; Relation classification; Hyperparameter optimization; Deep learning; KNOWLEDGE;
D O I
10.1186/s12859-019-2808-3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundDue to the advent of deep learning, the increasing number of studies in the biomedical domain has attracted much interest in feature extraction and classification tasks. In this research, we seek the best combination of feature set and hyperparameter setting of deep learning algorithms for relation classification. To this end, we incorporate an entity and relation extraction tool, PKDE4J to extract biomedical features (i.e., biomedical entities, relations) for the relation classification. We compared the chosen Convolutional Neural Networks (CNN) based classification model with the most widely used learning algorithms.ResultsOur CNN based classification model outperforms the most widely used supervised algorithms. We achieved a significant performance on binary classification with a weighted macro-average F1-score: 94.79% using pre-extracted relevant feature combinations. For multi-class classification, the weighted macro-average F1-score is estimated around 86.95%.ConclusionsOur results suggest that our proposed CNN based model using the not only single feature as the raw text of the sentences of biomedical literature, but also coupling with multiple and highlighted features extracted from the biomedical sentences could improve the classification performance significantly. We offer hyperparameter tuning and optimization approaches for our proposed model to obtain optimal hyperparameters of the models with the best performance.
引用
收藏
页数:12
相关论文
共 32 条
[1]  
[Anonymous], 2012, ABS12070580 CORR
[2]  
[Anonymous], 2010, P ICML 10 P 27 INT C
[3]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[4]  
Bunescu R. C., 2005, P C HUM LANG TECHN E, P724, DOI DOI 10.3115/1220575.1220666
[5]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493
[6]  
Culotta A., 2006, P HUM LANG TECHN C N, P296, DOI 10.3115/1220835.1220873
[7]  
Culotta A., 2004, P 42 ANN M ASS COMPU, DOI DOI 10.3115/1218955.1219009
[8]  
Dashdorj Zolzaya, 2016, Semantic Technology. 6th Joint International Conference, JIST 2016. Revised Selected Papers: LNCS 10055, P305, DOI 10.1007/978-3-319-50112-3_23
[9]   High-level event identification in social media [J].
Dashdorj, Zolzaya ;
Altangerel, Erdenebaatar .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (03)
[10]  
Hendrickx Iris, 2009, Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions, DEW '09, P94