Named Entity Recognition in Turkish with Bayesian Learning and Hybrid Approaches

被引:4
作者
RehaYavuz, Sermet [1 ]
Kucuk, Dilek [2 ]
Yazici, Adnan [1 ]
机构
[1] Middle East Tech Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
[2] TUBITAK Energy Inst, Elect Power Technol Grp, TR-06800 Ankara, Turkey
来源
INFORMATION SCIENCES AND SYSTEMS 2013 | 2013年 / 264卷
关键词
INFORMATION EXTRACTION;
D O I
10.1007/978-3-319-01604-7_13
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named entity recognition is one of the significant textual information extraction tasks. In this paper, we present two approaches for named entity recognition on Turkish texts. The first is a Bayesian learning approach which is trained on a considerably limited training set. The second approach comprises two hybrid systems based on joint utilization of this Bayesian learning approach and a previously proposed rule-based named entity recognizer. All of the proposed three approaches achieve promising performance rates. This paper is significant as it reports the first use of the Bayesian approach for the task of named entity recognition on Turkish texts for which especially practical approaches are still insufficient.
引用
收藏
页码:129 / 138
页数:10
相关论文
共 14 条
[1]  
[Anonymous], 1997, P 5 APPL NAT LANG PR, DOI DOI 10.3115/974557.974586
[2]   Machine learning for information extraction in informal domains [J].
Freitag, D .
MACHINE LEARNING, 2000, 39 (2-3) :169-202
[3]  
GRISHMAN R, 2003, OXFORD HDB COMPUTATI
[4]  
KUCUK D, 2009, P INT C FLEX QUER AN, V5822, P524
[5]   A hybrid named entity recognizer for Turkish [J].
Kucuk, Dilek ;
Yazici, Adnan .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (03) :2733-2742
[6]   Exploiting information extraction techniques for automatic semantic video indexing with an application to Turkish news videos [J].
Kucuk, Dilek ;
Yazici, Adnan .
KNOWLEDGE-BASED SYSTEMS, 2011, 24 (06) :844-857
[7]   Adapting SVM for data sparseness and imbalance: a case study in information extraction [J].
Li, Yaoyong ;
Bontcheva, Kalina ;
Cunningham, Hamish .
NATURAL LANGUAGE ENGINEERING, 2009, 15 :241-271
[8]  
Maynard D., 2001, P C REC ADV NAT LANG
[9]  
McCallum A., 2003, Proceedings of CoNLL, P188
[10]  
SAY B, 2002, P 11 INT C TURK LING