Efficient Temporal Information Extraction from Korean Documents

被引:1
作者
Lim, Chae-Gyun [1 ]
Choi, Ho-Jin [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon, South Korea
来源
2017 18TH IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (IEEE MDM 2017) | 2017年
关键词
natural language processing; temporal information extraction; time expression;
D O I
10.1109/MDM.2017.63
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As the amount of documents continues to increase steadily, it has become an important issue to shorten processing time in the field of natural language processing. In this paper, we describe a method to reduce the execution speed of the Korean temporal information extraction module from a development perspective. While the rule-based approach is useful for finding time representations from natural language sentences, the process of applying rules for each sentence can take a long time. In addition, in Korean sentences, the linguistic characteristics must be considered together to obtain exact temporal information. We first attempted to reduce the time to look for a time expression using the characteristics of the Korean morpheme, and then modified the module to speed up the search for the entire rule base. And we also show an experimental result that the change of execution speed according to the proposed approaches.
引用
收藏
页码:366 / 370
页数:5
相关论文
共 11 条
[1]  
Angeli G., 2013, P 51 ANN M ASS COMP, P83
[2]  
[Anonymous], 2007, ANN M ASS COMP LING
[3]  
[Anonymous], 2015, P 19 C COMP NAT LANG
[4]  
Bod R, 2006, COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, P865
[5]  
Hacioglu K, 2005, LECT NOTES COMPUT SC, V3406, P548
[6]  
Hanig C., 2008, LREC
[7]  
Hanig C., 2010, P 14 C COMP NAT LANG, P1
[8]  
Jeong YS, 2016, LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P356
[9]  
Jing HY, 2003, PROCEEDINGS OF THE 2003 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, P200
[10]  
Kuzey E., 2012, TempWeb, P25, DOI 10.1145/2169095.2169101