Content-based Information Retrieval by Named Entity Recognition and Verb Semantic Role Labelling

Cited by: 0
Authors
Antony, Betina J. [1 ]
Mahalakshmi, G. Suryanarayanan [1 ]
Affiliations
[1] Anna Univ, CEG, Dept CSE, Madras 600025, Tamil Nadu, India
Keywords
Information Retrieval; Tamil Siddha medicine; Named Entity Recognition; Semantic Role Labelling; MODEL;
DOI
Not available
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
Tamil Siddha medicine, an ancient medicinal system, has yielded a wide range of untapped information about traditional medicines. In this paper, we explore the Natural Language Processing techniques that can be applied to this syntactically rich corpus. As domain information mostly concentrates on the central concepts, we start our work by identifying the Named Entities and categorizing them. An integrated NER classifier is built, comprising SVM and Decision Tree classifiers, with an accuracy as high as 95%. These entities play different roles in different contexts. Hence their roles are labelled along with the predicates surrounding them. These roles and predicates give rise to a rule-based sentence tagging system, trained by an MEM model, to tag different contents in this otherwise unstructured text. These two techniques are then exploited to develop our Information Retrieval System, which combines category tagging done by Named Entity Recognition with content tagging done by Semantic Role Labelling. The system takes full advantage of the rich features of the language and hence can be expanded to other domains.
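The retrieval step described in the abstract, which combines category tags from Named Entity Recognition with content tags from Semantic Role Labelling, might be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the `TaggedSentence` type, the `retrieve` function, the tag names, and the sample sentences are all hypothetical, and the tags are assumed to have been produced already by upstream NER and SRL steps.

```python
from dataclasses import dataclass, field


@dataclass
class TaggedSentence:
    """A sentence with tags assumed to come from upstream NER and SRL steps."""
    text: str
    entity_categories: set = field(default_factory=set)  # hypothetical NER category tags
    content_tags: set = field(default_factory=set)       # hypothetical SRL-derived content tags


def retrieve(index, category, content_tag):
    """Return the text of sentences carrying both the entity category and the content tag."""
    return [
        s.text
        for s in index
        if category in s.entity_categories and content_tag in s.content_tags
    ]


# Made-up index entries for illustration only; not data from the paper.
INDEX = [
    TaggedSentence(
        "Herb A is boiled in water and taken for fever.",
        {"HERB", "DISEASE"},
        {"PREPARATION", "TREATMENT"},
    ),
    TaggedSentence(
        "Herb B grows in dry regions.",
        {"HERB"},
        {"DESCRIPTION"},
    ),
]

results = retrieve(INDEX, "DISEASE", "TREATMENT")
```

Filtering on both tag sets keeps only sentences that mention the right kind of entity and also talk about it in the requested way, which is the combination the abstract attributes to the system.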
Pages: 1830-1848
Page count: 19