AN AUTOMATED SYSTEM FOR TAMIL NAMED ENTITY RECOGNITION USING HYBRID APPROACH

被引:1
作者
Jeyashenbagavalli, N. [1 ]
Srinivasagan, K. G. [1 ]
Suganthi, S. [1 ]
机构
[1] Natl Engn Coll, CSE PG, Kovilpatti, Tamil Nadu, India
来源
2014 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING APPLICATIONS (ICICA 2014) | 2014年
关键词
NLP; NER; HMM; POS tagging; Morphological analyzer;
D O I
10.1109/ICICA.2014.95
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named Entity Recognition is the process of identifying and recognizing named entities such as person, organization, location, date, time and money in the text documents. Named Entity Recognition is a subtask of Information Extraction. Information Extraction is the process of extracting the relevant data from documents. It is one of the research areas in Natural language processing. In this project implement a named entity recognizer using the hybrid approach that uses both Rule based and Hidden Markov Model in succession, which identifies only person, location and organization names respectively. Input data for proposed Named Entity Recognition system is any text document related to the any domain but limited size corpora respectively in Tamil language. In this system are tagging each word by using POS tagger and then imposing certain rules such as Lexical features and use some Gazetteers. HMM model using E-M algorithm is taken output data from trained as input to recognition system. The main purpose of this system identifies unknown entities and solves the problem of same name entity in different positions in the same document. The system is measuring the recall and precision parameters calculate the F-measure score. Goal of this project is to improve the performance of NER system to achieving high F-measure score.
引用
收藏
页码:435 / 439
页数:5
相关论文
共 12 条
[1]  
Amarappa S, 2011, IJECSE MAR, P281
[2]  
[Anonymous], 2009, NO EUROPEAN J LANGUA
[3]  
[Anonymous], IJCSET
[4]  
Chiong Raymond, 2008, NAMED ENTITY RECOGNI
[5]  
Chopra Deepti, 2012, IJIST, V2
[6]  
Gupta V., 2011, INT J COMPUTER APPL, V33, P28
[7]  
Hasanuzzaman Mohammad, 2009, INT J RECENT TRENDS, V1
[8]  
Lakshmana Pandian S, 2012, INT J COMPUT APPL, V46, P36
[9]  
Malarkodi C S., 2012, TAMIL NER COPING REA, P23
[10]  
Pandian Lakshmana, 2008, INFOS2008 MARCH 27 2