Big Data and Named Entity Recognition Approaches for Urdu Language

被引:1
作者
Jamil, Qudsia [1 ]
Zafar, Muhammad Rehman [1 ]
机构
[1] Bahria Univ, Dept Comp Sci, Islamabad, Pakistan
关键词
Big Data; Named Entity Recognition; Urdu Text Processing; Natural Language Processing(NLP);
D O I
10.4108/eai.13-4-2018.154469
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays data is stored in digital form and Terabyte of data is generated on daily basis. It is difficult task to extract useful information from Big data efficiently. From unstructured text Information extraction is a technique which used to extract information. Named Entity Recognition (NER) is an essential component of information extraction in the field of Natural Language Processing (NLP). Further, Urdu language has various challenges to NER due to its agglutinative, inflectional nature and rich morphology. Therefore, NER systems for Urdu language are not mature yet due to lack of resources and ambiguities. This paper specifically addresses the different approaches to NER and explore the existing work for NER in Urdu language.
引用
收藏
页码:1 / 5
页数:5
相关论文
共 18 条
[1]  
Becker D., 2002, P INT C INT COMP IC, P757
[2]  
Becker D., 2002, 3 WORKSH AS LANG RES, P1
[3]  
Bikel D. M., 1997, ANLP, P194, DOI [10.3115/974557.974586, DOI 10.3115/974557.974586]
[4]   An algorithm that learns what's in a name [J].
Bikel, DM ;
Schwartz, R ;
Weischedel, RM .
MACHINE LEARNING, 1999, 34 (1-3) :211-231
[5]  
Borthwick Andrew, 1999, THESIS
[6]  
Ekbal A., 2008, IJCNLP, P33
[7]  
Fresko M., 2005, P 14 ACM INT C INF K, P361
[8]  
Gali K, 2008, P IJCNLP 08 WORKSH N, P25
[9]  
Gantz J., 2011, IDC IVIEW, V1142, P1
[10]  
Kumar P., 2008, P IJCNLP 08 WORKSH N, P83