Morphologically rich Urdu grammar parsing using Earley algorithm

Cited by: 6
Authors
Abbas, Qaiser [1]
Affiliations
[1] Univ Konstanz, Fachbereich Sprachwissensch, D-78457 Constance, Germany
Keywords
Encoded information; Evaluation results; F-score; State of the art; Treebanks
DOI
10.1017/S1351324915000133
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work presents the development and evaluation of an extended Urdu parser. It focuses on issues specific to this parser and describes the changes made to the Earley algorithm to obtain accurate and relevant results. The parser uses a morphologically rich context-free grammar extracted from a linguistically rich Urdu treebank. This grammar encodes enough information to be comparable with state-of-the-art parsing requirements for the morphologically rich Urdu language. Together, the extended parsing model and the linguistically rich extracted grammar yield better evaluation results in the Urdu/Hindi parsing domain. The parser achieves an F-score of 87%, outperforming existing treebank-based parsing work on Urdu/Hindi.
Pages: 775-810
Number of pages: 36