Enhancing Deep Learning with Embedded Features for Arabic Named Entity Recognition

被引:0
作者
Lotfy, Ali [1 ]
Sabty, Caroline [2 ]
Abdennadher, Slim [2 ]
机构
[1] German Univ Cairo, El Tagamoa El Khames, New Cairo, Egypt
[2] German Int Univ, Adm Capital,Reg Ring Rd, Cairo, Egypt
来源
LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022年
关键词
Arabic Natural Language Processing; Named Entity Recognition; Deep learning; HYBRID APPROACH;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The introduction of word embedding models has remarkably changed many Natural Language Processing tasks. Word embeddings can automatically capture the semantics of words and other hidden features. Nonetheless, the Arabic language is highly complex, which results in the loss of important information. This paper uses Madamira, an external knowledge source, to generate additional word features. We evaluate the utility of adding these features to conventional word and character embeddings to perform the Named Entity Recognition (NER) task on Modern Standard Arabic (MSA). Our NER model is implemented using Bidirectional Long Short Term Memory and Conditional Random Fields (BiLSTM-CRF). We add morphological and syntactical features to different word embeddings to train the model. The added features improve the performance by different values depending on the used embedding model. The best performance is achieved by using Bert embeddings. Moreover, our best model outperforms the previous systems to the best of our knowledge.
引用
收藏
页码:4904 / 4912
页数:9
相关论文
共 36 条
[1]  
Al-Jallad Ahmad., 2017, ROUTLEDGE HDB ARABIC, P315, DOI [10.4324/9781315147062-17, DOI 10.4324/9781315147062-17]
[2]   Boosting Arabic Named-Entity Recognition With Multi-Attention Layer [J].
Ali, Mohammed Nadher Abdo ;
Tan, Guanzheng ;
Hussain, Aamir .
IEEE ACCESS, 2019, 7 :46575-46582
[3]  
Antoun W., 2020, P 4 WORKSHOP OPEN SO, P9
[4]   Arabic Name Entity Recognition Using Deep Learning [J].
Awad, David ;
Sabty, Caroline ;
Elmahdy, Mohamed ;
Abdennadher, Slim .
STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2018, 2018, 11171 :105-116
[5]  
Bazi I. E., 2018, ARXIV180405630
[6]  
Benajiba Y., 2008, Proceedings of Arab International Conference on Information Technology (ACIT 2008), P16
[7]  
Benajiba Y.Paolo., 2008, P WORKSHOP HLT NLP 6, P143
[8]  
Benajiba Y, 2007, LECT NOTES COMPUT SC, V4394, P143
[9]  
Bojanowski P., 2017, T ASSOC COMPUT LING, V5, P135, DOI DOI 10.1162/TACL_A_00051
[10]  
Chiu J.P., 2016, Transactions of the Association for Computational Linguistics, V4, P357, DOI [10.1162/tacl_a_00104, DOI 10.1162/TACLA00104]