Aggregation of Word Embedding and Q-learning for Arabic Anaphora Resolution

被引:0
|
作者
Bouzid, Saoussen Mathlouthi [1 ]
Zribi, Chiraz Ben Othmane [1 ]
机构
[1] Univ Manouba, Natl Sch Comp Sci, RIADI Lab, Manouba, Tunisia
来源
ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019 | 2019年 / 1108卷
关键词
Word2vec; Q-learning; Syntactic; Semantic; Self-training; SVM; Ranking aggregation; Pronominal anaphora; Arabic;
D O I
10.1007/978-3-030-32959-4_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In many linguistic situations, the repetitions of objects and entities are reduced to the pronoun. The correct interpretation of pronouns plays an important role in the construction of meaning. Thus, the resolution of the pronominal anaphors remains a very important task for most natural language processing applications. This paper presents a novel approach to resolve pronominal anaphora in Arabic texts. At first, we identify non-referential pronouns by using an iterative self-training SVM method. After, we resolve the antecedents by combining a Q-learning method with a Word2Vec based method. The Q-learning method seeks to optimize, for each anaphoric pronoun, a sequence of criteria choice to evaluate the antecedents and look for the best. It uses syntactic criteria as preference factors to favor candidate antecedents over others. The Word2Vec method uses the word embedding model AraVec 3.0. It provides the semantic similarity measures between antecedent word vectors and pronoun context vectors. To combine Q-learning and Word2Vec results, we use a ranking aggregation method. The resolution system is evaluated on literary, journalistic and technical manual texts. Its precision rate reaches until 80.82%.
引用
收藏
页码:93 / 107
页数:15
相关论文
共 50 条
  • [1] FedQL: Q-Learning Guided Aggregation for Federated Learning
    Cao, Mei
    Zhao, Mengying
    Zhang, Tingting
    Yu, Nanxiang
    Lu, Jianbo
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT I, 2024, 14487 : 263 - 282
  • [2] A Novel Approach Based on Reinforcement Learning for Anaphora Resolution in Arabic texts
    Mathlouthi, Saoussen
    Trabelsi, Feriel Ben Fraj
    Zribi, Chiraz Ben Othmane
    VISION 2020: INNOVATION MANAGEMENT, DEVELOPMENT SUSTAINABILITY, AND COMPETITIVE ECONOMIC GROWTH, 2016, VOLS I - VII, 2016, : 625 - 638
  • [3] Morphological Word Embedding for Arabic
    Salama, Rana Aref
    Youssef, Abdou
    Fahmy, Aly
    ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 : 83 - 93
  • [4] Efficient Learning Approach for Pronominal Anaphora and Ellipsis Identification and Resolution in Arabic Texts
    Bouzid, Saoussen Mathlouthi
    Zribi, Chiraz Ben Othmane
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 3335 - 3348
  • [5] Word Sense Disambiguation for Arabic Exploiting Arabic WordNet and Word Embedding
    Alkhatlan, Ali
    Kalita, Jugal
    Alhaddad, Ahmed
    ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 : 50 - 60
  • [6] A Comparative Analysis of Word Embedding and Deep Learning for Arabic Sentiment Classification
    Sabbeh, Sahar F.
    Fasihuddin, Heba A.
    ELECTRONICS, 2023, 12 (06)
  • [7] A Comparative Study of Linguistic and Computational Features Based on a Machine Learning for Arabic Anaphora Resolution
    Abolohom, Abdullatif
    Omar, Nazlia
    Pais, Sebastiao
    Cordeiro, Joao
    AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 37 - 47
  • [8] Q-LEARNING
    WATKINS, CJCH
    DAYAN, P
    MACHINE LEARNING, 1992, 8 (3-4) : 279 - 292
  • [9] Arabic Anaphora Resolution: Corpora Annotation with Coreferential Links
    Hammami, Souha
    Belguith, Lamia
    Ben Hamadou, Abdelmajid
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2009, 6 (05) : 481 - 489
  • [10] On the Use of Q-Learning to Solve the Selectable Virtual Network Embedding Problem
    Wang, Sen
    Zhang, Biao
    2018 IEEE 37TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2018,