A comprehensive review on feature set used for anaphora resolution

Cited by: 8
Authors
Lata, Kusum [1]
Singh, Pardeep [1]
Dutta, Kamlesh [1]
Affiliations
[1] Natl Inst Technol Hamirpur, Comp Sci & Engn Dept, Hamirpur, Himachal Prades, India
Keywords
Anaphora; Anaphora resolution; Anaphor; Antecedent; Feature set; Feature selection; Natural language processing; WINOGRAD SCHEMA CHALLENGE; FEATURE-SELECTION; COREFERENCE RESOLUTION; PRONOUN RESOLUTION; CORPUS; INFORMATION; SYSTEM; LINGUISTICS; FRAMEWORK; ALGORITHM;
DOI
10.1007/s10462-020-09917-3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In linguistics, Anaphora Resolution (AR) is the task of identifying the antecedent of an anaphor. In simple terms, it determines what a referring expression refers to. It is considered one of the more difficult tasks in Natural Language Processing (NLP). AR's growing popularity among researchers is attributable to its strong relevance to machine translation, text summarization, chatbots, question answering, and many other applications. This paper reviews AR approaches in terms of the significant features used to perform the task and presents the evaluation metrics used in the field. A feature is a piece of information relevant to AR that describes the anaphor, the antecedent, and the relation between them. In this context, features represent the lexical, syntactic, semantic, and positional relationships between an anaphor and its candidate antecedents. The performance of an AR system depends heavily on the features it uses, so feature selection is highly significant. The main emphasis is on providing an overview of the various features needed to extract the anaphor and the antecedent, respectively, as used in the different AR systems in the literature. It is observed that syntactic information improves the correctness of determining the properties that indicate the existence of an anaphor and of identifying the antecedent. The trend is now shifting from methods that depend on hand-crafted features to deep learning approaches, which try to learn feature representations. The performance of deep learning is improving owing to the availability of additional data and more powerful computing resources. This survey presents the state of the art to support a better understanding of the AR problem from the feature selection perspective.
The findings of this survey provide valuable insight into present trends and are helpful for researchers looking to develop an AR system within given constraints.
Pages: 2917-3006
Number of pages: 90