A New Approach for Calculating Semantic Similarity between Words Using WordNet and Set Theory

被引:16
作者
Ezzikouri, Hanane [1 ]
Madani, Youness [1 ]
Erritali, Mohammed [1 ]
Oukessou, Mohamed [1 ]
机构
[1] Sultan Moulay Slimane Univ, Fac Sci & Tech, Beni Mellal, Morocco
来源
10TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2019) / THE 2ND INTERNATIONAL CONFERENCE ON EMERGING DATA AND INDUSTRY 4.0 (EDI40 2019) / AFFILIATED WORKSHOPS | 2019年 / 151卷
关键词
Semantic Similarity; Natural Language Processing; WordNet; Set Theory;
D O I
10.1016/j.procs.2019.04.182
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Calculating semantic similarity between words is a challenging task of a lot of domains such as Natural language processing (NLP), information retrieval and plagiarism detection. WordNet is a lexical dictionary conceptually organized, where each concept has several characteristics: Synsets and Glosses. Synset represent sets of synonyms of a given word and Glosses are a short description. In this paper, we propose a new approach for calculating semantic similarity between two concepts. The proposed method is based on set theory's concepts and WordNet properties, by calculating the relatedness between the synsets' and glosses's of the two concepts. (C) 2019 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/) Peer-review under responsibility of the Conference Program Chairs.
引用
收藏
页码:1261 / 1265
页数:5
相关论文
共 14 条
  • [1] [Anonymous], 2007, P 16 INT WORLD WID W, DOI DOI 10.1145/1242572.1242675
  • [2] [Anonymous], 1998, WORDNET ELECT LEXICA
  • [3] Gupta D, 2014, 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), P2694, DOI 10.1109/ICACCI.2014.6968314
  • [4] Hirst G, 1998, LANG SPEECH & COMMUN, P305
  • [5] Lin D., 1998, Machine Learning. Proceedings of the Fifteenth International Conference (ICML'98), P296
  • [6] Lin Dekang, 1993, P 31 ANN M ASS COMP, P112, DOI DOI 10.3115/981574.981590
  • [7] The Stanford CoreNLP Natural Language Processing Toolkit
    Manning, Christopher D.
    Surdeanu, Mihai
    Bauer, John
    Finkel, Jenny
    Bethard, Steven J.
    McClosky, David
    [J]. PROCEEDINGS OF 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, 2014, : 55 - 60
  • [8] WORDNET - A LEXICAL DATABASE FOR ENGLISH
    MILLER, GA
    [J]. COMMUNICATIONS OF THE ACM, 1995, 38 (11) : 39 - 41
  • [9] DEVELOPMENT AND APPLICATION OF A METRIC ON SEMANTIC NETS
    RADA, R
    MILI, H
    BICKNELL, E
    BLETTNER, M
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1989, 19 (01): : 17 - 30
  • [10] Semantic similarity in a taxonomy: An information-based measure and its application to problems of ambiguity in natural language
    Resnik, P
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1999, 11 : 95 - 130