EVALUATING SEMANTIC RELATEDNESS USING WIKIPEDIA-BASED REPRESENTATIVE FEATURES ANALYSIS

被引:0
作者
Cui, Qing-jun [1 ]
Zhang, Hui [1 ]
Liu, Rui [1 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China
来源
2011 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, MEASUREMENT, CIRCUITS AND SYSTEMS (ICIMCS 2011), VOL 3: COMPUTER-AIDED DESIGN, MANUFACTURING AND MANAGEMENT | 2011年
关键词
Representative Features; semantic relatedness; Wikipedia; Concept Interpreting Network;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In order to evaluate semantic relatedness of natural language concepts automatically, we propose Representative Features Analysis (RFA), a novel approach that represents the meaning of concepts in a high-dimensional space of representative features as a semantic-surrounding concept vector. The vector elements are weighted by the combination of TF-IDF scheme and the link status of Concept Interpreting Network in which nodes represent the concepts and edges represent the interpreting relation between concepts. Assessing the relatedness amounts to comparing the corresponding vectors using conventional metrics. Compared with the previous state of the art, using RFA results in substantial improvements in correlation of computed relatedness scores with human judgments: from r = 0.75 to 0.78 for concepts and performs better in recalling the top n relevant concepts than ESA method. Importantly, the RFA model could evaluate semantic similarity for concepts with low occurrence in Wikipeida articles and eliminate the negative effect caused by the meaningless occurrence of words in the Wikipedia articles, which the approach of ESA neglects.
引用
收藏
页码:467 / 472
页数:6
相关论文
共 50 条
  • [31] Semantic Relatedness Measurement from Wikipedia and WordNet Using Modified Normalized Google Distance
    Karve, Saket
    Shende, Vasisht
    Hople, Swaroop
    DATA ANALYTICS AND LEARNING, 2019, 43 : 143 - 154
  • [32] Comparing Semantic Relatedness between Word Pairs in Portuguese Using Wikipedia
    Granada, Roger
    Trojahn, Cassia
    Vieira, Renata
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, 2014, 8775 : 170 - 175
  • [33] Taxonomy-based information content and wordnet-wiktionary-wikipedia glosses for semantic relatedness
    Ben Aouicha, Mohamed
    Taieb, Mohamed Ali Hadj
    Ben Hamadou, Abdelmajid
    APPLIED INTELLIGENCE, 2016, 45 (02) : 475 - 511
  • [34] EVALUATING RERANKING METHODS USING WIKIPEDIA FEATURES
    Kurakado, Koji
    Oishi, Tetsuya
    Hasegawa, Ryuzo
    Fujita, Hiroshi
    Koshimura, Miyuki
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2011, : 376 - 381
  • [35] Taxonomy-based information content and wordnet-wiktionary-wikipedia glosses for semantic relatedness
    Mohamed Ben Aouicha
    Mohamed Ali Hadj Taieb
    Abdelmajid Ben Hamadou
    Applied Intelligence, 2016, 45 : 475 - 511
  • [36] Open domain question answering using Wikipedia-based knowledge model
    Ryu, Pum-Mo
    Jang, Myung-Gil
    Kim, Hyun-Ki
    INFORMATION PROCESSING & MANAGEMENT, 2014, 50 (05) : 683 - 692
  • [37] Graph-Based Domain-Specific Semantic Relatedness from Wikipedia
    Sajadi, Armin
    ADVANCES IN ARTIFICIAL INTELLIGENCE, CANADIAN AI 2014, 2014, 8436 : 381 - 386
  • [38] A new semantic relatedness measurement using WordNet features
    Taieb, Mohamed Ali Hadj
    Ben Aouicha, Mohamed
    Ben Hamadou, Abdelmajid
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 467 - 497
  • [39] Keyterm Extraction from Microblogs' Messages using Wikipedia-based Keyphraseness Measure
    Korshunov, Anton
    2012 6TH INTERNATIONAL CONFERENCE ON SCIENCES OF ELECTRONICS, TECHNOLOGIES OF INFORMATION AND TELECOMMUNICATIONS (SETIT), 2012, : 925 - 931
  • [40] A new semantic relatedness measurement using WordNet features
    Mohamed Ali Hadj Taieb
    Mohamed Ben Aouicha
    Abdelmajid Ben Hamadou
    Knowledge and Information Systems, 2014, 41 : 467 - 497