Strong similarity measures for ordered sets of documents in information retrieval

Cited by: 24
Authors
Egghe, L
Michel, C
Affiliations
[1] Limburgs Univ Ctr, B-3590 Diepenbeek, Belgium
[2] Univ Instelling Antwerp, B-2610 Antwerp, Belgium
[3] DU Bordeaux 3, MSHA, CEM GRESIC, F-33607 Pessac, France
DOI
10.1016/S0306-4573(01)00051-6
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
A general method is presented to construct ordered similarity measures (OS-measures), i.e., similarity measures for ordered sets of documents (e.g., the result of an IR process), based on classical, well-known similarity measures for ordinary sets (measures such as Jaccard, Dice, Cosine or overlap measures). To this end, we first present a review of these measures and their relationships. The method given here to construct OS-measures extends the one given by Michel in a previous paper so that it becomes applicable to any pair of ordered sets. Concrete expressions of this method, applied to the classical similarity measures, are given. Some of these measures are then tested in the IR-system Profil-Doc. The engine SPIRIT extracts ranked document sets in three different contexts, each for 550 requests. The practical usability of the OS-measures is then discussed based on these experiments. © 2002 Elsevier Science Ltd. All rights reserved.
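For orientation, the classical measures for ordinary (unordered) sets named in the abstract can be sketched as below. This is a minimal illustration using the textbook definitions, not the OS-measure construction from the paper. The function names, the sample document identifiers, the choice of the min-based variant of the overlap measure, and the convention of returning 0.0 for empty inputs are all assumptions made for this sketch.

```python
from math import sqrt


def jaccard(a: set, b: set) -> float:
    """|A ∩ B| / |A ∪ B|; returns 0.0 when both sets are empty (assumed convention)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def dice(a: set, b: set) -> float:
    """2 |A ∩ B| / (|A| + |B|); returns 0.0 when both sets are empty (assumed convention)."""
    total = len(a) + len(b)
    return 2 * len(a & b) / total if total else 0.0


def cosine(a: set, b: set) -> float:
    """|A ∩ B| / sqrt(|A| * |B|); returns 0.0 if either set is empty (assumed convention)."""
    denom = sqrt(len(a) * len(b))
    return len(a & b) / denom if denom else 0.0


def overlap(a: set, b: set) -> float:
    """|A ∩ B| / min(|A|, |B|), the min-based overlap variant (assumed choice)."""
    smaller = min(len(a), len(b))
    return len(a & b) / smaller if smaller else 0.0


if __name__ == "__main__":
    # Hypothetical document IDs returned by two retrieval runs (order ignored here).
    run_1 = {"d1", "d2", "d3", "d4"}
    run_2 = {"d3", "d4", "d5"}
    for measure in (jaccard, dice, cosine, overlap):
        print(measure.__name__, round(measure(run_1, run_2), 3))
```

All four functions ignore document rank; taking rank into account is the gap that the ordered measures described in the abstract are meant to fill.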
Pages: 823-848
Page count: 26
Related papers
50 in total
  • [1] Construction of weak and strong similarity measures for ordered sets of documents using fuzzy set techniques
    Egghe, L
    Michel, C
    INFORMATION PROCESSING & MANAGEMENT, 2003, 39 (05) : 771 - 807
  • [4] PICTURE INFORMATION MEASURES FOR SIMILARITY RETRIEVAL
    CHANG, SK
    YANG, CC
    COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1983, 23 (03) : 366 - 375
  • [5] Similarity Measures for Music Information Retrieval
    Della Ventura, Michele
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, ICCSAMA 2017, 2018, 629 : 165 - 174
  • [6] Semantic similarity measures for enhancing information retrieval in folksonomies
    Uddin, Mohammed Nazim
    Trong Hai Duong
    Ngoc Thanh Nguyen
    Qi, Xin-Min
    Jo, Geun Sik
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) : 1645 - 1653
  • [7] On the Use of Fuzzy Information Retrieval for Gauging Similarity of Arabic Documents
    Alzahrani, Salha Mohammed
    Salim, Naomie
    2009 SECOND INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES (ICADIWT 2009), 2009 : 539 - +
  • [8] Probability model of sensitive similarity measures in information retrieval
    Gu, Xiaolong
    Zhang, Jie
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (01)
  • [9] Query-sensitive similarity measures for information retrieval
    Tombros, A
    van Rijsbergen, CJ
    KNOWLEDGE AND INFORMATION SYSTEMS, 2004, 6 (05) : 617 - 642
  • [10] Query-Sensitive Similarity Measures for Information Retrieval
    Anastasios Tombros
    C.J. van Rijsbergen
    Knowledge and Information Systems, 2004, 6 : 617 - 642