A Novel TF-IDF Weighting Scheme for Effective Ranking

被引:0
|
作者
Paik, Jiaul H. [1 ]
机构
[1] Indian Stat Inst, Kolkata, India
来源
SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL | 2013年
关键词
Document ranking; Retrieval model; Term weighting; INFORMATION-RETRIEVAL; MODEL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Term weighting schemes are central to the study of information retrieval systems. This article proposes a novel TF-IDF term weighting scheme that employs two different within document term frequency normalizations to capture two different aspects of term saliency. One component of the term frequency is effective for short queries, while the other performs better on long queries. The final weight is then measured by taking a weighted combination of these components, which is determined on the basis of the length of the corresponding query. Experiments conducted on a large number of TREC news and web collections demonstrate that the proposed scheme almost always outperforms five state of the art retrieval models with remarkable significance and consistency. The experimental results also show that the proposed model achieves significantly better precision than the existing models.
引用
收藏
页码:343 / 352
页数:10
相关论文
共 50 条
  • [31] Improvement of TF-IDF Algorithm Based on Knowledge Graph
    Wang, Yanpeng
    Zhang, Dehai
    Yuan, Ye
    Liu, Qing
    Yang, Yun
    2018 IEEE/ACIS 16TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATION (SERA), 2018, : 19 - 24
  • [32] Evaluation of the Delta TF-IDF Features for Sentiment Analysis
    Samoylov, Andrew B.
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, 2014, 436 : 207 - 212
  • [33] KNN with TF-IDF Based Framework for Text Categorization
    Trstenjak, Bruno
    Mikac, Sasa
    Donko, Dzenana
    24TH DAAAM INTERNATIONAL SYMPOSIUM ON INTELLIGENT MANUFACTURING AND AUTOMATION, 2013, 2014, 69 : 1356 - 1364
  • [34] An information-theoretic perspective of tf-idf measures
    Aizawa, A
    INFORMATION PROCESSING & MANAGEMENT, 2003, 39 (01) : 45 - 65
  • [35] Research on case reasoning method based on TF-IDF
    Zhang, Lin
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2021, 12 (03) : 608 - 615
  • [36] 基于TF-IDF的Webshell文件检测
    赵瑞杰
    施勇
    张涵
    龙军
    薛质
    计算机科学, 2020, 47(S2) (S2) : 363 - 367
  • [37] Investigating Verbal Intelligence using the TF-IDF Approach
    Zablotskaya, Kseniya
    Fernandez Martinez, Fernando
    Minker, Wolfgang
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1573 - 1576
  • [38] Continuous speech recognition with a TF-IDF acoustic model
    Microsoft Research, Redmond, WA, United States
    Proc. Annu. Conf. Int. Speech Commun. Assoc., INTERSPEECH, (2854-2857):
  • [39] A Scalable Method for Detecting Multiple Loci Associated with Traits using TF-IDF Weighting and Association Rule Mining
    Lee, Sunwon
    Kang, Jaewoo
    Oh, Junho
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2010, : 318 - 323
  • [40] Interpreting TF-IDF term weights as making relevance decisions
    Wu, Ho Chung
    Luk, Robert Wing Pong
    Wong, Kam Fai
    Kwok, Kui Lam
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2008, 26 (03)