A Novel TF-IDF Weighting Scheme for Effective Ranking

Cited by: 0
Authors
Paik, Jiaul H. [1]
Affiliation
[1] Indian Statistical Institute, Kolkata, India
Source
SIGIR'13: The Proceedings of the 36th International ACM SIGIR Conference on Research & Development in Information Retrieval | 2013
Keywords
Document ranking; Retrieval model; Term weighting; INFORMATION-RETRIEVAL; MODEL;
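DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract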
Term weighting schemes are central to the study of information retrieval systems. This article proposes a novel TF-IDF term weighting scheme that employs two different within-document term frequency normalizations to capture two different aspects of term saliency. One component of the term frequency is effective for short queries, while the other performs better on long queries. The final weight is a weighted combination of these two components, with the mixing weight determined by the length of the corresponding query. Experiments conducted on a large number of TREC news and web collections demonstrate that the proposed scheme almost always outperforms five state-of-the-art retrieval models with remarkable significance and consistency. The experimental results also show that the proposed model achieves significantly better precision than the existing models.
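The idea described in the abstract, two term-frequency normalizations blended by a weight that depends on query length, can be sketched roughly as follows. This is a minimal illustration and not the paper's definitive formulation: the abstract gives no formulas, so the specific functional forms below (the log-ratio normalization, the length-regularized normalization, the `x / (1 + x)` squashing, the mixing weight, and the IDF variant) and all function and parameter names are assumptions chosen for illustration.

```python
import math


def query_length_weight(query_len: int) -> float:
    """Assumed mixing weight: 1.0 for one-term queries, smaller for longer ones."""
    return 2.0 / (1.0 + math.log2(1.0 + query_len))


def tf_relative(tf: float, avg_tf_in_doc: float) -> float:
    """Assumed normalization 1: term frequency relative to the document's mean TF."""
    return math.log2(1.0 + tf) / math.log2(1.0 + avg_tf_in_doc)


def tf_length_regularized(tf: float, doc_len: float, avg_doc_len: float) -> float:
    """Assumed normalization 2: term frequency damped for longer-than-average documents."""
    return tf * math.log2(1.0 + avg_doc_len / doc_len)


def squash(x: float) -> float:
    """Map a non-negative value into [0, 1) so both components are comparable."""
    return x / (1.0 + x)


def combined_tf(tf, avg_tf_in_doc, doc_len, avg_doc_len, query_len) -> float:
    """Blend the two components; short queries lean on the first, long on the second."""
    w = query_length_weight(query_len)
    a = squash(tf_relative(tf, avg_tf_in_doc))
    b = squash(tf_length_regularized(tf, doc_len, avg_doc_len))
    return w * a + (1.0 - w) * b


def term_score(tf, avg_tf_in_doc, doc_len, avg_doc_len, query_len,
               num_docs, doc_freq) -> float:
    """TF-IDF style score for one query term in one document (assumed IDF form)."""
    idf = math.log2((num_docs + 1) / doc_freq)
    return combined_tf(tf, avg_tf_in_doc, doc_len, avg_doc_len, query_len) * idf
```

With these assumed forms, a one-term query uses only the first component, while a seven-term query weights both components equally; a document's score for a query would be the sum of `term_score` over the query terms it contains.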
Pages: 343 - 352
Number of pages: 10
Related papers
50 in total
  • [21] Research on Chinese Classification Based on TF-IDF
    Xiao, Liang
    Yao, Nianmin
    2021 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, INFORMATION AND COMMUNICATION ENGINEERING, 2021, 11933
  • [22] TERM WEIGHTING: NOVEL FUZZY LOGIC BASED METHOD VS. CLASSICAL TF-IDF METHOD FOR WEB INFORMATION EXTRACTION
    Ropero, Jorge
    Gomez, Ariel
    Leon, Carlos
    Carrasco, Alejandro
    ICEIS 2009 : PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL AIDSS, 2009, : 130 - 137
  • [23] Improvement in TF-IDF scheme for web pages based on the contents of their hyperlinked neighboring pages
    Sugiyama, Kazunari
    Hatano, Kenji
    Yoshikawa, Masatoshi
    Uemura, Shunsuke
    Systems and Computers in Japan, 2005, 36 (14) : 56 - 68
  • [24] Implementation of Information Retrieval Using Tf-Idf Weighting Method On Detik.Com's Website
    Khusna, Arfiani Nur
    Agustina, Indri
    2018 12TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATION SYSTEMS, SERVICES, AND APPLICATIONS (TSSA), 2018,
  • [25] A new neutrosophic TF-IDF term weighting for text mining tasks: text classification use case
    Bounabi, Mariem
    Elmoutaouakil, Karim
    Satori, Khalid
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2021, 17 (03) : 229 - 249
  • [26] Continuous Speech Recognition with a TF-IDF Acoustic Model
    Zweig, Geoffrey
    Nguyen, Patrick
    Droppo, Jasha
    Acero, Alex
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2858 - 2861
  • [27] Can TF-IDF and fuzzy logic improve onomasiological inference ranking?: Or keywords frequency is good enough?
    Barron-Cedeno, Alberto
    Sierra, Gerardo
    Kemper, Nicolas
    WSEAS: ADVANCES ON APPLIED COMPUTER AND APPLIED COMPUTATIONAL SCIENCE, 2008, : 358 - +
  • [28] Arabic Questions Classification Using Modified TF-IDF
    Alammary, Ali Saleh
    IEEE ACCESS, 2021, 9 : 95109 - 95122
  • [29] Research on case reasoning method based on TF-IDF
    Zhang, Lin
    International Journal of System Assurance Engineering and Management, 2021, 12 : 608 - 615
  • [30] Construction of Military Intelligence Model Based on TF-IDF
    Han, Min-Qian
    Chai, Han-Peng
    Zong, Qiang
    SECOND IYSF ACADEMIC SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING, 2021, 12079