Term Weighting Schemes Experiment Based on SVD for Malay Text Retrieval

Authors
Ab Samat, Nordianah [1]
Murad, Masrah Azrifah Azmi [1]
Abdullah, Muhamad Taufik [1]
Atan, Rodziah [1]
Affiliation
[1] Univ Putra Malaysia, Fac Comp Sci & Informat Technol, Serdang 43400, Selangor, Malaysia
Keywords
Singular Value Decomposition; term weighting; Malay document
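DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract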
The goal of information retrieval is to locate the documents relevant to a user's query while retrieving as few irrelevant documents as possible. One approach to this problem is to use the Singular Value Decomposition (SVD), which models documents and queries as vectors in a reduced space. The components of each vector are determined by the term weighting scheme, a function of the frequencies of the terms in the document or query. In this paper, we discuss term weighting schemes and report the results of an experiment on Malay text retrieval using a collection of Malay documents.
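The pipeline the abstract describes (weight the term frequencies, reduce the term-by-document matrix with the SVD, then compare query and document vectors in the reduced space) can be sketched as below. This is a minimal illustration and not the authors' implementation: the abstract does not say which weighting schemes were tested, so tf-idf is assumed here as one common choice, and the toy Malay documents, the function names `build_weighted_matrix` and `lsi_scores`, and the rank `k=2` are all hypothetical.

```python
import numpy as np

def build_weighted_matrix(docs):
    """Return (vocab, idf, A), where A is a term-by-document tf-idf matrix.

    Assumed weighting: raw term frequency times log(N / document frequency).
    """
    vocab = sorted({tok for d in docs for tok in d.split()})
    index = {t: i for i, t in enumerate(vocab)}
    tf = np.zeros((len(vocab), len(docs)))
    for j, doc in enumerate(docs):
        for tok in doc.split():
            tf[index[tok], j] += 1
    df = np.count_nonzero(tf, axis=1)   # number of documents containing each term
    idf = np.log(len(docs) / df)        # df >= 1 because vocab comes from docs
    return vocab, idf, tf * idf[:, None]

def lsi_scores(docs, query, k=2):
    """Cosine similarity between the query and each document in rank-k space."""
    vocab, idf, A = build_weighted_matrix(docs)
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    Uk = U[:, :k]                       # top-k left singular vectors
    doc_vecs = Uk.T @ A                 # documents projected into the reduced space

    # Weight the query with the same scheme, ignoring out-of-vocabulary terms.
    index = {t: i for i, t in enumerate(vocab)}
    q = np.zeros(len(vocab))
    for tok in query.split():
        if tok in index:
            q[index[tok]] += 1
    q_vec = Uk.T @ (q * idf)            # query projected into the same space

    eps = 1e-12                         # guards against division by zero
    norms = np.linalg.norm(doc_vecs, axis=0) * np.linalg.norm(q_vec) + eps
    return (q_vec @ doc_vecs) / norms

# Hypothetical toy collection: two documents about cats, two about cars.
docs = [
    "kucing makan ikan",
    "kucing minum susu",
    "kereta laju jalan raya",
    "kereta baru jalan",
]
scores = lsi_scores(docs, "kucing ikan", k=2)
```

In this toy collection the two "kucing" documents score higher than the two "kereta" documents, including the second document, which shares only one term with the query. Swapping the weighting inside `build_weighted_matrix` is the natural place to compare schemes.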
Pages: 357-361
Page count: 5