Improved Algorithm Based on Decision Tree for Semantic Information Retrieval

被引:2
|
作者
Wang, Zhe [1 ,2 ]
Zhao, Yingying [1 ]
Dong, Hai [3 ]
Xu, Yulong [1 ]
Lv, Yali [1 ]
机构
[1] Henan Univ Chinese Med, Sch Informat Technol, Zhengzhou 450046, Peoples R China
[2] Donghua Univ, Sch Informat Sci & Technol, Shanghai 201620, Peoples R China
[3] RMIT Univ, Sch Comp Technol, Melbourne, Vic 3001, Australia
关键词
Semantic; information retrieval; decision tree; SIMILARITY; DESIGN; FUSION;
D O I
10.32604/iasc.2021.016434
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The quick retrieval of target information from a massive amount of information has become a core research area in the field of information retrieval. Semantic information retrieval provides effective methods based on semantic comprehension, whose traditional models focus on multiple rounds of detection to differentiate information. Since a large amount of information must be excluded, retrieval efficiency is low. One of the most common methods used in classification, the decision tree algorithm, first selects attributes with higher information entropy to construct a decision tree. However, the tree only matches words on the grammatical level and does not consider the semantic of the information and lacks understanding of the information; meanwhile, it increases the amount of calculation and the complexity of the algorithm on synonymous fields, and the classification quality is not high. We investigate the retrieval method, unstructured processing with different semantic data, extracting the attribute features of semantic information, creating a multi-layered structure for the attribute features, calculating the window function according to the theory of multi-level analytic fusion, and fusing different levels of data. Then, we calculate the expected entropy of semantic information, undertake the boundary treatment of the attributes, calculate the information gain and information gain ratio of the attributes, and set the largest gain ratio of semantic data as the nodes of the decision tree. Our results reveal the algorithm's superior effectiveness in semantic information retrieval. Experimental results verify that the algorithm improves the expressing ability of knowledge in the information retrieval system and improves the time efficiency of semantic information retrieval.
引用
收藏
页码:419 / 429
页数:11
相关论文
共 50 条
  • [41] Discipline Decision Tree Classification Algorithm and Application based on Weighted Information Gain Ratio
    Xia, Yan
    Shu, Jian
    Xu, Na
    Feng, Hui
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED EDUCATION, VOL 2 (CSEDU), 2016, : 77 - 84
  • [42] Research and Application on Web Information Retrieval Based on Improved FP-Growth Algorithm
    JIAO Minghai~ 1
    2.School of Information Science and Engineering
    WuhanUniversityJournalofNaturalSciences, 2006, (05) : 1065 - 1068
  • [43] Biomedical Semantic Information Retrieval
    Lopez-Ubeda, Pilar
    Carlos Diaz-Galiano, Manuel
    Montejo-Raez, Arturo
    Martinez-Santiago, Fernando
    Andreu-Marin, Alberto
    Teresa Martin-Valdivia, M.
    Urena Lopez, L. Alfonso
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2018, (61): : 189 - 192
  • [44] Information retrieval by semantic similarity
    Hliaoutakis, Angelos
    Varelas, Giannis
    Voutsakis, Epimenidis
    Petrakis, Euripides G. M.
    Milios, Evangelos
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2006, 2 (03) : 55 - 73
  • [45] An Improved Sample Selection Algorithm in Fuzzy Decision Tree Induction
    Dong, Ling-Cai
    Wang, Dan
    Wang, Xi-Zhao
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 629 - 634
  • [46] Research on Information Retrieval Algorithm Based on TextRank
    Xu, Chenchen
    2019 34RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2019, : 185 - 188
  • [47] Battery SOH estimation based on decision tree and improved support vector machine regression algorithm
    Qian, Lijun
    Xuan, Liang
    Chen, Jian
    FRONTIERS IN ENERGY RESEARCH, 2023, 11
  • [48] A decision tree using ID3 algorithm for English semantic analysis
    Phu V.N.
    Tran V.T.N.
    Chau V.T.N.
    Dat N.D.
    Duy K.L.D.
    International Journal of Speech Technology, 2017, 20 (03) : 593 - 613
  • [49] Research on Police Performance Appraisal Early Warning Methods Based on Improved Decision Tree Algorithm
    Cui, Yumei
    Xu, Jinduo
    Xu, Linying
    CONTEMPORARY RESEARCH ON E-BUSINESS TECHNOLOGY AND STRATEGY, 2012, 332 : 571 - 583