An improved method for the feature extraction of Chinese text by combining rough set theory with automatic abstracting technology

Cited by: 0
Authors
Shen, Min [1]
Dong, Baosen [1]
Xu, Linying [1]
Affiliation
[1] School of Computer Science and Technology, Tianjin University, 300072 Tianjin, China
Keywords
Data mining - Forestry - Abstracting - Information retrieval systems - Semantics - Information retrieval - Search engines
DOI
10.1007/978-3-642-34447-3_44
Abstract
Rough Set Theory can reduce the features of Chinese text effectively [1], but the reduction often takes a very long time when the number of training sets is large [2]. To address this problem, this article proposes a method that combines Rough Set Theory with Automatic Abstracting Technology (AAT). First, the method calculates the weight of each node in the Concept Hierarchy Tree [3], which is based on the Tongyici Cilin semantic dictionary [4] [5]; the weight consists of the Self-Frequency, Tree Frequency, Concept Generalization Degree and Concept Selection Degree. These weights determine the theme concepts of the Chinese text. Second, it extracts the topic sentences [6] by calculating the importance of each sentence [7]. Finally, it reduces the features of these topic sentences again using IQR (Improved Quick Reduct Algorithm) and constructs the vector. Viewed from the perspective of the whole information retrieval system, this method saves time in both automatic abstracting and reduction. © Springer-Verlag Berlin Heidelberg 2012.
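For orientation, the final reduction step can be pictured with the classic Quick Reduct greedy search from rough set theory. The abstract does not say how IQR differs from it, so the sketch below is the textbook baseline and not the authors' algorithm. The function names, the row layout (one tuple of discrete feature values per topic sentence or document), and the toy data are all assumptions made for illustration.

```python
from collections import defaultdict


def partition(rows, attrs):
    """Group row indices into equivalence classes by their values on attrs."""
    blocks = defaultdict(list)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].append(i)
    return list(blocks.values())


def dependency(rows, labels, attrs):
    """Dependency degree: share of rows whose equivalence class has one label."""
    if not rows:
        return 0.0
    consistent = 0
    for block in partition(rows, attrs):
        if len({labels[i] for i in block}) == 1:
            consistent += len(block)
    return consistent / len(rows)


def quick_reduct(rows, labels, n_attrs):
    """Greedily add the attribute that raises the dependency degree the most."""
    all_attrs = list(range(n_attrs))
    target = dependency(rows, labels, all_attrs)
    reduct = []
    current = dependency(rows, labels, reduct)
    while current < target:
        best_attr, best_gamma = None, current
        for a in all_attrs:
            if a in reduct:
                continue
            gamma = dependency(rows, labels, reduct + [a])
            if gamma > best_gamma:
                best_attr, best_gamma = a, gamma
        if best_attr is None:
            # No single attribute helps; stop rather than loop forever.
            break
        reduct.append(best_attr)
        current = best_gamma
    return reduct


# Hypothetical toy data: each row marks presence (1) or absence (0) of three terms.
rows = [(1, 0, 1), (1, 1, 0), (0, 0, 1), (0, 1, 0)]
labels = ["sports", "sports", "finance", "finance"]
print(quick_reduct(rows, labels, 3))  # [0]
```

Each pass recomputes the dependency degree for every remaining attribute over all rows, which is why the cost grows quickly with the size of the training data; shrinking the input to topic sentences first, as the abstract proposes, reduces the number of features the search has to consider.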
Pages: 496 - 509
Related papers
50 in total
  • [21] A Feature Extraction Method Using Base Phrase and keyword In Chinese Text
    Li, Xin-fu
    Zhao, Lei-lei
    Wu, Li-hong
    2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 680 - +
  • [22] An improved method on rough set theory and application in prediction of pest attack
    Jiang, Q. (45817603@qq.com), 1600, Trade Science Inc, 126,Prasheel Park,Sanjay Raj Farm House,Nr. Saurashtra Unive, Rajkot, Gujarat, 360 005, India (08):
  • [23] Research on Feature Selection/Attribute Reduction Method Based on Rough Set Theory
    Wang, Shi Qiang
    Gao, Cai Yun
    Luo, Chang
    Zheng, Gui Mei
    Zhou, Yan Nian
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY [ICICT-2019], 2019, 154 : 194 - 198
  • [24] Feature Selection of Combining Relieff and Rough Set for Syndrome Classification of Chronic Gastritis in Traditional Chinese Medicine
    Yan, Jianjun
    Chen, Qiyue
    Liu, Guoping
    Lu, Xiong
    Wang, Yiqin
    Guo, Rui
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND ENGINEERING INNOVATION, 2015, 12 : 1233 - 1237
  • [25] A Novel Feature Selection Method With Neighborhood Rough Set and Improved Particle Swarm Optimization
    Feng, Jindong
    Gong, Zengtai
    IEEE ACCESS, 2022, 10 : 33301 - 33312
  • [26] A novel hybrid feature selection method based on rough set and improved harmony search
    H. Hannah Inbarani
    M. Bagyamathi
    Ahmad Taher Azar
    Neural Computing and Applications, 2015, 26 : 1859 - 1880
  • [27] A novel hybrid feature selection method based on rough set and improved harmony search
    Inbarani, H. Hannah
    Bagyamathi, M.
    Azar, Ahmad Taher
    NEURAL COMPUTING & APPLICATIONS, 2015, 26 (08): : 1859 - 1880
  • [28] Feature extraction using rough set theory in service sector application from incremental perspective
    Huang, Chun-Che
    Tseng, Tzu-Liang
    Tang, Chia-Ying
    COMPUTERS & INDUSTRIAL ENGINEERING, 2016, 91 : 30 - 41
  • [29] Apriori and N-gram Based Chinese Text Feature Extraction Method
    王晔
    黄上腾
    Journal of Shanghai Jiaotong University, 2004, (04) : 11 - 14
  • [30] An Improved Rough Set Theory based Feature Selection Approach for Intrusion Detection in SCADA Systems
    Priyanga, S.
    Raman, M. R. Gauthama
    Jagtap, Sujeet S.
    Aswin, N.
    Kirthivasan, Kannan
    Sriram, V. S. Shankar
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) : 3993 - 4003