Feature Competitive Algorithm for Dimension Reduction of the Self-Organizing Map Input Space

被引:0
作者
Huilin Ye
Bruce W.N. Lo
机构
[1] The University of Newcastle Callaghan,Department of Computer Science and Software Engineering
[2] Southern Cross University,School of Multimedia and Information Technology
来源
Applied Intelligence | 2000年 / 13卷
关键词
self-organizing map; dimension reduction; document classification; software classification and retrieval;
D O I
暂无
中图分类号
学科分类号
摘要
The self-organizing map (SOM) can classify documents by learning about their interrelationships from its input data. The dimensionality of the SOM input data space based on a document collection is generally high. As the computational complexity of the SOM increases in proportion to the dimension of its input space, high dimensionality not only lowers the efficiency of the initial learning process but also lowers the efficiencies of the subsequent retrieval and the relearning process whenever the input data is updated. A new method called feature competitive algorithm (FCA) is proposed to overcome this problem. The FCA can capture the most significant features that characterize the underlying interrelationships of the entities in the input space to form a dimensionally reduced input space without excessively losing of essential information about the interrelationships. The proposed method was applied to a document collection, consisting of 97 UNIX command manual pages, to test its feasibility and effectiveness. The test results are encouraging. Further discussions on several crucial issues about the FCA are also presented.
引用
收藏
页码:215 / 230
页数:15
相关论文
共 21 条
  • [1] Ritter H.(1989)Self-organizing semantic maps Biological Cybernetics 61 241-254
  • [2] Kohonen T.(1997)A graphical, selforganizing approach to classifying electronic meeting output Journal of the American Society for Information Science 48 157-170
  • [3] Orwig R.E.(1998)Feature selection and effective classifiers Journal of the American Society for Information Science 49 423-434
  • [4] Chen H.(1994)Learning Boolean concepts in the presence of many irrelevant features Artificial Intelligence 69 279-305
  • [5] Nunamaker J.F.(1990)Indexing by latent semantic analysis Journal of the American Society for Information Science 41 391-407
  • [6] Deogun J.S.(1998)Incremental feature selection Applied Intelligence 9 217-230
  • [7] Choubey S.K.(1994)Automated learning of decision rules for text categorization ACM Transactions on Information Systems 12 233-251
  • [8] Raghavan V.V.(undefined)undefined undefined undefined undefined-undefined
  • [9] Sever H.(undefined)undefined undefined undefined undefined-undefined
  • [10] Almuallim H.(undefined)undefined undefined undefined undefined-undefined