Information fusion experiments for text classification

被引：6

作者：

Dasigi, V ^{[1
]}

机构：

[1] So Polytech State Univ, Dept Comp Sci, Marietta, GA 30060 USA

来源：

1998 IEEE INFORMATION TECHNOLOGY CONFERENCE, PROCEEDINGS | 1998年

关键词：

D O I：

10.1109/IT.1998.713373

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper summarizes our recent experiments and results in employing information fusion for automatic classification of free text documents into a given number of categories. We try to characterize this information fusion work in terms of the Joint Directors of Laboratories scheme. The text used in the experiments is taken from the Reuters-22173 collection, which not only comes pre-analyzed, but facilitates training of the neural networks, as well as evaluation of the classification decisions. We use different kinds of feature extractors to derive information from documents, and use neural networks for both learning and fusion. We compare the effectiveness of individual feature extractors in classifying the text with that of information fusion from different interesting combinations of feature extractors. The results indicate that information fusion almost always performs better than the individual feature extractors, and certain combinations seem to do better than the others. Additional parameters can have varying degrees of effectiveness, and remain to be investigated.

引用

页码：23 / 26

页数：4