Neural networks for data mining electronic text collections

被引:0
作者
Walker, N
Truman, G
机构
来源
APPLICATIONS AND SCIENCE OF ARTIFICIAL NEURAL NETWORKS III | 1997年 / 3077卷
关键词
artificial neural networks; information retrieval; clustering;
D O I
10.1117/12.271490
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of neural networks in information retrieval and text analysis has primarily suffered from the issues of adequate document representation, the ability to scale to very large collections, dynamism in the face of new information and the practical difficulties of basing the design on the use of supervised training sets. Perhaps the most important approach to begin solving these problems is the use of 'intermediate entities' which reduce the dimensionality of document representations and the size of documents collections to manageable levels coupled with the use of unsupervised neural network paradigms. This papers describes the issues, a fully configured neural network-based text analysis system - dataHARVEST - aimed at data mining text collections which begins this process, along with the remaining difficulties and potential ways forward.
引用
收藏
页码:299 / 306
页数:8
相关论文
empty
未找到相关数据