Clustering and classification in structured data domains using Fuzzy Lattice Neurocomputing (FLN)

被引:35
作者
Petridis, V [1 ]
Kaburlasos, VG [1 ]
机构
[1] Aristotelian Univ Salonika, Dept Elect & Comp Engn, GR-54006 Salonika, Greece
关键词
text classification; neural networks; clustering; graphs; framework of fuzzy lattices;
D O I
10.1109/69.917564
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A connectionist scheme, namely, sigma -Fuzzy Lattice Neurocomputing scheme or sigma -FLN for short, which has been introduced in the literature lately for clustering in a lattice data domain, is employed in this work for computing clusters of directed graphs in a master-graph. New tools are presented and used here, including a convenient inclusion measure function for clustering graphs. A directed graph is treated by sigma -FLN as a single datum in the mathematical lattice of subgraphs stemming from a master-graph. A series of experiments is detailed where the master-graph emanates from a Thesaurus of spoken language synonyms. The words of the Thesaurus are fed to sigma -FLN in order to compute clusters of semantically related words, namely, hyperwords. The arithmetic parameters of sigma -FLN can be adjusted so as to calibrate the total number of hyperwords computed in a specific application. It is demonstrated how the employment of hyperwords implies a reduction, based on the a priori knowledge of semantics contained in the Thesaurus, in the number of features to be used for document classification. In a series of comparative experiments for document classification, it appears that the proposed method favorably improves classification accuracy in problems involving longer documents, whereas performance deteriorates in problems involving short documents.
引用
收藏
页码:245 / 260
页数:16
相关论文
共 34 条
[1]  
[Anonymous], 1997, INT J COMPUTERS THEI
[2]  
BAKER LD, 1998, P 21 ANN INT ACM SIG
[3]  
Birkhoff G., 1967, LATTICE THEORY, V25
[4]   FUZZY ART - FAST STABLE LEARNING AND CATEGORIZATION OF ANALOG PATTERNS BY AN ADAPTIVE RESONANCE SYSTEM [J].
CARPENTER, GA ;
GROSSBERG, S ;
ROSEN, DB .
NEURAL NETWORKS, 1991, 4 (06) :759-771
[5]   A MASSIVELY PARALLEL ARCHITECTURE FOR A SELF-ORGANIZING NEURAL PATTERN-RECOGNITION MACHINE [J].
CARPENTER, GA ;
GROSSBERG, S .
COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1987, 37 (01) :54-115
[6]   Enabling concept-based relevance feedback for information retrieval on the WWW [J].
Chang, CH ;
Hsu, CC .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1999, 11 (04) :595-609
[7]  
COHEN WW, 1996, 1996 AAAI P 13 NAT C
[8]   Support vector machines for spam categorization [J].
Drucker, H ;
Wu, DH ;
Vapnik, VN .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05) :1048-1054
[9]   A general framework for adaptive processing of data structures [J].
Frasconi, P ;
Gori, M ;
Sperduti, A .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (05) :768-786
[10]  
Georgiou P, 1996, INT J ONCOL, V9, P9