Clustering and classification in structured data domains using Fuzzy Lattice Neurocomputing (FLN)

Cited by: 35
Authors
Petridis, V [1]
Kaburlasos, VG [1]
Affiliations
[1] Aristotelian Univ Salonika, Dept Elect & Comp Engn, GR-54006 Salonika, Greece
Keywords
text classification; neural networks; clustering; graphs; framework of fuzzy lattices
DOI
10.1109/69.917564
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A connectionist scheme, namely, σ-Fuzzy Lattice Neurocomputing scheme or σ-FLN for short, which has been introduced in the literature lately for clustering in a lattice data domain, is employed in this work for computing clusters of directed graphs in a master-graph. New tools are presented and used here, including a convenient inclusion measure function for clustering graphs. A directed graph is treated by σ-FLN as a single datum in the mathematical lattice of subgraphs stemming from a master-graph. A series of experiments is detailed where the master-graph emanates from a Thesaurus of spoken language synonyms. The words of the Thesaurus are fed to σ-FLN in order to compute clusters of semantically related words, namely, hyperwords. The arithmetic parameters of σ-FLN can be adjusted so as to calibrate the total number of hyperwords computed in a specific application. It is demonstrated how the employment of hyperwords implies a reduction, based on the a priori knowledge of semantics contained in the Thesaurus, in the number of features to be used for document classification. In a series of comparative experiments for document classification, it appears that the proposed method favorably improves classification accuracy in problems involving longer documents, whereas performance deteriorates in problems involving short documents.
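As a rough illustration of two ideas in the abstract, the sketch below shows (a) an inclusion measure on subgraphs of a master-graph and (b) how mapping words to hyperwords shrinks a document's feature set. It is a minimal sketch under stated assumptions, not the paper's implementation: a subgraph is represented as a set of directed edges, the lattice join is taken to be edge-set union, the valuation is taken to be the edge count, and the measure is assumed to have the form v(w) / v(u ∨ w). The function names `inclusion` and `hyperword_features`, the toy edges, and the `SIZE`/`SPEED` hyperword labels are all hypothetical.

```python
from collections import Counter
from typing import Dict, FrozenSet, Iterable, Tuple

Edge = Tuple[str, str]      # directed edge (source word, target word)
Graph = FrozenSet[Edge]     # a subgraph of the master-graph, stored as an edge set


def inclusion(u: Graph, w: Graph) -> float:
    """Assumed inclusion measure: degree in [0, 1] to which u is contained in w.

    Computes v(w) / v(u join w), with the join taken as edge-set union and the
    valuation v as the edge count. Both choices are assumptions of this sketch.
    """
    join = u | w
    if not join:            # two empty graphs: treat as full inclusion
        return 1.0
    return len(w) / len(join)


def hyperword_features(tokens: Iterable[str],
                       word_to_hyperword: Dict[str, str]) -> Counter:
    """Count hyperword occurrences; words outside the mapping are ignored."""
    return Counter(word_to_hyperword[t] for t in tokens if t in word_to_hyperword)


# Hypothetical toy data, not taken from the paper's Thesaurus.
small = frozenset({("big", "large")})
large = frozenset({("big", "large"), ("big", "huge")})

# Hypothetical word-to-hyperword assignment, as a clustering step might produce.
mapping = {"big": "SIZE", "large": "SIZE", "huge": "SIZE",
           "fast": "SPEED", "quick": "SPEED"}
features = hyperword_features("a big and large dog runs fast".split(), mapping)
```

In this toy case `inclusion(small, large)` is 1.0 because `small` is a subgraph of `large`, while the reverse direction gives 0.5. The five thesaurus words collapse into two hyperword features, which is the kind of feature reduction the abstract describes.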
Pages: 245-260
Page count: 16