Web mining with relational clustering

被引:50
|
作者
Runkler, TA
Bezdek, JC
机构
[1] Siemens Corp Technol, D-81730 Munich, Germany
[2] Univ W Florida, Dept Comp Sci, Pensacola, FL 32514 USA
关键词
fuzzy clustering; relational data; keyword extraction; web content mining; web log mining; click stream analysis;
D O I
10.1016/S0888-613X(02)00084-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is an unsupervised learning method that determines partitions and (possibly) prototypes from pattern sets. Sets of numerical patterns can be clustered by alternating optimization (AO) of clustering objective functions or by alternating cluster estimation (ACE). Sets of non-numerical patterns can often be represented numerically by (pairwise) relations. These relational data sets can be clustered by relational AO and by relational ACE (RACE). We consider two kinds of non-numerical patterns provided by the World Wide Web: document contents such as the text parts of web pages, and sequences of web pages visited by particular users, so-called web logs. The analysis of document contents is often called web content mining, and the analysis of log files with web page sequences is called web log mining. For both non-numerical pattern types (text and web page sequences) relational data sets can be automatically generated using the Levenshtein (edit) distance or using graph distances. The prototypes found for text data can be interpreted as keywords that serve for document classification and automatic archiving. The prototypes found for web page sequences can be interpreted as prototypical click streams that indicate typical user interests, and therefore serve as a basis for web content and web structure management. (C) 2002 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:217 / 236
页数:20
相关论文
共 50 条
  • [1] Web mining with relational clustering
    Runkler, T.A. (thomas.runkler@mchp.siemens.de), 1600, Elsevier Inc. (32): : 2 - 3
  • [2] Relational clustering based on a new robust estimator with application to web mining
    Nasraoui, O
    Krishnapuram, R
    Joshi, A
    18TH INTERNATIONAL CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY - NAFIPS, 1999, : 705 - 709
  • [3] Low-complexity fuzzy relational clustering algorithms for Web mining
    Krishnapuram, R
    Joshi, A
    Nasraoui, O
    Yi, LY
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2001, 9 (04) : 595 - 607
  • [4] Clustering for Knowledgeable Web Mining
    Charulatha, B. S.
    Rodrigues, Paul
    Chitralekha, T.
    Rajaraman, Arun
    ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY ALGORITHMS IN ENGINEERING SYSTEMS, VOL 1, 2015, 324 : 491 - 498
  • [5] A probabilistic relational approach for web document clustering
    Fersini, E.
    Messina, E.
    Archetti, F.
    INFORMATION PROCESSING & MANAGEMENT, 2010, 46 (02) : 117 - 130
  • [6] Hybrid clustering with application to web mining
    Xu, Y
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON ACTIVE MEDIA TECHNOLOGY (AMT 2005), 2005, : 574 - 578
  • [7] Rough set clustering for web mining
    Lingras, P
    PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOL 1 & 2, 2002, : 1039 - 1044
  • [8] CLUE: Clustering for Mining Web URLs
    Morichetta, Andrea
    Bocchi, Enrico
    Metwalley, Hassan
    Mellia, Marco
    2016 28TH INTERNATIONAL TELETRAFFIC CONGRESS (ITC 28), VOL 1, 2016, : 286 - 294
  • [9] Clustering for Web information hierarchy mining
    Kao, HY
    Ho, JM
    Chen, MS
    IEEE/WIC INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2003, : 698 - 701
  • [10] Bidirectional hierarchical clustering for web mining
    Yao, ZM
    Choi, B
    IEEE/WIC INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, PROCEEDINGS, 2003, : 620 - 624