Intelligent Web-History Based on a Hybrid Clustering Algorithm for Future-Internet Systems

被引:2
|
作者
Marin, Andrei [1 ]
Pop, Florin [1 ]
机构
[1] Univ Politehn Bucuresti, Fac Automat & Comp Sci, Splaiul Independentei 313, Bucharest 060042, Romania
关键词
Web History; Itemset; Web Mining; Semantic Clustering; Fuzzy Algorithms; Genetic Algorithms;
D O I
10.1109/SYNASC.2011.24
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Internet grows larger year by year and makes users to be confronted with large quantities of data that they cannot fully comprehend. The ongoing transition from Web 2.0 to the Semantic Web makes the development of intelligent services with the ability to discern, classify and simplify web information of vital importance. In this paper we present a new model for web-history organizing in order to improve the user action over the Internet. Based on this model we proposed an application, delivered as a Google Chrome browser extension, which organizes the web-history into semantic clusters, providing the user with an easy-to-follow hierarchal structure. The paper covers the main algorithms in the field, offering a comprehensive critical analysis, such as document vectorization, relational clustering, fuzzy and genetic variations and the item-set-based approach. Our work consists of adapting these algorithms to support an ever-increasing set of input data. The result is a hybrid variation that rapidly offers an acceptable solution, which is optimized in time, a quality preserved during the extensive web explorations a user may undergo. A variety of test results is presented in the end, with under-stress behavior and a selection of user experience.
引用
收藏
页码:145 / 152
页数:8
相关论文
共 50 条
  • [41] A hybrid tabu search based clustering algorithm
    Liu, YG
    Liu, Y
    Wang, LB
    Chen, KF
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2005, 3682 : 186 - 192
  • [42] A Parallel Hybrid Web Document Clustering Algorithm and its Performance Study
    Shuting Xu
    Jun Zhang
    The Journal of Supercomputing, 2004, 30 : 117 - 131
  • [43] A parallel hybrid web document clustering algorithm and its performance study
    Xu, ST
    Zhang, J
    JOURNAL OF SUPERCOMPUTING, 2004, 30 (02): : 117 - 131
  • [44] Intelligent Web Services System based on matchmaking algorithm
    Choi, Okkyung
    Moon, Seong Hwan
    Han, Sangyong
    Abraham, Ajith
    WSEAS Transactions on Circuits and Systems, 2006, 5 (08): : 1166 - 1172
  • [45] A New Web Text Clustering Algorithm Based on DFSSM
    Yang, Bingru
    Song, Zefeng
    Wang, Yinglong
    Song, Wei
    PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 27 - 32
  • [46] Fuzzy Set Based Clustering Algorithm of Web Text
    Wan, Hongxin
    Peng, Yun
    ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING III, 2014, 678 : 19 - +
  • [47] WTCA: A Web Text Clustering Algorithm Based on DFSSM
    Zheng, Yu
    Rong, Qian
    PROCEEDINGS OF THE 27TH CHINESE CONTROL CONFERENCE, VOL 5, 2008, : 811 - +
  • [48] A fuzzy-based algorithm for Web document clustering
    Friedman, M
    Kandel, A
    Schneider, M
    Last, M
    Shapira, B
    Elovici, Y
    Zaafrany, O
    NAFIPS 2004: ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1AND 2: FUZZY SETS IN THE HEART OF THE CANADIAN ROCKIES, 2004, : 524 - 527
  • [49] A web document clustering algorithm based on concept of neighbor
    Song, JC
    Shen, JY
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 46 - 50
  • [50] Algorithm of Web Session Clustering Based on Increase of Similarities
    Li, Chaofeng
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT, INNOVATION MANAGEMENT AND INDUSTRIAL ENGINEERING, VOL II, 2008, : 316 - 319