Visualization and Integration of Databases using Self-Organizing Map

被引:2
|
作者
Bourennani, Farid [1 ]
Pu, Ken Q. [1 ]
Zhu, Ying [1 ]
机构
[1] Univ Ontario, Inst Technol, Toronto, ON, Canada
来源
2009 FIRST INTERNATIONAL CONFERENCE ON ADVANCES IN DATABASES, KNOWLEDGE, AND DATA APPLICATIONS | 2009年
关键词
SOM; Common Item Based Classifier (CIBC); Data Integration; Information Retrieval (IR);
D O I
10.1109/DBKDA.2009.30
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the growing computer networks, accessible data is becoming increasingly distributed. Understanding and integrating remote and unfamiliar data sources are important data management issues. In this paper, we propose to utilize self-organizing maps (SOM) clustering to aid with the visualization of similar columns, and integration of relational database tables and attributes based on the content. In order to accommodate heterogeneous data types found in relational databases, we extended the TFIDF measure to handle, in addition to text, numerical attribute types for coincident meaning extraction. We present a SOM clustering based visualization algorithm allowing the user to browse the heterogeneously typed database attributes and discover semantically similar clusters. Additionally, we propose a new algorithm Common Item Based Classifier (CIBC) to smoothen the homogeneity of the clusters obtained by SOM. The discovered semantic clusters can significantly aid in manual or automated constructions of data integrity constraints in data cleaning or schema mappings in data integration.
引用
收藏
页码:155 / 160
页数:6
相关论文
共 50 条
  • [1] ASSOCIATIVE SELF-ORGANIZING MAP
    Johnsson, Magnus
    Balkenius, Christian
    Hesslow, Germund
    IJCCI 2009: PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2009, : 363 - +
  • [2] Essentials of the self-organizing map
    Kohonen, Teuvo
    NEURAL NETWORKS, 2013, 37 : 52 - 65
  • [3] The diffuse self-organizing map
    Wang, Y
    Zeng, CH
    Mei, T
    Liu, WQ
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 3530 - 3535
  • [4] A conditional clustering algorithm using self-organizing map
    Tateyama, T
    Kawata, S
    Ohta, H
    SICE 2003 ANNUAL CONFERENCE, VOLS 1-3, 2003, : 3259 - 3264
  • [5] Using self-organizing feature map for signature verification
    Mautner, P
    Matousek, V
    Marsálek, T
    Soule, M
    Proceedings of the Eighth IASTED International Conference on Artificial Intelligence and Soft Computing, 2004, : 272 - 275
  • [6] Evolutionary mechanisms in self-organizing map
    Weng, SF
    Wong, F
    Zhang, CS
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 2020 - 2024
  • [7] Grid topologies for the self-organizing map
    Lopez-Rubio, Ezequiel
    Diaz Ramos, Antonio
    NEURAL NETWORKS, 2014, 56 : 35 - 48
  • [8] Feature selection for self-organizing map
    Benabdeslem, Khalid
    Lebbah, Mustapha
    PROCEEDINGS OF THE ITI 2007 29TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2007, : 45 - +
  • [9] LazySOM: Image Compression Using an Enhanced Self-Organizing Map
    Tsai, Cheng-Fa
    Lin, Yu-Jiun
    ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PROCEEDINGS, 2009, 5414 : 118 - 129
  • [10] Classifying habitat characteristics of wetlands using a self-organizing map
    Kim, Seong-Hyeon
    Cho, Kwang-Jin
    Kim, Tae -Su
    Lee, Chang-Su
    Dhakal, Thakur
    Jang, Gab-Sue
    ECOLOGICAL INFORMATICS, 2023, 75