Towards exploring interactive relationship between clusters and outliers in multi-dimensional data analysis

Authors
Shi, Y [1]
Zhang, AD [1]
Affiliation
[1] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
Source
ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS | 2005
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Many data mining algorithms focus on clustering, and many others are designed for outlier detection. We observe that, in many situations, clusters and outliers are concepts whose meanings are inseparable from each other, especially for data sets with noise. It is therefore necessary to treat clusters and outliers as concepts of equal importance in data analysis. In this paper we present a cluster-outlier iterative detection algorithm that detects clusters and outliers in noisy data sets from a different perspective. In this algorithm, clusters are detected and adjusted according to the intra-relationship within clusters and the inter-relationship between clusters and outliers, and vice versa. The clusters and outliers are adjusted and modified iteratively until a termination condition is reached. The algorithm can be applied in many fields, such as pattern recognition, data clustering, and signal processing. Experimental results demonstrate the advantages of our approach.
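
The abstract describes the iteration only in general terms, so the following is a minimal Python sketch of one way such a loop could look, not the authors' algorithm. The function name `cluster_outlier_iterate`, the parameter `outlier_factor`, the median-distance outlier rule, and the "nothing changed" termination condition are all assumptions made for illustration. Each round reassigns points to the nearest centroid, re-flags outliers against a cluster scale estimated from the current inliers, and refits centroids from inliers only, so cluster membership and outlier status adjust each other.

```python
import numpy as np

def cluster_outlier_iterate(points, init_centroids, outlier_factor=3.0, max_iter=50):
    """Alternate between adjusting clusters and re-flagging outliers.

    Hypothetical sketch: the distance-based rules here are assumptions,
    not the criteria defined in the paper.
    """
    points = np.asarray(points, dtype=float)
    centroids = np.asarray(init_centroids, dtype=float).copy()
    labels = np.full(len(points), -1)            # no assignment yet
    is_outlier = np.zeros(len(points), dtype=bool)

    for _ in range(max_iter):
        # Distance from every point to every centroid, shape (n, k).
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        new_labels = dists.argmin(axis=1)
        nearest = dists.min(axis=1)

        # Re-flag outliers: a point is an outlier if it lies far from its
        # centroid relative to the median distance of the current inliers.
        new_outlier = np.zeros(len(points), dtype=bool)
        for c in range(len(centroids)):
            members = new_labels == c
            inliers = members & ~is_outlier
            if not inliers.any():
                continue
            scale = np.median(nearest[inliers])
            new_outlier |= members & (nearest > outlier_factor * scale)

        # Adjust clusters: refit each centroid from its inliers only.
        for c in range(len(centroids)):
            inliers = (new_labels == c) & ~new_outlier
            if inliers.any():
                centroids[c] = points[inliers].mean(axis=0)

        # Assumed termination condition: labels and outlier flags are stable.
        if np.array_equal(new_labels, labels) and np.array_equal(new_outlier, is_outlier):
            break
        labels, is_outlier = new_labels, new_outlier

    return labels, is_outlier, centroids

# Toy data: two tight groups of four points plus one distant point.
data = [(0, 0), (0, 1), (1, 0), (1, 1),
        (10, 10), (10, 11), (11, 10), (11, 11),
        (5, 30)]
labels, is_outlier, centroids = cluster_outlier_iterate(data, [(0, 0), (10, 10)])
print(labels, is_outlier, centroids)
```

Because the outlier flags are recomputed from scratch in every round, a point flagged early can rejoin a cluster later if the centroids move toward it. That is the sense in which clusters and outliers are treated symmetrically in this sketch.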
Pages: 518-519
Page count: 2