Clustering-Friendly Representation Learning for Enhancing Salient Features

被引:0
|
作者
Oshima, Toshiyuki [1 ]
Takagi, Kentaro [1 ]
Nakata, Kouta [1 ]
机构
[1] Toshiba Co Ltd, Corp R&D Ctr, 1 Komukai Toshiba Cho,Saiwai Ku, Kawasaki, Kanagawa, Japan
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I, PAKDD 2024 | 2024年 / 14645卷
关键词
D O I
10.1007/978-981-97-2242-6_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, representation learning with contrastive learning algorithms has been successfully applied to challenging unlabeled datasets. However, these methods are unable to distinguish important features from unimportant ones under simply unsupervised settings, and definitions of importance vary according to the type of downstream task or analysis goal, such as the identification of objects or backgrounds. In this paper, we focus on unsupervised image clustering as the downstream task and propose a representation learning method that enhances features critical to the clustering task. We extend a clustering-friendly contrastive learning method and incorporate a contrastive analysis approach, which utilizes a reference dataset to separate important features from unimportant ones, into the design of loss functions. Conducting an experimental evaluation of image clustering for three datasets with characteristic backgrounds, we show that for all datasets, our method achieves higher clustering scores compared with conventional contrastive analysis and deep clustering methods.
引用
收藏
页码:209 / 220
页数:12
相关论文
共 50 条
  • [1] Unsupervised discriminative feature learning via finding a clustering-friendly embedding space
    Cao, Wenming
    Zhang, Zhongfan
    Liu, Cheng
    Li, Rui
    Jiao, Qianfen
    Yu, Zhiwen
    Wong, Hau-San
    PATTERN RECOGNITION, 2022, 129
  • [2] Supervised Clustering Loss for Clustering-Friendly Sentence Embeddings: an Application to Intent Clustering
    Barnabo, Giorgio
    Uva, Antonio
    Pollastrini, Sandro
    Rubagotti, Chiara
    Bernardi, Davide
    13TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING AND THE 3RD CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, IJCNLP-AACL 2023, 2023, : 412 - 430
  • [3] Towards Clustering-friendly Representations: Subspace Clustering via Graph Filtering
    Ma, Zhengrui
    Kang, Zhao
    Luo, Guangchun
    Tian, Ling
    Chen, Wenyu
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3081 - 3089
  • [4] Learning clustering-friendly representations via partial information discrimination and cross-level interaction
    Zhang, Hai-Xin
    Huang, Dong
    Ling, Hua-Bao
    Sun, Weijun
    Wen, Zihao
    NEURAL NETWORKS, 2024, 180
  • [5] Salient and consensus representation learning based incomplete multiview clustering
    Zhao, Shuping
    Cui, Zhongwei
    Wu, Lian
    Xu, Yong
    Zuo, Yu
    Fei, Lunke
    APPLIED INTELLIGENCE, 2023, 53 (03) : 2723 - 2737
  • [6] Salient and consensus representation learning based incomplete multiview clustering
    Shuping Zhao
    Zhongwei Cui
    Lian Wu
    Yong Xu
    Yu Zuo
    Lunke Fei
    Applied Intelligence, 2023, 53 : 2723 - 2737
  • [7] Word Representation With Salient Features
    Zhang, Ming
    Palade, Vasile
    Wang, Yan
    Ji, Zhicheng
    IEEE ACCESS, 2019, 7 : 30157 - 30173
  • [8] Enhancing Graph Representation Learning with Localized Topological Features
    Yan, Zuoyu
    Zhao, Qi
    Ye, Ze
    Ma, Tengfei
    Gao, Liangcai
    Tang, Zhi
    Wang, Yusu
    Chen, Chao
    JOURNAL OF MACHINE LEARNING RESEARCH, 2025, 26 : 1 - 36
  • [9] Salient features in mammograms using Gabor filters and clustering
    Perconti, P
    Loew, MH
    MEDICAL IMAGING 2004: IMAGE PERCEPTION, OBSERVER PERFORMANCE, AND TECHNOLOGY ASSESSMENT, 2004, 5372 : 72 - 81
  • [10] Salient Subsequence Learning for Time Series Clustering
    Zhang, Qin
    Wu, Jia
    Zhang, Peng
    Long, Guodong
    Zhang, Chengqi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (09) : 2193 - 2207