Learning Neural Models for End-to-End Clustering

被引:7
|
作者
Meier, Benjamin Bruno [1 ,2 ,3 ]
Elezi, Ismail [1 ,2 ,4 ]
Amirian, Mohammadreza [1 ,2 ,5 ]
Duerr, Oliver [1 ,2 ,6 ]
Stadelmann, Thilo [1 ,2 ]
机构
[1] ZHAW Datalab, Winterthur, Switzerland
[2] Sch Engn, Winterthur, Switzerland
[3] ARGUS DATA INSIGHTS Schweiz AG, Zurich, Switzerland
[4] Ca Foscari Univ Venice, Venice, Italy
[5] Ulm Univ, Inst Neural Informat Proc, Ulm, Germany
[6] HTWG Konstanz, Inst Opt Syst, Constance, Germany
来源
ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2018 | 2018年 / 11081卷
关键词
Perceptual grouping; Learning to cluster; Speech & image clustering; RECOGNITION; NETWORK;
D O I
10.1007/978-3-319-99978-4_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel end-to-end neural network architecture that, once trained, directly outputs a probabilistic clustering of a batch of input examples in one pass. It estimates a distribution over the number of clusters k, and for each 1 <= k <= k(max), a distribution over the individual cluster assignment for each data point. The network is trained in advance in a supervised fashion on separate data to learn grouping by any perceptual similarity criterion based on pairwise labels (same/ different group). It can then be applied to different data containing different groups. We demonstrate promising performance on high-dimensional data like images (COIL-100) and speech (TIMIT). We call this "learning to cluster" and show its conceptual difference to deep metric learning, semi-supervise clustering and other related approaches while having the advantage of performing learnable clustering fully end-to-end.
引用
收藏
页码:126 / 138
页数:13
相关论文
共 50 条
  • [1] Neural mixture models with expectation-maximization for end-to-end deep clustering
    Tissera, Dumindu
    Vithanage, Kasun
    Wijesinghe, Rukshan
    Xavier, Alex
    Jayasena, Sanath
    Fernando, Subha
    Rodrigo, Ranga
    NEUROCOMPUTING, 2022, 505 : 249 - 262
  • [2] Robust End-to-end Speaker Diarization with Generic Neural Clustering
    Yang, Chenyu
    Wang, Yu
    INTERSPEECH 2022, 2022, : 1471 - 1475
  • [3] End-to-End Neural Segmental Models for Speech Recognition
    Tang, Hao
    Lu, Liang
    Kong, Lingpeng
    Gimpel, Kevin
    Livescu, Karen
    Dyer, Chris
    Smith, Noah A.
    Renals, Steve
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (08) : 1254 - 1264
  • [4] Neural End-to-End Learning for Computational Argumentation Mining
    Eger, Steffen
    Daxenberger, Johannes
    Gurevych, Iryna
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 11 - 22
  • [5] Neural Dynamic Policies for End-to-End Sensorimotor Learning
    Bahl, Shikhar
    Mukadam, Mustafa
    Gupta, Abhinav
    Pathak, Deepak
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [6] END-TO-END LEARNING OF PARSING MODELS FOR INFORMATION RETRIEVAL
    Gillenwater, Jennifer
    He, Xiaodong
    Gao, Jianfeng
    Deng, Li
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3312 - 3316
  • [7] Learning Diverse Models for End-to-End Ensemble Tracking
    Wang, Ning
    Zhou, Wengang
    Li, Houqiang
    IEEE Transactions on Image Processing, 2021, 30 : 2220 - 2231
  • [8] Learning Diverse Models for End-to-End Ensemble Tracking
    Wang, Ning
    Zhou, Wengang
    Li, Houqiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2220 - 2231
  • [9] TOWARDS END-TO-END SPEAKER DIARIZATION WITH GENERALIZED NEURAL SPEAKER CLUSTERING
    Zhang, Chunlei
    Shi, Jiatong
    Weng, Chao
    Yu, Meng
    Yu, Dong
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8372 - 8376
  • [10] ITERATIVE POLICY LEARNING IN END-TO-END TRAINABLE TASK-ORIENTED NEURAL DIALOG MODELS
    Liu, Bing
    Lane, Ian
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 482 - 489