Learning Neural Models for End-to-End Clustering

被引：7

作者：

Meier, Benjamin Bruno ^{[1
,2
,3
]}

Elezi, Ismail ^{[1
,2
,4
]}

Amirian, Mohammadreza ^{[1
,2
,5
]}

Duerr, Oliver ^{[1
,2
,6
]}

Stadelmann, Thilo ^{[1
,2
]}

机构：

[1] ZHAW Datalab, Winterthur, Switzerland

[2] Sch Engn, Winterthur, Switzerland

[3] ARGUS DATA INSIGHTS Schweiz AG, Zurich, Switzerland

[4] Ca Foscari Univ Venice, Venice, Italy

[5] Ulm Univ, Inst Neural Informat Proc, Ulm, Germany

[6] HTWG Konstanz, Inst Opt Syst, Constance, Germany

来源：

ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2018 | 2018年 / 11081卷

关键词：

Perceptual grouping; Learning to cluster; Speech & image clustering; RECOGNITION; NETWORK;

D O I：

10.1007/978-3-319-99978-4_10

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel end-to-end neural network architecture that, once trained, directly outputs a probabilistic clustering of a batch of input examples in one pass. It estimates a distribution over the number of clusters k, and for each 1 <= k <= k(max), a distribution over the individual cluster assignment for each data point. The network is trained in advance in a supervised fashion on separate data to learn grouping by any perceptual similarity criterion based on pairwise labels (same/ different group). It can then be applied to different data containing different groups. We demonstrate promising performance on high-dimensional data like images (COIL-100) and speech (TIMIT). We call this "learning to cluster" and show its conceptual difference to deep metric learning, semi-supervise clustering and other related approaches while having the advantage of performing learnable clustering fully end-to-end.

引用

页码：126 / 138

页数：13

共 50 条

[1] Neural mixture models with expectation-maximization for end-to-end deep clustering
Tissera, Dumindu
Vithanage, Kasun
Wijesinghe, Rukshan
Xavier, Alex
Jayasena, Sanath
Fernando, Subha
Rodrigo, Ranga
NEUROCOMPUTING, 2022, 505 : 249 - 262
[2] Robust End-to-end Speaker Diarization with Generic Neural Clustering
Yang, Chenyu
Wang, Yu
INTERSPEECH 2022, 2022, : 1471 - 1475
[3] End-to-End Neural Segmental Models for Speech Recognition
Tang, Hao
Lu, Liang
Kong, Lingpeng
Gimpel, Kevin
Livescu, Karen
Dyer, Chris
Smith, Noah A.
Renals, Steve
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2017, 11 (08) : 1254 - 1264
[4] Neural End-to-End Learning for Computational Argumentation Mining
Eger, Steffen
Daxenberger, Johannes
Gurevych, Iryna
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 11 - 22
[5] Neural Dynamic Policies for End-to-End Sensorimotor Learning
Bahl, Shikhar
Mukadam, Mustafa
Gupta, Abhinav
Pathak, Deepak
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[6] END-TO-END LEARNING OF PARSING MODELS FOR INFORMATION RETRIEVAL
Gillenwater, Jennifer
He, Xiaodong
Gao, Jianfeng
Deng, Li
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 3312 - 3316
[7] Learning Diverse Models for End-to-End Ensemble Tracking
Wang, Ning
Zhou, Wengang
Li, Houqiang
IEEE Transactions on Image Processing, 2021, 30 : 2220 - 2231
[8] Learning Diverse Models for End-to-End Ensemble Tracking
Wang, Ning
Zhou, Wengang
Li, Houqiang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 2220 - 2231
[9] TOWARDS END-TO-END SPEAKER DIARIZATION WITH GENERALIZED NEURAL SPEAKER CLUSTERING
Zhang, Chunlei
Shi, Jiatong
Weng, Chao
Yu, Meng
Yu, Dong
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8372 - 8376
[10] ITERATIVE POLICY LEARNING IN END-TO-END TRAINABLE TASK-ORIENTED NEURAL DIALOG MODELS
Liu, Bing
Lane, Ian
2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 482 - 489

← 1 2 3 4 5 →