Divergence-Based Vector Quantization

Cited by: 53
Authors
Villmann, Thomas [1]
Haase, Sven [1]
Affiliations
[1] Univ Appl Sci Mittweida, Dept Math Nat & Comp Sci, D-09648 Mittweida, Germany
Keywords
FUZZY CLASSIFICATION; INFORMATION; ROBUST
DOI
10.1162/NECO_a_00110
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Supervised and unsupervised vector quantization methods for classification and clustering traditionally use dissimilarities, frequently taken as Euclidean distances. In this article, we investigate the applicability of divergences instead, focusing on online learning. We deduce the mathematical fundamentals for their use in gradient-based online vector quantization algorithms. This rests on the generalized derivatives of the divergences, known as Fréchet derivatives in functional analysis, which reduce in finite-dimensional problems to partial derivatives in a natural way. We demonstrate the application of this methodology for widely applied supervised and unsupervised online vector quantization schemes, including self-organizing maps, neural gas, and learning vector quantization. Additionally, principles for hyperparameter optimization and relevance learning for parameterized divergences in the case of supervised vector quantization are given to achieve improved classification accuracy.
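
As a rough illustration of the idea summarized in the abstract, the sketch below replaces the Euclidean distance in a plain winner-takes-all online vector quantization step with a divergence and its partial derivatives. It is not the authors' implementation: the choice of the generalized Kullback-Leibler divergence, the function names, the learning rate `lr`, and the positivity `floor` are all assumptions made for this example.

```python
import numpy as np


def gkl_divergence(p, w):
    """Generalized Kullback-Leibler divergence D(p || w) for positive vectors."""
    return float(np.sum(p * np.log(p / w) - p + w))


def gkl_gradient(p, w):
    """Partial derivatives of D(p || w) with respect to the prototype w."""
    return 1.0 - p / w


def online_vq_step(prototypes, p, lr=0.05, floor=1e-8):
    """One online step: pick the closest prototype under the divergence,
    then move it along the negative gradient of the divergence.

    `lr` and `floor` are illustrative values (assumptions); the floor only
    keeps prototype entries strictly positive so the logarithm stays defined.
    """
    dists = np.array([gkl_divergence(p, w) for w in prototypes])
    winner = int(np.argmin(dists))
    updated = prototypes.copy()
    step = lr * gkl_gradient(p, updated[winner])
    updated[winner] = np.maximum(updated[winner] - step, floor)
    return updated, winner


# Usage: two prototypes and one positive data vector (toy values).
prototypes = np.array([[1.0, 1.0, 1.0], [4.0, 4.0, 4.0]])
p = np.array([1.2, 0.9, 1.1])
new_prototypes, winner = online_vq_step(prototypes, p)
```

Only the winning prototype is adapted here; schemes such as neural gas or self-organizing maps would additionally weight the update by a neighborhood function, which this sketch leaves out.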
Pages: 1343-1392
Page count: 50