Clustering by sparse orthogonal NMF and interpretable neural network

被引:3
作者
Gai, Yongwei [1 ]
Liu, Jinglei [1 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Differentiable programming; Interpretability; Neural clustering; Sparse orthogonal NMF; Transparent neural networks; NONNEGATIVE MATRIX FACTORIZATION;
D O I
10.1007/s00530-023-01187-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Employing differentiable reconstruction to interpret the clustering layers of neural networks presents a potent solution to the interpretability challenges encountered when applying neural networks within the multimedia sector. However, most of the existing approaches proposed for interpretability of machine learning make post hoc interpretations of neural networks. While existing approaches have made progress in post hoc interpretation, they still fall short in providing clear explanations for black-box models. There are two main challenges: the first is how we should design an interpretable neural network using sparse orthogonal non-negative matrix factorization (NMF) instead of performing post hoc interpretation. The second is how to implement sparse orthogonal NMF and perform clustering using the designed neural network. To solve these two problems, we address this issue by designing a New Interpretable Neural Network (NINN) that uses sparse orthogonal NMF as a basis for its architecture. We innovate in three aspects: first, NINN is a transparent neural network model with greater interpretability in the clustering layer. Second, NINN automatically performs gradient-based optimization of the parameters in the program and has the advantage of end-to-end learning. Third, since NINN is a differentiable reformulation of sparse orthogonal NMF, NINN inherits many properties of both NMF and neural networks, such as parallel solving, online clustering, dimensionality reduction, and unsupervised learning. Experimental evidence demonstrates that the NINN outperforms a myriad of conventional clustering algorithms, such as k-means and NMF, as well as some of the latest models, in terms of clustering performance.
引用
收藏
页码:3341 / 3356
页数:16
相关论文
共 46 条
  • [1] Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
    Barredo Arrieta, Alejandro
    Diaz-Rodriguez, Natalia
    Del Ser, Javier
    Bennetot, Adrien
    Tabik, Siham
    Barbado, Alberto
    Garcia, Salvador
    Gil-Lopez, Sergio
    Molina, Daniel
    Benjamins, Richard
    Chatila, Raja
    Herrera, Francisco
    [J]. INFORMATION FUSION, 2020, 58 : 82 - 115
  • [2] Network Dissection: Quantifying Interpretability of Deep Visual Representations
    Bau, David
    Zhou, Bolei
    Khosla, Aditya
    Oliva, Aude
    Torralba, Antonio
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3319 - 3327
  • [3] Bera S. K., 2019, Advances in Neural Information Processing Systems, P4955
  • [4] Bilen H, 2015, PROC CVPR IEEE, P1081, DOI 10.1109/CVPR.2015.7298711
  • [5] Fuzzy clustering in community detection based on nonnegative matrix factorization with two novel evaluation criteria
    Binesh, Neda
    Rezghi, Mansoor
    [J]. APPLIED SOFT COMPUTING, 2018, 69 : 689 - 703
  • [6] Bradbury J., 2018, Version 0.2, V5, P14
  • [7] Large Scale Spectral Clustering Via Landmark-Based Sparse Representation
    Cai, Deng
    Chen, Xinlei
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (08) : 1669 - 1680
  • [8] Locally Consistent Concept Factorization for Document Clustering
    Cai, Deng
    He, Xiaofei
    Han, Jiawei
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (06) : 902 - 913
  • [9] A Hybrid Autoencoder Network for Unsupervised Image Clustering
    Chen, Pei-Yin
    Huang, Jih-Jeng
    [J]. ALGORITHMS, 2019, 12 (06)
  • [10] Ding C, 2005, SIAM PROC S, P606