Population-Based Hierarchical Non-Negative Matrix Factorization for Survey Data

被引:0
作者
Ding, Xiaofu [1 ]
Dong, Xinyu [1 ]
McGough, Olivia [2 ]
Shen, Chenxin [1 ]
Ulichney, Annie [3 ]
Xu, Ruiyao [1 ]
Swartworth, William [1 ]
Chi, Jocelyn T. [1 ]
Needell, Deanna [1 ]
机构
[1] Univ Calif Los Angeles, Dept Math, Los Angeles, CA 90024 USA
[2] Reed Coll, Dept Math, Portland, OR USA
[3] Yale Univ, Dept Appl Math, New Haven, CT USA
来源
2022 IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES, BDCAT | 2022年
关键词
Non-negative matrix factorization; hierarchical clustering; survey data; latent classes; population structure; CLIMATE-CHANGE; ALGORITHMS; POLCA;
D O I
10.1109/BDCAT56447.2022.00035
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Motivated by the problem of identifying potential hierarchical population structure on modern survey data containing a wide range of complex data types, we introduce population-based hierarchical non-negative matrix factorization (PHNMF). PHNMF is a variant of hierarchical non-negative matrix factorization based on feature similarity. As such, it enables an automatic and interpretable approach for identifying and understanding hierarchical structure in a data matrix constructed from a wide range of data types. Our numerical experiments on synthetic and real survey data demonstrate that PHNMF can recover latent hierarchical population structure in complex data with high accuracy. Moreover, the recovered subpopulation structure is meaningful and can be useful for improving downstream inference.
引用
收藏
页码:184 / 193
页数:10
相关论文
共 50 条
  • [1] Non-negative Matrix Factorization: A Survey
    Gan, Jiangzhang
    Liu, Tong
    Li, Li
    Zhang, Jilian
    COMPUTER JOURNAL, 2021, 64 (07) : 1080 - 1092
  • [2] Non-negative Matrix Factorization for Binary Data
    Larsen, Jacob Sogaard
    Clemmensen, Line Katrine Harder
    2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 555 - 563
  • [3] Topic Splitting: A Hierarchical Topic Model Based on Non-Negative Matrix Factorization
    Liu, Rui
    Wang, Xingguang
    Wang, Deqing
    Zuo, Yuan
    Zhang, He
    Zheng, Xianzhu
    JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2018, 27 (04) : 479 - 496
  • [4] Robust Hierarchical Learning for Non-Negative Matrix Factorization with Outliers
    Li, Yinan
    Sun, Meng
    Van Hamme, Hugo
    Zhang, Xiongwei
    Yang, Jibin
    IEEE ACCESS, 2019, 7 : 10546 - 10558
  • [5] A Survey on Surrogate Approaches to Non-negative Matrix Factorization
    Fernsel P.
    Maass P.
    Vietnam Journal of Mathematics, 2018, 46 (4) : 987 - 1021
  • [6] Topic Splitting: A Hierarchical Topic Model Based on Non-Negative Matrix Factorization
    Rui Liu
    Xingguang Wang
    Deqing Wang
    Yuan Zuo
    He Zhang
    Xianzhu Zheng
    Journal of Systems Science and Systems Engineering, 2018, 27 : 479 - 496
  • [7] A Survey of Polyphonic Sound Event Detection Based on Non-negative Matrix Factorization
    Manh-Quan Bui
    Viet-Hang Duong
    Mathulaprangsan, Seksan
    Bach-Tung Pham
    Lee, Wei-Jing
    Wang, Jia-Ching
    2016 INTERNATIONAL COMPUTER SYMPOSIUM (ICS), 2016, : 351 - 354
  • [8] Non-negative Matrix Factorization: A Short Survey on Methods and Applications
    Huang, Zhengyu
    Zhou, Aimin
    Zhang, Guixu
    COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, 2012, 316 : 331 - 340
  • [9] Lifelong Hierarchical Topic Modeling via Non-negative Matrix Factorization
    Lin, Zhicheng
    Yan, Jiaxing
    Lei, Zhiqi
    Rao, Yanghui
    WEB AND BIG DATA, PT IV, APWEB-WAIM 2023, 2024, 14334 : 155 - 170
  • [10] Probabilistic Sparse Non-negative Matrix Factorization
    Hinrich, Jesper Love
    Morup, Morten
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 488 - 498