Online video scene clustering by competitive incremental NMF

Cited by: 3
Authors
Bucak, Serhat Selcuk [1]
Gunsel, Bilge [2]
Affiliations
[1] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
[2] Istanbul Tech Univ, Dept Elect & Commun Engn, Multimedia Signal Proc & Pattern Recognit Lab, TR-34469 Maslak, Turkey
Keywords
Online video segmentation; Unsupervised video clustering; Matrix factorization;
DOI
10.1007/s11760-011-0264-2
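Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic and communication technology];
Discipline classification code
0808; 0809;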
Abstract
Efficient clustering and categorization of video are becoming increasingly vital in applications such as video summarization and content-based representation. The large volume of video data is the biggest challenge of this task, because most clustering techniques suffer in both accuracy and efficiency on high-dimensional data. In addition, most video applications require online processing, so clustering must also be done online. This paper presents an online video scene clustering/segmentation method based on incremental nonnegative matrix factorization (INMF), which has been shown to be a powerful content representation tool for high-dimensional data. The proposed algorithm (Comp-INMF) enables online representation of video content and increases efficiency significantly by integrating a competitive learning scheme into INMF. It brings a systematic solution to the issue of rank selection in nonnegative matrix factorization, which is equivalent to specifying the number of clusters. Clustering performance is evaluated on TRECVID video sequences, and a comparison to baseline methods including Adaptive Resonance Theory (ART) is provided to demonstrate the efficiency and efficacy of the proposed video clustering scheme. Results reported in terms of recall, precision and F1 measures show that the labeling accuracy of the algorithm is notable, especially at edit effect regions, which are a challenging point in video analysis.
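The abstract does not state the update rules of Comp-INMF, so the following is only a rough illustration of the general idea it builds on: factorize a nonnegative feature matrix and read cluster labels off the encoding matrix, with the factorization rank playing the role of the number of clusters. The sketch uses standard batch NMF with multiplicative updates, not the incremental or competitive scheme of the paper. The function names `nmf` and `cluster_labels` and the toy data are assumptions made for illustration.

```python
import numpy as np

def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Batch NMF by multiplicative updates: V (features x frames) ~ W @ H.

    Illustrative only; this is NOT the incremental/competitive variant
    described in the abstract.
    """
    rng = np.random.default_rng(seed)
    n_features, n_frames = V.shape
    W = rng.random((n_features, rank)) + eps   # basis vectors (columns)
    H = rng.random((rank, n_frames)) + eps     # per-frame encodings
    for _ in range(n_iter):
        # Multiplicative updates keep W and H nonnegative.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def cluster_labels(H):
    """Assign each frame to the basis vector with the largest coefficient."""
    return np.argmax(H, axis=0)

# Toy data (hypothetical): 8-dimensional features for 10 frames,
# where frames 0-4 and frames 5-9 activate disjoint feature sets.
V = np.zeros((8, 10))
V[:4, :5] = 1.0
V[4:, 5:] = 1.0

W, H = nmf(V, rank=2, n_iter=500)
labels = cluster_labels(H)
print(labels)
```

Here the rank is fixed by hand to 2; the abstract's point is that Comp-INMF addresses rank selection systematically rather than requiring this choice up front.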
Pages: 723-739
Page count: 17
Related papers
50 in total
  • [21] Online Appearance Manifold Learning for Video Classification and Clustering
    Yang, Li
    Wang, Xiaokun
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2016, PT II, 2016, 9787 : 551 - 561
  • [22] Video summaries through mosaic-based shot and scene clustering
    Aner, A
    Kender, JR
    COMPUTER VISION - ECCV 2002, PT IV, 2002, 2353 : 388 - 402
  • [23] A Description Scheme for Video Overview Based on Scene Detection and Face Clustering
    Tang, Boyuan
    Chen, Weiting
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (01)
  • [24] Deep Convex NMF for Image Clustering
    Qian, Bin
    Shen, Xiaobo
    Tang, Zhenmin
    Zhang, Tao
    BIOMETRIC RECOGNITION, 2016, 9967 : 583 - 590
  • [25] Refinement of document clustering by using NMF
    Shinnou, Hiroyuki
    Sasaki, Minoru
    PACLIC 21 - The 21st Pacific Asia Conference on Language, Information and Computation, Proceedings, 2007, : 430 - 439
  • [26] Online Incremental Clustering with Distance Metric Learning for High Dimensional Data
    Okada, Shogo
    Nishida, Toyoaki
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 2047 - 2054
  • [27] Incremental Scene Synthesis
    Planche, Benjamin
    Rong, Xuejian
    Wu, Ziyan
    Karanam, Srikrishna
    Kosch, Harald
    Tian, YingLi
    Ernst, Jan
    Hutter, Andreas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [28] A Gaussian process-based Incremental Neural Network for Online Clustering
    Wang, Xiaoyu
    Imura, Jun-ichi
    4TH IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2019) / 3RD INTERNATIONAL SYMPOSIUM ON REINFORCEMENT LEARNING (ISRL 2019), 2019, : 143 - 148
  • [29] Adaptive online solder joint inspection algorithm based on incremental clustering
    Xie, H.
    Zhang, X.
    ELECTRONICS LETTERS, 2011, 47 (15) : 850 - U1932
  • [30] Refinement of Document Clustering by Using NMF
    Shinnou, Hiroyuki
    Sasaki, Minoru
    PACLIC 21: THE 21ST PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, PROCEEDINGS, 2007, : 430 - 439