Incremental learning patch-based bag of facial words representation for face recognition in videos

被引:0
|
作者
Chao Wang
Yunhong Wang
Zhaoxiang Zhang
Yiding Wang
机构
[1] Beihang University,Laboratory of Intelligent Recognition and Image Processing, Beijing Key Laboratory of Digital Media, School of Computer Science and Engineering
[2] North China University of Technology,School of Information Engineering
来源
Multimedia Tools and Applications | 2014年 / 72卷
关键词
Video analysis; Face recognition; Biometrics; Incremental learning; Bag of words;
D O I
暂无
中图分类号
学科分类号
摘要
Video-based face recognition is a fundamental topic in image processing and video analysis, and presents various challenges and opportunities. In this paper, we introduce an incremental learning approach to video-based face recognition which efficiently exploits the spatiotemporal information in videos. Face image sequences are incrementally clustered based on their descriptors, and the representative face images of each cluster are picked out. The incremental algorithm of creating facial visual words is applied to construct a codebook using the descriptors of the representative face images. Continuously, with the quantization of the facial visual words, each descriptor extracted from patches is converted into codes, and codes from each region are pooled together into a histogram. The representation of the face image is generated by concatenating the histograms from all regions, which is employed to perform the categorization. In the online recognition, a similarity score matrix and a voting algorithm are employed to judge a face video’s identity. Recognition is performed online while face video sequence is continuous and the proposed method gives nearly realtime feedback. The proposed method achieves a 100 % verification rate on the Honda/UCSD database and 82 % on the YouTube datebase. Experimental results demonstrate the effectiveness and flexibility of the proposed method.
引用
收藏
页码:2439 / 2467
页数:28
相关论文
共 50 条
  • [1] Incremental learning patch-based bag of facial words representation for face recognition in videos
    Wang, Chao
    Wang, Yunhong
    Zhang, Zhaoxiang
    Wang, Yiding
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 72 (03) : 2439 - 2467
  • [2] Patch-based locality-enhanced collaborative representation for face recognition
    Ding, Ru-Xi
    Huang, He
    Shang, Jin
    IET IMAGE PROCESSING, 2015, 9 (03) : 211 - 217
  • [3] Tracking and recognition face in videos with incremental local sparse representation model
    Wang, Chao
    Wang, Yunhong
    Zhang, Zhaoxiang
    OPTICAL ENGINEERING, 2013, 52 (10)
  • [4] PATCH-BASED FACE RECOGNITION FROM VIDEO
    Hu, Changbo
    Harguess, Josh
    Aggarwal, J. K.
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 3321 - 3324
  • [5] Patch-based Face Recognition under Plastic Surgery
    Khedgaonkar, Roshni
    Raghuwanshi, M. M.
    Singh, K. R.
    2018 FIRST INTERNATIONAL CONFERENCE ON SECURE CYBER COMPUTING AND COMMUNICATIONS (ICSCCC 2018), 2018, : 364 - 368
  • [6] Face recognition with Patch-based Local Walsh Transform
    Uzun-Per, Meryem
    Gokmen, Muhittin
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 61 : 85 - 96
  • [7] PATCH-BASED ALIGNMENT-FREE GENERIC SPARSE REPRESENTATION FOR POSE ROBUST FACE RECOGNITION
    Gu, Jianquan
    Hu, Haifeng
    Li, Haoxi
    Hu, Weipeng
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 3006 - 3010
  • [8] Resource-Allocating Codebook for Patch-based Face Recognition
    Ramanan, Amirthalingam
    Niranjan, Mahesan
    2009 INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, 2009, : 268 - 271
  • [9] Face recognition by incremental learning
    Huang, WM
    Lee, BH
    Li, LY
    Leman, K
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 4718 - 4723
  • [10] Face Recognition Using the Improved Bag of Words Model
    Li, Xiao-cui
    Zhao, Chun-hui
    Cang, Yan
    2013 THIRD INTERNATIONAL CONFERENCE ON INSTRUMENTATION & MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC), 2013, : 772 - 775