Audiovisual synchronization and fusion using canonical correlation analysis

被引:127
|
作者
Sargin, Mehmet Entre [1 ]
Yemez, Yuecel
Erzin, Engin
Tekalp, A. Murat
机构
[1] Koc Univ, Dept Comp Engn, Istanbul, Turkey
[2] Koc Univ, Dept Elect & Elect Engn, Istanbul, Turkey
关键词
audiovisual synchronization; correlation; multimodal; fusion; speaker recognition;
D O I
10.1109/TMM.2007.906583
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is well-known that early integration (also called data fusion) is effective when the modalities are correlated, and late integration (also called decision or opinion fusion) is optimal when modalities are uncorrelated. In this paper, we propose a new multimodal fusion strategy for open-set speaker identification using a combination of early and late integration following canonical correlation analysis (CCA) of speech and lip texture features. We also propose a method for high precision synchronization of the speech and lip features using CCA prior to the proposed fusion. Experimental results show that i) the proposed fusion strategy yields the best equal error rates (EER), which are used to quantify the performance of the fusion strategy for open-set speaker identification, and ii) precise synchronization prior to fusion improves the EER; hence, the best EER is obtained when the proposed synchronization scheme is employed together with the proposed fusion strategy. We note that the proposed fusion strategy outperforms others because the features used in the late integration are truly uncorrelated, since they are output of the CCA analysis.
引用
收藏
页码:1396 / 1403
页数:8
相关论文
共 50 条
  • [1] Canonical Correlation Analysis for Data Fusion in Multimodal Emotion Recognition
    Nemati, Shahla
    2018 9TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2018, : 676 - 681
  • [2] Feature Fusion for Multimodal Emotion Recognition Based on Deep Canonical Correlation Analysis
    Zhang, Ke
    Li, Yuanqing
    Wang, Jingyu
    Wang, Zhen
    Li, Xuelong
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1898 - 1902
  • [3] Discriminative Multiple Canonical Correlation Analysis For Multi-Feature Information Fusion
    Gao, Lei
    Qi, Lin
    Chen, Enqing
    Guan, Ling
    2012 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2012, : 36 - 43
  • [4] Facial Expression Recognition Using Improved Canonical Correlation Analysis
    Gang, Lei
    Yong, Zhang
    ADVANCES IN CIVIL ENGINEERING, PTS 1-6, 2011, 255-260 : 2183 - 2187
  • [5] Face and Iris Wavelet Feature Fusion through Canonical Correlation Analysis for Person Identification
    Angadi, Shanmukhappa A.
    Kagawade, Vishwanath C.
    2018 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT - 2018), 2018, : 172 - 178
  • [6] A Feature-Based Fusion Method for Making Group Inference in Epileptic fMRI and DTI using Canonical Correlation Analysis
    Riazi, Amir Hosein
    Soltanian-Zadeh, Hamid
    Hossein-Zadeh, Gholam-Ali
    2014 22nd Iranian Conference on Electrical Engineering (ICEE), 2014, : 1888 - 1891
  • [7] A Survey on Canonical Correlation Analysis
    Yang, Xinghao
    Liu, Weifeng
    Liu, Wei
    Tao, Dacheng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (06) : 2349 - 2368
  • [8] Locality Discriminative Canonical Correlation Analysis For Kinship Verification
    Lei, Xiaohui
    Li, Bo
    Xie, Jing
    PROCEEDINGS OF THE 2017 12TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2017, : 1870 - 1874
  • [9] Discriminative Learning for Alzheimer's Disease Diagnosis via Canonical Correlation Analysis and Multimodal Fusion
    Lei, Baiying
    Chen, Siping
    Ni, Dong
    Wang, Tianfu
    FRONTIERS IN AGING NEUROSCIENCE, 2016, 8
  • [10] Human Action Recognition Using Hybrid Centroid Canonical Correlation Analysis
    El Madany, Nour El Din
    He, Yifeng
    Guan, Ling
    2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 205 - 210