An Optimal Edge-weighted Graph Semantic Correlation Framework for Multi-view Feature Representation Learning

被引:0
作者
Gao, Lei [1 ]
Guo, Zheng [1 ]
Guan, Ling [1 ]
机构
[1] Toronto Metropolitan Univ, 350 Victoria St, Toronto, ON M5B 2K3, Canada
关键词
Multi-view feature representation; graph model; semantic correlation; data visualization; face recognition; emotion recognition; text-image recognition; object recognition; CANONICAL CORRELATION-ANALYSIS; PRESERVING PROJECTIONS; MODEL; SPARSE; CLASSIFICATION; RETRIEVAL;
D O I
10.1145/3649466
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we present an optimal edge-weighted graph semantic correlation (EWGSC) framework for multi-view feature representation learning. Different from most existing multi-view representation methods, local structural information and global correlation in multi-view feature spaces are exploited jointly in the EWGSC framework, leading to a new and high-qualitymulti-view feature representation. Specifically, a novel edge-weighted graph model is first conceptualized and developed to preserve local structural information in each of the multi-view feature spaces. Then, the explored structural information is integrated with a semantic correlation algorithm, labeled multiple canonical correlation analysis (LMCCA), to form a powerful platform for effectively exploiting local and global relations across multi-view feature spaces jointly. We then theoretically verified the relation between the upper limit on the number of projected dimensions and the optimal solution to the multi-view feature representation problem. To validate the effectiveness and generality of the proposed framework, we conducted experiments on five datasets of different scales, including visual-based (University of California Irvine (UCI) iris database, Olivetti Research Lab (ORL) face database, and Caltech 256 database), text-image-based (Wiki database), and video-based (Ryerson Multimedia Lab (RML) audio-visual emotion database) examples. The experimental results show the superiority of the proposed framework on multi-view feature representation over state-of-the-art algorithms.
引用
收藏
页码:1 / 23
页数:23
相关论文
共 89 条
[1]   A novel approach for multimodal facial expression recognition using deep learning techniques [J].
Begum, Nazmin ;
Mustafa, A. Syed .
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (13) :18521-18529
[2]   LSTM model for visual speech recognition through facial expressions [J].
Bhaskar, Shabina ;
Thasleema, T. M. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (04) :5455-5472
[3]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[4]   Multi-View Nonparametric Discriminant Analysis for Image Retrieval and Recognition [J].
Cao, Guanqun ;
Iosifidis, Alexandros ;
Gabbouj, Moncef .
IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (10) :1537-1541
[5]   Hierarchical Graph Neural Networks for Few-Shot Learning [J].
Chen, Cen ;
Li, Kenli ;
Wei, Wei ;
Zhou, Joey Tianyi ;
Zeng, Zeng .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) :240-252
[6]   PCANet: A Simple Deep Learning Baseline for Image Classification? [J].
Chan, Tsung-Han ;
Jia, Kui ;
Gao, Shenghua ;
Lu, Jiwen ;
Zeng, Zinan ;
Ma, Yi .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) :5017-5032
[7]   Graph Multiview Canonical Correlation Analysis [J].
Chen, Jia ;
Wang, Gang ;
Giannakis, Georgios B. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2019, 67 (11) :2826-2838
[8]   Face recognition using Histograms of Oriented Gradients [J].
Deniz, O. ;
Bueno, G. ;
Salido, J. ;
De la Torre, F. .
PATTERN RECOGNITION LETTERS, 2011, 32 (12) :1598-1603
[9]   An Autuencoder-based Data Augmentation Strategy for Generalization Improvement of DCNNs [J].
Feng, Xiexing ;
Wu, Q. M. Jonathan ;
Yang, Yimin ;
Cao, Libo .
NEUROCOMPUTING, 2020, 402 :283-297
[10]  
Feng yang, 2021, Journal of Electronic Science and Technology, P1, DOI 10.1016/j.jnlest.2021.100096