An Optimal Edge-weighted Graph Semantic Correlation Framework for Multi-view Feature Representation Learning

被引：1

作者：

Gao, Lei ^{[1
]}

Guo, Zheng ^{[1
]}

Guan, Ling ^{[1
]}

机构：

[1] Toronto Metropolitan Univ, 350 Victoria St, Toronto, ON M5B 2K3, Canada

来源：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2024年 / 20卷 / 07期

关键词：

Multi-view feature representation; graph model; semantic correlation; data visualization; face recognition; emotion recognition; text-image recognition; object recognition; CANONICAL CORRELATION-ANALYSIS; PRESERVING PROJECTIONS; MODEL; SPARSE; CLASSIFICATION; RETRIEVAL;

D O I：

10.1145/3649466

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this article, we present an optimal edge-weighted graph semantic correlation (EWGSC) framework for multi-view feature representation learning. Different from most existing multi-view representation methods, local structural information and global correlation in multi-view feature spaces are exploited jointly in the EWGSC framework, leading to a new and high-qualitymulti-view feature representation. Specifically, a novel edge-weighted graph model is first conceptualized and developed to preserve local structural information in each of the multi-view feature spaces. Then, the explored structural information is integrated with a semantic correlation algorithm, labeled multiple canonical correlation analysis (LMCCA), to form a powerful platform for effectively exploiting local and global relations across multi-view feature spaces jointly. We then theoretically verified the relation between the upper limit on the number of projected dimensions and the optimal solution to the multi-view feature representation problem. To validate the effectiveness and generality of the proposed framework, we conducted experiments on five datasets of different scales, including visual-based (University of California Irvine (UCI) iris database, Olivetti Research Lab (ORL) face database, and Caltech 256 database), text-image-based (Wiki database), and video-based (Ryerson Multimedia Lab (RML) audio-visual emotion database) examples. The experimental results show the superiority of the proposed framework on multi-view feature representation over state-of-the-art algorithms.

引用

页码：1 / 23

页数：23

共 89 条

[1] A novel approach for multimodal facial expression recognition using deep learning techniques [J].

Begum, Nazmin ;

Mustafa, A. Syed .

MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (13) :18521-18529

[2] LSTM model for visual speech recognition through facial expressions [J].

Bhaskar, Shabina ;

Thasleema, T. M. .

MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (04) :5455-5472

[3] Latent Dirichlet allocation [J].

Blei, DM ;

Ng, AY ;

Jordan, MI .

JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022

[4] Multi-View Nonparametric Discriminant Analysis for Image Retrieval and Recognition [J].

Cao, Guanqun ;

Iosifidis, Alexandros ;

Gabbouj, Moncef .

IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (10) :1537-1541

[5] Hierarchical Graph Neural Networks for Few-Shot Learning [J].

Chen, Cen ;

Li, Kenli ;

Wei, Wei ;

Zhou, Joey Tianyi ;

Zeng, Zeng .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) :240-252

[6] PCANet: A Simple Deep Learning Baseline for Image Classification? [J].

Chan, Tsung-Han ;

Jia, Kui ;

Gao, Shenghua ;

Lu, Jiwen ;

Zeng, Zinan ;

Ma, Yi .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) :5017-5032

[7] Graph Multiview Canonical Correlation Analysis [J].

Chen, Jia ;

Wang, Gang ;

Giannakis, Georgios B. .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2019, 67 (11) :2826-2838

[8] Face recognition using Histograms of Oriented Gradients [J].

Deniz, O. ;

Bueno, G. ;

Salido, J. ;

De la Torre, F. .

PATTERN RECOGNITION LETTERS, 2011, 32 (12) :1598-1603

[9] An Autuencoder-based Data Augmentation Strategy for Generalization Improvement of DCNNs [J].

Feng, Xiexing ;

Wu, Q. M. Jonathan ;

Yang, Yimin ;

Cao, Libo .

NEUROCOMPUTING, 2020, 402 :283-297

[10]

Feng yang, 2021, Journal of Electronic Science and Technology, P1, DOI [10.1016/j.jnlest.2021.100096, 10.1016/j.jnlest.2021.100096]

← 1 2 3 4 5 6 7 8 9 →