Supervised cross-modal factor analysis for multiple modal data classification

被引：13

作者：

Wang, Jingbin ^{[1
,2
]}

Zhou, Yihua ^{[3
]}

Duan, Kanghong ^{[4
]}

Wang, Jim Jing-Yan ^{[5
]}

Bensmail, Halima ^{[6
]}

机构：

[1] Chinese Acad Sci, Natl Time Serv Ctr, Xian 710600, Peoples R China

[2] Chinese Acad Sci, Grad Univ, Beijing 100039, Peoples R China

[3] Lehigh Univ, Dept Mech Engn & Mech, Bethlehem, PA 18015 USA

[4] State Ocean Adm, North China Sea Marine Tech Support Ctr, Qingdao 266033, Peoples R China

[5] King Abdullah Univ Sci & Technol, Comp Elect & Math Sci & Engn Div, Thuwal 23955, Saudi Arabia

[6] Qatar Comp Res Inst, Doha 5825, Qatar

来源：

2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS | 2015年

关键词：

Multiple modal learning; Cross-modal factor analysis; Supervised learning; SPARSE REPRESENTATION; TEXT CLASSIFICATION; SURFACE; ACTIVATION; NETWORK;

D O I：

10.1109/SMC.2015.329

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper we study the problem of learning from multiple modal data for purpose of document classification. In this problem, each document is composed two different modals of data, i.e., an image and a text. Cross-modal factor analysis (CFA) has been proposed to project the two different modals of data to a shared data space, so that the classification of a image or a text can be performed directly in this space. A disadvantage of CFA is that it has ignored the supervision information. In this paper, we improve CFA by incorporating the supervision information to represent and classify both image and text modals of documents. We project both image and text data to a shared data space by factor analysis, and then train a class label predictor in the shared space to use the class label information. The factor analysis parameter and the predictor parameter are learned jointly by solving one single objective function. With this objective function, we minimize the distance between the projections of image and text of the same document, and the classification error of the projection measured by hinge loss function. The objective function is optimized by an alternate optimization strategy in an iterative algorithm. Experiments in two different multiple modal document data sets show the advantage of the proposed algorithm over other CFA methods.

引用

页码：1882 / 1888

页数：7

共 50 条

[1] Joint learning of cross-modal classifier and factor analysis for multimedia data classification
Duan, Kanghong
Zhang, Hongxin
Wang, Jim Jing-Yan
NEURAL COMPUTING & APPLICATIONS, 2016, 27 (02) : 459 - 468
[2] Joint learning of cross-modal classifier and factor analysis for multimedia data classification
Kanghong Duan
Hongxin Zhang
Jim Jing-Yan Wang
Neural Computing and Applications, 2016, 27 : 459 - 468
[3] A semi-supervised cross-modal memory bank for cross-modal retrieval
Huang, Yingying
Hu, Bingliang
Zhang, Yipeng
Gao, Chi
Wang, Quan
NEUROCOMPUTING, 2024, 579
[4] Federated learning for supervised cross-modal retrieval
Li, Ang
Li, Yawen
Shao, Yingxia
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2024, 27 (04):
[5] Supervised Contrastive Discrete Hashing for cross-modal retrieval
Li, Ze
Yao, Tao
Wang, Lili
Li, Ying
Wang, Gang
KNOWLEDGE-BASED SYSTEMS, 2024, 295
[6] Information Fusion via Deep Cross-Modal Factor Analysis
Gao, Lei
Guan, Ling
2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
[7] Discriminative deep asymmetric supervised hashing for cross-modal retrieval
Qiang, Haopeng
Wan, Yuan
Liu, Ziyi
Xiang, Lun
Meng, Xiaojing
KNOWLEDGE-BASED SYSTEMS, 2020, 204
[8] Global and cross-modal feature aggregation for multi-omics data classification and on
Zheng, Xiao
Wang, Minhui
Huang, Kai
Zhu, En
INFORMATION FUSION, 2024, 102
[9] Self-Supervised Intra-Modal and Cross-Modal Contrastive Learning for Point Cloud Understanding
Wu, Yue
Liu, Jiaming
Gong, Maoguo
Gong, Peiran
Fan, Xiaolong
Qin, A. K.
Miao, Qiguang
Ma, Wenping
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1626 - 1638
[10] Online supervised collective matrix factorization hashing for cross-modal retrieval
Shu, Zhenqiu
Li, Li
Yu, Jun
Zhang, Donglin
Yu, Zhengtao
Wu, Xiao-Jun
APPLIED INTELLIGENCE, 2023, 53 (11) : 14201 - 14218

← 1 2 3 4 5 →