Semantic Reconstruction of Multimodal Process Data With Dual Latent Space Constraints

被引：0

作者：

Qiu, Kepeng ^{[1
]}

Yang, Jiayu ^{[2
]}

Rong, Baowei ^{[1
]}

Wang, Weiwei ^{[1
]}

Liu, Yu ^{[1
]}

机构：

[1] Beijing Inst Petrochem Technol, Sch Informat Engn, Beijing 102617, Peoples R China

[2] China Nucl Power Engn Co Ltd, Beijing 100048, Peoples R China

来源：

IEEE SENSORS JOURNAL | 2024年 / 24卷 / 20期

关键词：

Feature extraction; Semantics; Process monitoring; Data models; Sensors; Image reconstruction; Contrastive learning; Industrial process; interpretability; latent features; multimodal data; semantic reconstruction; FAULT-DETECTION; MODEL; NETWORK;

D O I：

10.1109/JSEN.2024.3451190

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Industrial processes often generate multimodal data with complex dynamics and distinct characteristics across multiple stages or conditions. Reconstructing the intrinsic semantic information from such data is essential for process monitoring and fault diagnosis. However, existing feature extraction methods often prioritize minimizing reconstruction error, which can overlook the importance of semantic interpretation and lead to limited accuracy and interpretability in the reconstructed results. To address this limitation, a novel semantic reconstruction framework for multimodal process data, driven by dual latent space constraints, is proposed. This approach utilizes a semantic consistency constraint and a multimodal characteristic constraint to extract latent space representations that effectively capture the intrinsic characteristics of the multimodal data. The core innovation of this framework lies in the integration of these dual constraints to obtain a comprehensive and interpretable representation of multimodal data. By jointly optimizing the dual latent space constraints and balancing reconstruction accuracy with interpretability, the proposed approach goes beyond simply minimizing the reconstruction error and focuses on learning expressive latent features that enable effective semantic interpretation. Experiments on three industrial benchmark datasets demonstrate the excellent performance of the proposed method, achieving an average accuracy of 96.73% and a maximum improvement of 13.34% compared to other methods.

引用

页码：32782 / 32791

页数：10

共 50 条

[1] Latent Space Model for Process Data
Chen, Yi
Zhang, Jingru
Yang, Yi
Lee, Young-Sun
JOURNAL OF EDUCATIONAL MEASUREMENT, 2022, 59 (04) : 517 - 535
[2] First Steps: Latent-Space Control with Semantic Constraints for Quadruped Locomotion
Mitchell, Alexander L.
Engelcke, Martin
Jones, Oiwi Parker
Surovik, David
Gangapurwala, Siddhant
Melon, Oliwier
Havoutis, Ioannis
Posner, Ingmar
2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5343 - 5350
[3] A Hybrid Latent Space Data Fusion Method for Multimodal Emotion Recognition
Nemati, Shahla
Rohani, Reza
Basiri, Mohammad Ehsan
Abdar, Moloud
Yen, Neil Y.
Makarenkov, Vladimir
IEEE ACCESS, 2019, 7 : 172948 - 172964
[4] Latent space unsupervised semantic segmentation
Strommen, Knut J. J.
Torresen, Jim
Cote-Allard, Ulysse
FRONTIERS IN PHYSIOLOGY, 2023, 14
[5] SEMANTIC UNFOLDING OF STYLEGAN LATENT SPACE
Shukor, Mustafa
Yao, Xu
Damodaran, Bharath Bushan
Hellier, Pierre
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 221 - 225
[6] Multimodal sensor fusion in the latent representation space
Piechocki, Robert J.
Wang, Xiaoyang
Bocus, Mohammud J.
SCIENTIFIC REPORTS, 2023, 13 (01)
[7] Multimodal sensor fusion in the latent representation space
Robert J. Piechocki
Xiaoyang Wang
Mohammud J. Bocus
Scientific Reports, 13
[8] Multimodal Recommendation Method Integrating Latent Structures and Semantic Information
Zhang X.
Liang Z.
Yao C.
Li Z.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (03): : 231 - 241
[9] Latent Semantic Analysis for Multimodal User Input With Speech and Gestures
Hui, Pui-Yu
Meng, Helen
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 417 - 429
[10] Latent Space Translation via Semantic Alignment
Maiorca, Valentino
Moschella, Luca
Norelli, Antonio
Fumero, Marco
Locatello, Francesco
Rodola, Emanuele
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,

← 1 2 3 4 5 →