Semantic Reconstruction of Multimodal Process Data With Dual Latent Space Constraints

被引:0
|
作者
Qiu, Kepeng [1 ]
Yang, Jiayu [2 ]
Rong, Baowei [1 ]
Wang, Weiwei [1 ]
Liu, Yu [1 ]
机构
[1] Beijing Inst Petrochem Technol, Sch Informat Engn, Beijing 102617, Peoples R China
[2] China Nucl Power Engn Co Ltd, Beijing 100048, Peoples R China
关键词
Feature extraction; Semantics; Process monitoring; Data models; Sensors; Image reconstruction; Contrastive learning; Industrial process; interpretability; latent features; multimodal data; semantic reconstruction; FAULT-DETECTION; MODEL; NETWORK;
D O I
10.1109/JSEN.2024.3451190
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Industrial processes often generate multimodal data with complex dynamics and distinct characteristics across multiple stages or conditions. Reconstructing the intrinsic semantic information from such data is essential for process monitoring and fault diagnosis. However, existing feature extraction methods often prioritize minimizing reconstruction error, which can overlook the importance of semantic interpretation and lead to limited accuracy and interpretability in the reconstructed results. To address this limitation, a novel semantic reconstruction framework for multimodal process data, driven by dual latent space constraints, is proposed. This approach utilizes a semantic consistency constraint and a multimodal characteristic constraint to extract latent space representations that effectively capture the intrinsic characteristics of the multimodal data. The core innovation of this framework lies in the integration of these dual constraints to obtain a comprehensive and interpretable representation of multimodal data. By jointly optimizing the dual latent space constraints and balancing reconstruction accuracy with interpretability, the proposed approach goes beyond simply minimizing the reconstruction error and focuses on learning expressive latent features that enable effective semantic interpretation. Experiments on three industrial benchmark datasets demonstrate the excellent performance of the proposed method, achieving an average accuracy of 96.73% and a maximum improvement of 13.34% compared to other methods.
引用
收藏
页码:32782 / 32791
页数:10
相关论文
共 50 条
  • [1] Latent Space Model for Process Data
    Chen, Yi
    Zhang, Jingru
    Yang, Yi
    Lee, Young-Sun
    JOURNAL OF EDUCATIONAL MEASUREMENT, 2022, 59 (04) : 517 - 535
  • [2] First Steps: Latent-Space Control with Semantic Constraints for Quadruped Locomotion
    Mitchell, Alexander L.
    Engelcke, Martin
    Jones, Oiwi Parker
    Surovik, David
    Gangapurwala, Siddhant
    Melon, Oliwier
    Havoutis, Ioannis
    Posner, Ingmar
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5343 - 5350
  • [3] A Hybrid Latent Space Data Fusion Method for Multimodal Emotion Recognition
    Nemati, Shahla
    Rohani, Reza
    Basiri, Mohammad Ehsan
    Abdar, Moloud
    Yen, Neil Y.
    Makarenkov, Vladimir
    IEEE ACCESS, 2019, 7 : 172948 - 172964
  • [4] Latent space unsupervised semantic segmentation
    Strommen, Knut J. J.
    Torresen, Jim
    Cote-Allard, Ulysse
    FRONTIERS IN PHYSIOLOGY, 2023, 14
  • [5] SEMANTIC UNFOLDING OF STYLEGAN LATENT SPACE
    Shukor, Mustafa
    Yao, Xu
    Damodaran, Bharath Bushan
    Hellier, Pierre
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 221 - 225
  • [6] Multimodal sensor fusion in the latent representation space
    Piechocki, Robert J.
    Wang, Xiaoyang
    Bocus, Mohammud J.
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [7] Multimodal sensor fusion in the latent representation space
    Robert J. Piechocki
    Xiaoyang Wang
    Mohammud J. Bocus
    Scientific Reports, 13
  • [8] Multimodal Recommendation Method Integrating Latent Structures and Semantic Information
    Zhang X.
    Liang Z.
    Yao C.
    Li Z.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2024, 37 (03): : 231 - 241
  • [9] Latent Semantic Analysis for Multimodal User Input With Speech and Gestures
    Hui, Pui-Yu
    Meng, Helen
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 417 - 429
  • [10] Latent Space Translation via Semantic Alignment
    Maiorca, Valentino
    Moschella, Luca
    Norelli, Antonio
    Fumero, Marco
    Locatello, Francesco
    Rodola, Emanuele
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,