MultiCAD: Contrastive Representation Learning for Multi-modal 3D Computer-Aided Design Models

Cited by: 5
Authors
Ma, Weijian [1]
Xu, Minyang [1]
Li, Xueyang [1]
Zhou, Xiangdong [1]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
Source
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023 | 2023
Keywords
Multimodal Machine Learning; Representation Learning; Contrastive Learning; Computer Aided Design
DOI
10.1145/3583780.3614982
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
CAD models are multimodal data: the information contained in construction sequences and in shapes is complementary, so representation learning methods should consider both. Previous methods, which learn unimodal representations, have neglected this trait. To leverage the information from both modalities, we develop a multimodal contrastive learning strategy in which features from the different modalities interact through a contrastive learning paradigm, driven by a novel multimodal contrastive loss. Two pretext tasks on the geometry and sequence domains are designed, along with a two-stage training strategy, so that the representation focuses on encoding geometric details and on being decodable into construction sequences, making it more applicable to downstream tasks such as multimodal retrieval and CAD sequence reconstruction. Experimental results show that our multimodal representation learning scheme significantly outperforms the baselines and unimodal methods.
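The abstract does not give the form of the paper's multimodal contrastive loss. As a rough illustration of how features from two modalities can interact contrastively, the sketch below implements a generic symmetric InfoNCE objective, a commonly used stand-in and not the authors' loss. All names (`symmetric_info_nce`, `seq_emb`, `shape_emb`, `temperature`) are hypothetical, and embeddings are assumed to be non-zero vectors where row i of each modality comes from the same CAD model.

```python
import math


def l2_normalize(v):
    """Scale a non-zero vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def cross_entropy_diag(logits):
    """Mean cross-entropy where the correct class for row i is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        m = max(row)  # subtract the max for a numerically stable log-sum-exp
        log_sum = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_sum - row[i]
    return total / len(logits)


def symmetric_info_nce(seq_emb, shape_emb, temperature=0.07):
    """Generic symmetric InfoNCE between two modalities (illustrative only).

    seq_emb[i] and shape_emb[i] are assumed to describe the same CAD model;
    every other pairing in the batch is treated as a negative.
    """
    seq = [l2_normalize(v) for v in seq_emb]
    shp = [l2_normalize(v) for v in shape_emb]
    # Cosine similarity of every sequence/shape pair, scaled by temperature.
    logits = [[sum(a * b for a, b in zip(s, g)) / temperature for g in shp]
              for s in seq]
    logits_t = [list(col) for col in zip(*logits)]  # shape-to-sequence view
    return 0.5 * (cross_entropy_diag(logits) + cross_entropy_diag(logits_t))


if __name__ == "__main__":
    aligned = symmetric_info_nce([[1.0, 0.0], [0.0, 1.0]],
                                 [[1.0, 0.0], [0.0, 1.0]])
    swapped = symmetric_info_nce([[1.0, 0.0], [0.0, 1.0]],
                                 [[0.0, 1.0], [1.0, 0.0]])
    print(aligned < swapped)  # matched pairs should give the lower loss
```

The loss is low when each sequence embedding is closest to the shape embedding of the same model and high when the pairing is scrambled, which is the behavior a cross-modal retrieval task relies on.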
Pages: 1766-1776
Number of pages: 11
Related papers
50 in total
  • [11] Multi-Modal Representation via Contrastive Learning with Attention Bottleneck Fusion and Attentive Statistics Features
    Guo, Qinglang
    Liao, Yong
    Li, Zhe
    Liang, Shenglin
    ENTROPY, 2023, 25 (10)
  • [12] Applying the 3D Morphological Approach Using the Computer-Aided Product Design
    Mohamed, Tarek Ismail
    2019 3RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (ICDSP 2019), 2019, : 151 - 156
  • [13] Multi-modal brain tumor segmentation via disentangled representation learning and region-aware contrastive learning
    Zhou, Tongxue
    PATTERN RECOGNITION, 2024, 149 (149)
  • [14] The multi-user computer-aided design collaborative learning framework
    Deng, Yuanzhe
    Mueller, Matthew
    Rogers, Chris
    Olechowski, Alison
    ADVANCED ENGINEERING INFORMATICS, 2022, 51
  • [15] Mutual Information Driven Equivariant Contrastive Learning for 3D Action Representation Learning
    Lin, Lilang
    Zhang, Jiahang
    Liu, Jiaying
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1883 - 1897
  • [16] CureGraph: Contrastive multi-modal graph representation learning for urban living circle health profiling and prediction
    Li, Jinlin
    Zhou, Xiao
    ARTIFICIAL INTELLIGENCE, 2025, 340
  • [17] Skeleton-Contrastive 3D Action Representation Learning
    Thoker, Fida Mohammad
    Doughty, Hazel
    Snoek, Cees G. M.
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1655 - 1663
  • [18] 3D Shape Contrastive Representation Learning With Adversarial Examples
    Wen, Congcong
    Li, Xiang
    Huang, Hao
    Liu, Yu-Shen
    Fang, Yi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 679 - 692
  • [19] 3D Printing and Computer-Aided Design for Precision Osteotomy-Aided Modules in Bone Biomechanical Study
    Wang, Daofeng
    Han, Lin
    Xu, Gaoxiang
    Zhang, Wupeng
    Li, Hua
    Xu, Cheng
    Li, Huanyu
    Li, Jitian
    Zhang, Hao
    Li, Jiantao
    INTERNATIONAL JOURNAL OF BIOPRINTING, 2022, 8 (04) : 108 - 116
  • [20] Supervised Contrastive Learning for 3D Cross-Modal Retrieval
    Choo, Yeon-Seung
    Kim, Boeun
    Kim, Hyun-Sik
    Park, Yong-Suk
    APPLIED SCIENCES-BASEL, 2024, 14 (22)