MultiCAD: Contrastive Representation Learning for Multi-modal 3D Computer-Aided Design Models

被引:9
作者
Ma, Weijian [1 ]
Xu, Minyang [1 ]
Li, Xueyang [1 ]
Zhou, Xiangdong [1 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
来源
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023 | 2023年
关键词
Multimodal Machine Learning; Representation Learning; Contrastive Learning; Computer Aided Design;
D O I
10.1145/3583780.3614982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
CAD models are multimodal data where information and knowledge contained in construction sequences and shapes are complementary to each other and representation learning methods should consider both of them. Such traits have been neglected in previous methods learning unimodal representations. To leverage the information from both modalities, we develop a multimodal contrastive learning strategy where features from different modalities interact via contrastive learning paradigm, driven by a novel multimodal contrastive loss. Two pretext tasks on both geometry and sequence domains are designed along with a two-stage training strategy to make the representation focus on encoding geometric details and decoding representations into construction sequences, thus being more applicable to downstream tasks such as multimodal retrieval and CAD sequence reconstruction. Experimental results show that the performance of our multimodal representation learning scheme has surpassed the baselines and unimodal methods significantly.
引用
收藏
页码:1766 / 1776
页数:11
相关论文
共 50 条
[11]   FMCS: Improving Code Search by Multi-Modal Representation Fusion and Momentum Contrastive Learning [J].
Liu, Wenjie ;
Chen, Gong ;
Xie, Xiaoyuan .
2024 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2024, :632-638
[12]   JM3D & JM3D-LLM: Elevating 3D Representation With Joint Multi-Modal Cues [J].
Ji, Jiayi ;
Wang, Haowei ;
Wu, Changli ;
Ma, Yiwei ;
Sun, Xiaoshuai ;
Ji, Rongrong .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) :2475-2492
[13]   Applying the 3D Morphological Approach Using the Computer-Aided Product Design [J].
Mohamed, Tarek Ismail .
2019 3RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (ICDSP 2019), 2019, :151-156
[14]   Multi-Modal Representation via Contrastive Learning with Attention Bottleneck Fusion and Attentive Statistics Features [J].
Guo, Qinglang ;
Liao, Yong ;
Li, Zhe ;
Liang, Shenglin .
ENTROPY, 2023, 25 (10)
[15]   Multi-modal brain tumor segmentation via disentangled representation learning and region-aware contrastive learning [J].
Zhou, Tongxue .
PATTERN RECOGNITION, 2024, 149
[16]   The multi-user computer-aided design collaborative learning framework [J].
Deng, Yuanzhe ;
Mueller, Matthew ;
Rogers, Chris ;
Olechowski, Alison .
ADVANCED ENGINEERING INFORMATICS, 2022, 51
[17]   Mutual Information Driven Equivariant Contrastive Learning for 3D Action Representation Learning [J].
Lin, Lilang ;
Zhang, Jiahang ;
Liu, Jiaying .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 :1883-1897
[18]   CureGraph: Contrastive multi-modal graph representation learning for urban living circle health profiling and prediction [J].
Li, Jinlin ;
Zhou, Xiao .
ARTIFICIAL INTELLIGENCE, 2025, 340
[19]   3D Printing and Computer-Aided Design for Precision Osteotomy-Aided Modules in Bone Biomechanical Study [J].
Wang, Daofeng ;
Han, Lin ;
Xu, Gaoxiang ;
Zhang, Wupeng ;
Li, Hua ;
Xu, Cheng ;
Li, Huanyu ;
Li, Jitian ;
Zhang, Hao ;
Li, Jiantao .
INTERNATIONAL JOURNAL OF BIOPRINTING, 2022, 8 (04) :108-116
[20]   Skeleton-Contrastive 3D Action Representation Learning [J].
Thoker, Fida Mohammad ;
Doughty, Hazel ;
Snoek, Cees G. M. .
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :1655-1663