MultiCAD: Contrastive Representation Learning for Multi-modal 3D Computer-Aided Design Models

被引:9
作者
Ma, Weijian [1 ]
Xu, Minyang [1 ]
Li, Xueyang [1 ]
Zhou, Xiangdong [1 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
来源
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023 | 2023年
关键词
Multimodal Machine Learning; Representation Learning; Contrastive Learning; Computer Aided Design;
D O I
10.1145/3583780.3614982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
CAD models are multimodal data where information and knowledge contained in construction sequences and shapes are complementary to each other and representation learning methods should consider both of them. Such traits have been neglected in previous methods learning unimodal representations. To leverage the information from both modalities, we develop a multimodal contrastive learning strategy where features from different modalities interact via contrastive learning paradigm, driven by a novel multimodal contrastive loss. Two pretext tasks on both geometry and sequence domains are designed along with a two-stage training strategy to make the representation focus on encoding geometric details and decoding representations into construction sequences, thus being more applicable to downstream tasks such as multimodal retrieval and CAD sequence reconstruction. Experimental results show that the performance of our multimodal representation learning scheme has surpassed the baselines and unimodal methods significantly.
引用
收藏
页码:1766 / 1776
页数:11
相关论文
共 58 条
[1]   CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding [J].
Afham, Mohamed ;
Dissanayake, Isuru ;
Dissanayake, Dinithi ;
Dharmasiri, Amaya ;
Thilakarathna, Kanchana ;
Rodrigo, Ranga .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :9892-9902
[2]  
Bhardwaj Rishabh, 2022, P 29 INT C COMP LING, P4801
[3]   Parametric CAD modeling: An analysis of strategies for design reusability [J].
Camba, Jorge D. ;
Contero, Manuel ;
Company, Pedro .
COMPUTER-AIDED DESIGN, 2016, 74 :18-31
[4]  
Chang Angel X., 2015, arXiv
[5]   Wasserstein Contrastive Representation Distillation [J].
Chen, Liqun ;
Wang, Dong ;
Gan, Zhe ;
Liu, Jingjing ;
Henao, Ricardo ;
Carin, Lawrence .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16291-16300
[6]   Unsupervised Learning of Intrinsic Structural Representation Points [J].
Chen, Nenglun ;
Liu, Lingjie ;
Cui, Zhiming ;
Chen, Runnan ;
Ceylan, Duygu ;
Tu, Changhe ;
Wang, Wenping .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9118-9127
[7]   TWIST: Two-Way Inter-label Self-Training for Semi-supervised 3D Instance Segmentation [J].
Chu, Ruihang ;
Ye, Xiaoqing ;
Liu, Zhengzhe ;
Tan, Xiao ;
Qi, Xiaojuan ;
Fu, Chi-Wing ;
Jia, Jiaya .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :1090-1099
[8]  
Chuang C.-Y., 2020, P ADV NEUR INF PROC, V33, P8765, DOI 10.5555/3495724.3496459
[9]  
Community B.O., 2018, BLENDER 3D MODELLING
[10]   Parametric Contrastive Learning [J].
Cui, Jiequan ;
Zhong, Zhisheng ;
Liu, Shu ;
Yu, Bei ;
Jia, Jiaya .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :695-704