MultiCAD: Contrastive Representation Learning for Multi-modal 3D Computer-Aided Design Models

被引:5
|
作者
Ma, Weijian [1 ]
Xu, Minyang [1 ]
Li, Xueyang [1 ]
Zhou, Xiangdong [1 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
来源
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023 | 2023年
关键词
Multimodal Machine Learning; Representation Learning; Contrastive Learning; Computer Aided Design;
D O I
10.1145/3583780.3614982
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
CAD models are multimodal data where information and knowledge contained in construction sequences and shapes are complementary to each other and representation learning methods should consider both of them. Such traits have been neglected in previous methods learning unimodal representations. To leverage the information from both modalities, we develop a multimodal contrastive learning strategy where features from different modalities interact via contrastive learning paradigm, driven by a novel multimodal contrastive loss. Two pretext tasks on both geometry and sequence domains are designed along with a two-stage training strategy to make the representation focus on encoding geometric details and decoding representations into construction sequences, thus being more applicable to downstream tasks such as multimodal retrieval and CAD sequence reconstruction. Experimental results show that the performance of our multimodal representation learning scheme has surpassed the baselines and unimodal methods significantly.
引用
收藏
页码:1766 / 1776
页数:11
相关论文
共 50 条
  • [1] ContrastCAD: Contrastive Learning-Based Representation Learning for Computer-Aided Design Models
    Jung, Minseop
    Kim, Minseong
    Kim, Jibum
    IEEE ACCESS, 2024, 12 : 84830 - 84842
  • [2] Multi-Modal 3D Shape Clustering with Dual Contrastive Learning
    Lin, Guoting
    Zheng, Zexun
    Chen, Lin
    Qin, Tianyi
    Song, Jiahui
    APPLIED SCIENCES-BASEL, 2022, 12 (15):
  • [3] Contrastive Multi-Modal Knowledge Graph Representation Learning
    Fang, Quan
    Zhang, Xiaowei
    Hu, Jun
    Wu, Xian
    Xu, Changsheng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 8983 - 8996
  • [4] Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering
    Xia, Wei
    Wang, Tianxiu
    Gao, Quanxue
    Yang, Ming
    Gao, Xinbo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1170 - 1183
  • [5] CLMTR: a generic framework for contrastive multi-modal trajectory representation learning
    Liang, Anqi
    Yao, Bin
    Xie, Jiong
    Zheng, Wenli
    Shen, Yanyan
    Ge, Qiqi
    GEOINFORMATICA, 2024, : 233 - 253
  • [6] mmMCL3DMOT: Multi-Modal Momentum Contrastive Learning for 3D Multi-Object Tracking
    Hong, Ru
    Yang, Jiming
    Zhou, Weidian
    Da, Feipeng
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1895 - 1899
  • [7] CF-CAD: A Contrastive Fusion Network For 3D Computer-Aided Design Generative Modeling
    Li, Xueyang
    Chen, Haotian
    Lou, Yunzhong
    Zhou, Xiangdong
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT 3, 2025, 14852 : 435 - 450
  • [8] 3-D Models for Town Domain Computer-Aided Design
    Mazur, Vitaliy
    EXPERIENCE OF DESIGNING AND APPLICATION OF CAD SYSTEMS IN MICROELECTRONICS: PROCEEDINGS OF THE XTH INTERNATIONAL CONFERENCE CADSM 2009, 2009, : 451 - 452
  • [9] FMCS: Improving Code Search by Multi-Modal Representation Fusion and Momentum Contrastive Learning
    Liu, Wenjie
    Chen, Gong
    Xie, Xiaoyuan
    2024 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2024, : 632 - 638
  • [10] JM3D & JM3D-LLM: Elevating 3D Representation With Joint Multi-Modal Cues
    Ji, Jiayi
    Wang, Haowei
    Wu, Changli
    Ma, Yiwei
    Sun, Xiaoshuai
    Ji, Rongrong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) : 2475 - 2492