MultiCAD: Contrastive Representation Learning for Multi-modal 3D Computer-Aided Design Models

被引：5

作者：

Ma, Weijian ^{[1
]}

Xu, Minyang ^{[1
]}

Li, Xueyang ^{[1
]}

Zhou, Xiangdong ^{[1
]}

机构：

[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China

来源：

PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023 | 2023年

关键词：

Multimodal Machine Learning; Representation Learning; Contrastive Learning; Computer Aided Design;

D O I：

10.1145/3583780.3614982

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

CAD models are multimodal data where information and knowledge contained in construction sequences and shapes are complementary to each other and representation learning methods should consider both of them. Such traits have been neglected in previous methods learning unimodal representations. To leverage the information from both modalities, we develop a multimodal contrastive learning strategy where features from different modalities interact via contrastive learning paradigm, driven by a novel multimodal contrastive loss. Two pretext tasks on both geometry and sequence domains are designed along with a two-stage training strategy to make the representation focus on encoding geometric details and decoding representations into construction sequences, thus being more applicable to downstream tasks such as multimodal retrieval and CAD sequence reconstruction. Experimental results show that the performance of our multimodal representation learning scheme has surpassed the baselines and unimodal methods significantly.

引用

页码：1766 / 1776

页数：11

共 50 条

[1] ContrastCAD: Contrastive Learning-Based Representation Learning for Computer-Aided Design Models
Jung, Minseop
Kim, Minseong
Kim, Jibum
IEEE ACCESS, 2024, 12 : 84830 - 84842
[2] Multi-Modal 3D Shape Clustering with Dual Contrastive Learning
Lin, Guoting
Zheng, Zexun
Chen, Lin
Qin, Tianyi
Song, Jiahui
APPLIED SCIENCES-BASEL, 2022, 12 (15):
[3] Contrastive Multi-Modal Knowledge Graph Representation Learning
Fang, Quan
Zhang, Xiaowei
Hu, Jun
Wu, Xian
Xu, Changsheng
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 8983 - 8996
[4] Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering
Xia, Wei
Wang, Tianxiu
Gao, Quanxue
Yang, Ming
Gao, Xinbo
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1170 - 1183
[5] CLMTR: a generic framework for contrastive multi-modal trajectory representation learning
Liang, Anqi
Yao, Bin
Xie, Jiong
Zheng, Wenli
Shen, Yanyan
Ge, Qiqi
GEOINFORMATICA, 2024, : 233 - 253
[6] mmMCL3DMOT: Multi-Modal Momentum Contrastive Learning for 3D Multi-Object Tracking
Hong, Ru
Yang, Jiming
Zhou, Weidian
Da, Feipeng
IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1895 - 1899
[7] CF-CAD: A Contrastive Fusion Network For 3D Computer-Aided Design Generative Modeling
Li, Xueyang
Chen, Haotian
Lou, Yunzhong
Zhou, Xiangdong
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT 3, 2025, 14852 : 435 - 450
[8] 3-D Models for Town Domain Computer-Aided Design
Mazur, Vitaliy
EXPERIENCE OF DESIGNING AND APPLICATION OF CAD SYSTEMS IN MICROELECTRONICS: PROCEEDINGS OF THE XTH INTERNATIONAL CONFERENCE CADSM 2009, 2009, : 451 - 452
[9] FMCS: Improving Code Search by Multi-Modal Representation Fusion and Momentum Contrastive Learning
Liu, Wenjie
Chen, Gong
Xie, Xiaoyuan
2024 IEEE 24TH INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2024, : 632 - 638
[10] JM3D & JM3D-LLM: Elevating 3D Representation With Joint Multi-Modal Cues
Ji, Jiayi
Wang, Haowei
Wu, Changli
Ma, Yiwei
Sun, Xiaoshuai
Ji, Rongrong
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (04) : 2475 - 2492

← 1 2 3 4 5 →