Attention-Enhanced Multimodal Learning for Conceptual Design Evaluations

被引:11
作者
Song, Binyang [1 ]
Miller, Scarlett [2 ]
Ahmed, Faez [1 ]
机构
[1] MIT, Dept Mech Engn, Cambridge, MA 02139 USA
[2] Penn State Univ, Sch Engn Design & Innovat, State Coll, PA 16802 USA
关键词
conceptual design; creativity and concept generation; design evaluation; machine learning; multimodal learning; CREATIVITY; NOVELTY;
D O I
10.1115/1.4056669
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Conceptual design evaluation is an indispensable component of innovation in the early stage of engineering design. Properly assessing the effectiveness of conceptual design requires a rigorous evaluation of the outputs. Traditional methods to evaluate conceptual designs are slow, expensive, and difficult to scale because they rely on human expert input. An alternative approach is to use computational methods to evaluate design concepts. However, most existing methods have limited utility because they are constrained to unimodal design representations (e.g., texts or sketches). To overcome these limitations, we propose an attention-enhanced multimodal learning (AEMML)-based machine learning (ML) model to predict five design metrics: drawing quality, uniqueness, elegance, usefulness, and creativity. The proposed model utilizes knowledge from large external datasets through transfer learning (TL), simultaneously processes text and sketch data from early-phase concepts, and effectively fuses the multimodal information through a mutual cross-attention mechanism. To study the efficacy of multimodal learning (MML) and attention-based information fusion, we compare (1) a baseline MML model and the unimodal models and (2) the attention-enhanced models with baseline models in terms of their explanatory power for the variability of the design metrics. The results show that MML improves the model explanatory power by 0.05-0.12 and the mutual cross-attention mechanism further increases the explanatory power of the approach by 0.05-0.09, leading to the highest explanatory power of 0.44 for drawing quality, 0.60 for uniqueness, 0.45 for elegance, 0.43 for usefulness, and 0.32 for creativity. Our findings highlight the benefit of using multimodal representations for design metric assessment.
引用
收藏
页数:12
相关论文
共 79 条
  • [1] Ahmed F., 2022, ASME 2022 INT DESIGN
  • [2] Ahmed F, 2018, PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2018, VOL 7
  • [3] Interpreting Idea Maps: Pairwise Comparisons Reveal What Makes Ideas Novel
    Ahmed, Faez
    Ramachandran, Sharath Kumar
    Fuge, Mark
    Hunter, Samuel
    Miller, Scarlett
    [J]. JOURNAL OF MECHANICAL DESIGN, 2019, 141 (02)
  • [4] Capturing Winning Ideas in Online Design Communities
    Ahmed, Faez
    Fuge, Mark
    [J]. CSCW'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, 2017, : 1675 - 1687
  • [6] Amabile TM., 1996, CREATIVITY CONTEXT U
  • [7] Anastasopoulos A, 2019, Arxiv, DOI arXiv:1903.02930
  • [8] Baer J., 2018, PALGRAVE HDB SOCIAL, P27
  • [9] Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473, DOI 10.48550/ARXIV.1409.0473]
  • [10] Brown D. C., 2014, 6 INT C DESIGN COMPU, P1