Design Synthesis of Structural Systems as a Markov Decision Process Solved With Deep Reinforcement Learning

被引:9
作者
Ororbia, Maximilian E. [1 ]
Warn, Gordon P. [1 ]
机构
[1] Penn State Univ, Dept Civil Engn, University Pk, PA 16802 USA
关键词
design process; machine learning; DISCRETE BAR AREAS; TRUSS TOPOLOGY; GLOBAL OPTIMIZATION; ALGORITHM;
D O I
10.1115/1.4056693
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Recently, it was demonstrated that the design synthesis of truss structures can be modeled as a Markov decision process (MDP) and solved using a tabular reinforcement learning method. In this setting, each state corresponds to a specific design configuration represented as a finite graph. However, when the structural design domain is relatively large, and depending on the constraints, the dimensionality of the state space becomes quite large rendering tabular reinforcement learning algorithms inefficient. Hence, in this study, the design synthesis MDP framework is significantly extended to solve structural design problems with large state spaces, by integrating deep reinforcement learning (DRL) into the general MDP framework. This is beneficial because with DRL, a deep neural network can be used to approximate the state-action value function, such that the network has much fewer parameters than the cardinality of the state space. This parameterization relies upon a problem relevant set of features and reward function. Thus, for this extended DRL design synthesis (DRLDS) framework, a compact set of features and a reward function are devised that are suitable for structural design problems where structural configurations are represented as finite graphs. Through the application of seven different structural design synthesis examples, the DRLDS framework is demonstrated to be capable of adeptly learning optimal policies that synthesize high, if not the highest, performing design solutions more frequently. The DRLDS framework does this with fewer finite element model evaluations than other considered alternative methods, further demonstrating the effectiveness of the developed set of features and reward function.
引用
收藏
页数:11
相关论文
共 54 条
  • [1] Global optimization of truss topology with discrete bar areas - Part I: theory of relaxed problems
    Achtziger, Wolfgang
    Stolpe, Mathias
    [J]. COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2008, 40 (02) : 247 - 280
  • [2] Global optimization of truss topology with discrete bar areas-Part II: Implementation and numerical results
    Achtziger, Wolfgang
    Stolpe, Mathias
    [J]. COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2009, 44 (02) : 315 - 341
  • [3] Antonsson E.K., 2005, Formal Engineering Design Synthesis
  • [4] Bathe KJ, 1996, Finite element proceedures
  • [5] Burnap A, 2016, PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2016, VOL 2A
  • [6] A framework for computational design synthesis: Model and applications
    Cagan, J
    Campbell, MI
    Finger, S
    Tomiyama, T
    [J]. JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2005, 5 (03) : 171 - 181
  • [7] Computational design synthesis
    Campbell, Matthew I.
    Shea, Kristina
    [J]. AI EDAM-ARTIFICIAL INTELLIGENCE FOR ENGINEERING DESIGN ANALYSIS AND MANUFACTURING, 2014, 28 (03): : 207 - 208
  • [8] PaDGAN: Learning to Generate High-Quality Novel Designs
    Chen, Wei
    Ahmed, Faez
    [J]. JOURNAL OF MECHANICAL DESIGN, 2021, 143 (03)
  • [9] Dering M. L., 2017, 2017 AAAI FALL S SER
  • [10] Dering ML, 2017, IEEE INT CONF BIG DA, P2595, DOI 10.1109/BigData.2017.8258219