Reinforcement learning based process optimization and strategy development in conventional tunneling

被引:22
作者
Erharter, Georg H. [1 ]
Hansen, Tom F. [2 ]
Liu, Zhongqiang [2 ]
Marcher, Thomas [1 ]
机构
[1] Graz Univ Technol, Inst Rock Mech & Tunnelling, Rechbauerstr 12, Graz, Austria
[2] Norwegian Geotech Inst, Oslo, Norway
关键词
Conventional tunneling; Reinforcement learning; Tunnel excavation strategy; Machine learning; Excavation sequences; LEVEL CONTROL; SOIL;
D O I
10.1016/j.autcon.2021.103701
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Reinforcement learning (RL) - a branch of machine learning - refers to the process of an agent learning to achieve a certain goal by interaction with its environment. The process of conventional tunneling shows many similarities, where a geotechnician (agent) tries to achieve a breakthrough (goal) by excavating the rockmass (environment) in an optimum way. In this paper we present a novel RL based framework for strategy development for conventional tunneling. We developed a virtual environment with the goal of a tunnel breakthrough and with a deep Q-network as the agent's architecture. It can choose from different excavation sequences to reach that goal and learns to do so in an economical and safe way by getting feedback from a specially designed reward system. Result analyses show that the optimal policies have great similarities to current practices of sequential tunneling and the framework has the potential to discover new tunneling strategies.
引用
收藏
页数:12
相关论文
共 52 条
  • [31] Radinger A., 2014, GEOMECHANIK TUNNELBA, V7, P540, DOI DOI 10.1002/GEOT.201400038
  • [32] Raschka S., 2019, SCIKIT LEARN TENSORF, V2
  • [33] Schwung D., 2019, INT J COMPUT, V18, P360
  • [34] Shahin M.A., 2009, Advances in Artificial Neural Systems, V2009, P1, DOI [DOI 10.1155/2009/308239, 10.1155/2009/308239]
  • [35] A reinforcement learning approach to parameter estimation in dynamic job shop scheduling
    Shahrabi, Jamal
    Adibi, Mohammad Amin
    Mahootchi, Masoud
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2017, 110 : 75 - 82
  • [36] Use of soft computing techniques for tunneling optimization of tunnel boring machines
    Shahrour, Isam
    Zhang, Wengang
    [J]. UNDERGROUND SPACE, 2021, 6 (03) : 233 - 239
  • [37] Sheil Brian B., 2020, Proceedings of the Institution of Civil Engineers - Smart Infrastructure and Construction, V173, P74, DOI 10.1680/jsmic.20.00011
  • [38] Application of soft computing techniques in tunnelling and underground excavations: state of the art and future prospects
    Shreyas, S. K.
    Dey, Arindam
    [J]. INNOVATIVE INFRASTRUCTURE SOLUTIONS, 2019, 4 (01)
  • [39] Mastering the game of Go with deep neural networks and tree search
    Silver, David
    Huang, Aja
    Maddison, Chris J.
    Guez, Arthur
    Sifre, Laurent
    van den Driessche, George
    Schrittwieser, Julian
    Antonoglou, Ioannis
    Panneershelvam, Veda
    Lanctot, Marc
    Dieleman, Sander
    Grewe, Dominik
    Nham, John
    Kalchbrenner, Nal
    Sutskever, Ilya
    Lillicrap, Timothy
    Leach, Madeleine
    Kavukcuoglu, Koray
    Graepel, Thore
    Hassabis, Demis
    [J]. NATURE, 2016, 529 (7587) : 484 - +
  • [40] Stipek W., 2012, 50 YEARS NATM EXPERI