Dynamic matching with deep reinforcement learning for a two-sided Manufacturing-as-a-Service (MaaS) marketplace

被引:8
作者
Pahwa, Deepak [1 ]
Starly, Binil [1 ]
机构
[1] North Carolina State Univ, Edward P Fitts Dept Ind & Syst Engn, 111 Lampe Dr, Raleigh, NC 27695 USA
基金
美国国家科学基金会;
关键词
Cloud manufacturing; Cyber-enabled manufacturing; Resource allocation; Two-sided matching; Dynamic and stochastic knapsack problem (DSKP); Cloud based design and manufacturing (CBDM); ALLOCATION; ALGORITHM; SELECTION;
D O I
10.1016/j.mfglet.2021.05.005
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Suppliers registered within a manufacturing-as-a-service (MaaS) marketplace require near real time decision making to accept or reject orders received on the platform. Myopic decision-making such as a first come, first serve method in this dynamic and stochastic environment can lead to suboptimal revenue generation. In this paper, this sequential decision making problem is formulated as a Markov Decision Process and solved using deep reinforcement learning (DRL). Empirical simulations demonstrate that DRL has considerably better performance compared to four baselines. This early work demonstrates a learning approach for near real-time decision making for suppliers participating in a MaaS marketplace. (C) 2021 Society of Manufacturing Engineers (SME). Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:11 / 14
页数:4
相关论文
共 20 条
  • [1] DeepPool: Distributed Model-Free Algorithm for Ride-Sharing Using Deep Reinforcement Learning
    Al-Abbasi, Abubakr O.
    Ghosh, Arnob
    Aggarwal, Vaneet
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (12) : 4714 - 4727
  • [2] Anshelevich E., 2013, 27 AAAI C ART INT
  • [3] Real-Time Bidding by Reinforcement Learning in Display Advertising
    Cai, Han
    Ren, Kan
    Zhang, Weinan
    Malialis, Kleanthis
    Wang, Jun
    Yu, Yong
    Guo, Defeng
    [J]. WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 661 - 670
  • [4] Decision support in concurrent engineering -: The utility-based selection decision support problem
    Fernández, MG
    Seepersad, CC
    Rosen, DW
    Allen, JK
    Mistree, F
    [J]. CONCURRENT ENGINEERING-RESEARCH AND APPLICATIONS, 2005, 13 (01): : 13 - 27
  • [5] A multi-objective algorithm for task scheduling and resource allocation in cloud-based disassembly
    Jiang, Hui
    Yi, Jianjun
    Chen, Shaoli
    Zhu, Xiaomin
    [J]. JOURNAL OF MANUFACTURING SYSTEMS, 2016, 41 : 239 - 255
  • [6] Jintao K.E., 2020, IEEE Transactions on Knowledge and Data Engineering
  • [7] King DB, 2015, ACS SYM SER, V1214, P1
  • [8] The dynamic and stochastic knapsack problem
    Kleywegt, AJ
    Papastavrou, JD
    [J]. OPERATIONS RESEARCH, 1998, 46 (01) : 17 - 35
  • [9] Scheduling uniform manufacturing resources via the Internet: A review
    Li, Kai
    Xiao, Wei
    Yang, Shan-lin
    [J]. JOURNAL OF MANUFACTURING SYSTEMS, 2019, 50 : 247 - 262
  • [10] A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning
    Liu, Ning
    Li, Zhe
    Xu, Jielong
    Xu, Zhiyuan
    Lin, Sheng
    Qiu, Qinru
    Tang, Jian
    Wang, Yanzhi
    [J]. 2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 372 - 382