Dynamic matching with deep reinforcement learning for a two-sided Manufacturing-as-a-Service (MaaS) marketplace

被引：8

作者：

Pahwa, Deepak ^{[1
]}

Starly, Binil ^{[1
]}

机构：

[1] North Carolina State Univ, Edward P Fitts Dept Ind & Syst Engn, 111 Lampe Dr, Raleigh, NC 27695 USA

来源：

MANUFACTURING LETTERS | 2021年 / 29卷

基金：

美国国家科学基金会;

关键词：

Cloud manufacturing; Cyber-enabled manufacturing; Resource allocation; Two-sided matching; Dynamic and stochastic knapsack problem (DSKP); Cloud based design and manufacturing (CBDM); ALLOCATION; ALGORITHM; SELECTION;

D O I：

10.1016/j.mfglet.2021.05.005

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Suppliers registered within a manufacturing-as-a-service (MaaS) marketplace require near real time decision making to accept or reject orders received on the platform. Myopic decision-making such as a first come, first serve method in this dynamic and stochastic environment can lead to suboptimal revenue generation. In this paper, this sequential decision making problem is formulated as a Markov Decision Process and solved using deep reinforcement learning (DRL). Empirical simulations demonstrate that DRL has considerably better performance compared to four baselines. This early work demonstrates a learning approach for near real-time decision making for suppliers participating in a MaaS marketplace. (C) 2021 Society of Manufacturing Engineers (SME). Published by Elsevier Ltd. All rights reserved.

引用

页码：11 / 14

页数：4

共 20 条

[1] DeepPool: Distributed Model-Free Algorithm for Ride-Sharing Using Deep Reinforcement Learning
Al-Abbasi, Abubakr O.
Ghosh, Arnob
Aggarwal, Vaneet
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (12) : 4714 - 4727
[2] Anshelevich E., 2013, 27 AAAI C ART INT
[3] Real-Time Bidding by Reinforcement Learning in Display Advertising
Cai, Han
Ren, Kan
Zhang, Weinan
Malialis, Kleanthis
Wang, Jun
Yu, Yong
Guo, Defeng
[J]. WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 661 - 670
[4] Decision support in concurrent engineering -: The utility-based selection decision support problem
Fernández, MG
Seepersad, CC
Rosen, DW
Allen, JK
Mistree, F
[J]. CONCURRENT ENGINEERING-RESEARCH AND APPLICATIONS, 2005, 13 (01): : 13 - 27
[5] A multi-objective algorithm for task scheduling and resource allocation in cloud-based disassembly
Jiang, Hui
Yi, Jianjun
Chen, Shaoli
Zhu, Xiaomin
[J]. JOURNAL OF MANUFACTURING SYSTEMS, 2016, 41 : 239 - 255
[6] Jintao K.E., 2020, IEEE Transactions on Knowledge and Data Engineering
[7] King DB, 2015, ACS SYM SER, V1214, P1
[8] The dynamic and stochastic knapsack problem
Kleywegt, AJ
Papastavrou, JD
[J]. OPERATIONS RESEARCH, 1998, 46 (01) : 17 - 35
[9] Scheduling uniform manufacturing resources via the Internet: A review
Li, Kai
Xiao, Wei
Yang, Shan-lin
[J]. JOURNAL OF MANUFACTURING SYSTEMS, 2019, 50 : 247 - 262
[10] A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning
Liu, Ning
Li, Zhe
Xu, Jielong
Xu, Zhiyuan
Lin, Sheng
Qiu, Qinru
Tang, Jian
Wang, Yanzhi
[J]. 2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 372 - 382

← 1 2 →