TY - JOUR
T1 - Dual deep reinforcement learning agents-based integrated order acceptance and scheduling of mass individualized prototyping
AU - Leng, Jiewu
AU - Guo, Jiwei
AU - Zhang, Hu
AU - Xu, Kailin
AU - Qiao, Yan
AU - Zheng, Pai
AU - Shen, Weiming
N1 - Funding Information:
This work was supported by the National Natural Science Foundation of China under Grant No. U20A6004 and 52075107 ; and the Natural Science Fund of Guangdong Province under Grant No. 2022B1515020006 and 2022A1515010991 .
Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2023/11/15
Y1 - 2023/11/15
N2 - Coordinating order acceptance decisions with production scheduling to maximize revenue is challenging for Mass Individualized Prototyping (MIP) service providers. This paper presents a dual deep reinforcement learning agents-based (DDRLA) integrated order acceptance and scheduling (IOAS) for improving revenue. Firstly, a deep reinforcement learning-based virtual production scheduling (VPS) agent together with 8 state features and 11 action rules is designed. The VPS agent quickly and virtually reschedules a dynamically-arriving accepted order to evaluate the overall impact of accepting this order, including consumed capacity and increased revenue. Then, a deep reinforcement learning-based order acceptance decision (OAD) agent is designed. Based on the information guidance resulting from an interaction with the VPS agent, the OAD agent selectively accepts orders to maximize long-term gains, as well as to improve system resilience in the presence of a high ratio of urgent orders. The experiment results show that the proposed DDRLA method has better performance, compared with other IOAS approaches.
AB - Coordinating order acceptance decisions with production scheduling to maximize revenue is challenging for Mass Individualized Prototyping (MIP) service providers. This paper presents a dual deep reinforcement learning agents-based (DDRLA) integrated order acceptance and scheduling (IOAS) for improving revenue. Firstly, a deep reinforcement learning-based virtual production scheduling (VPS) agent together with 8 state features and 11 action rules is designed. The VPS agent quickly and virtually reschedules a dynamically-arriving accepted order to evaluate the overall impact of accepting this order, including consumed capacity and increased revenue. Then, a deep reinforcement learning-based order acceptance decision (OAD) agent is designed. Based on the information guidance resulting from an interaction with the VPS agent, the OAD agent selectively accepts orders to maximize long-term gains, as well as to improve system resilience in the presence of a high ratio of urgent orders. The experiment results show that the proposed DDRLA method has better performance, compared with other IOAS approaches.
KW - Deep reinforcement learning
KW - Dual deep reinforcement learning agents
KW - Mass individualized prototyping
KW - Order acceptance decision
KW - Virtual production scheduling
UR - http://www.scopus.com/inward/record.url?scp=85173585768&partnerID=8YFLogxK
U2 - 10.1016/j.jclepro.2023.139249
DO - 10.1016/j.jclepro.2023.139249
M3 - Journal article
AN - SCOPUS:85173585768
SN - 0959-6526
VL - 427
JO - Journal of Cleaner Production
JF - Journal of Cleaner Production
M1 - 139249
ER -