Dual deep reinforcement learning agents-based integrated order acceptance and scheduling of mass individualized prototyping

Jiewu Leng, Jiwei Guo, Hu Zhang, Kailin Xu, Yan Qiao, Pai Zheng, Weiming Shen

Research output: Journal article publicationJournal articleAcademic researchpeer-review

1 Citation (Scopus)

Abstract

Coordinating order acceptance decisions with production scheduling to maximize revenue is challenging for Mass Individualized Prototyping (MIP) service providers. This paper presents a dual deep reinforcement learning agents-based (DDRLA) integrated order acceptance and scheduling (IOAS) for improving revenue. Firstly, a deep reinforcement learning-based virtual production scheduling (VPS) agent together with 8 state features and 11 action rules is designed. The VPS agent quickly and virtually reschedules a dynamically-arriving accepted order to evaluate the overall impact of accepting this order, including consumed capacity and increased revenue. Then, a deep reinforcement learning-based order acceptance decision (OAD) agent is designed. Based on the information guidance resulting from an interaction with the VPS agent, the OAD agent selectively accepts orders to maximize long-term gains, as well as to improve system resilience in the presence of a high ratio of urgent orders. The experiment results show that the proposed DDRLA method has better performance, compared with other IOAS approaches.

Original languageEnglish
Article number139249
Number of pages14
JournalJournal of Cleaner Production
Volume427
DOIs
Publication statusPublished - 15 Nov 2023

Keywords

  • Deep reinforcement learning
  • Dual deep reinforcement learning agents
  • Mass individualized prototyping
  • Order acceptance decision
  • Virtual production scheduling

ASJC Scopus subject areas

  • Renewable Energy, Sustainability and the Environment
  • General Environmental Science
  • Strategy and Management
  • Industrial and Manufacturing Engineering

Fingerprint

Dive into the research topics of 'Dual deep reinforcement learning agents-based integrated order acceptance and scheduling of mass individualized prototyping'. Together they form a unique fingerprint.

Cite this