TY - JOUR
T1 - On-Chain and Off-Chain Data Management for Blockchain-Internet of Things: A Multi-Agent Deep Reinforcement Learning Approach
AU - Tsang, Y. P.
AU - Lee, C. K.M.
AU - Zhang, Kening
AU - Wu, C. H.
AU - Ip, W. H.
N1 - Funding information:
The authors would like to thank the Department of Supply Chain and Information Management & Big Data Intelligence Centre, The Hang Seng University of Hong Kong, and the Department of Industrial and Systems Engineering & Research Institute for Advanced Manufacturing, The Hong Kong Polytechnic University for supporting this research study.
The work described in this paper was partly supported by a grant from Research Institute for Advanced Manufacturing, The Hong Kong Polytechnic University (Project code: CD4E) and a grant from the University Grants Committee of the HK SAR, China (RMGS Project Acc. No.: 700043).e
Publisher Copyright:
© 2024, The Author(s), under exclusive licence to Springer Nature B.V.
PY - 2024/3
Y1 - 2024/3
N2 - The emergence of blockchain technology has seen applications increasingly hybridise cloud storage and distributed ledger technology in the Internet of Things (IoT) and cyber-physical systems, complicating data management in decentralised applications (DApps). Because it is inefficient for blockchain technology to handle large amounts of data, effective on-chain and off-chain data management in peer-to-peer networks and cloud storage has drawn considerable attention. Space reservation is a cost-effective approach to managing cloud storage effectively, contrasting with the demand for additional space in real-time. Furthermore, off-chain data replication in the peer-to-peer network can eliminate single points of failure of DApps. However, recent research has rarely discussed optimising on-chain and off-chain data management in the blockchain-enabled IoT (BIoT) environment. In this study, the BIoT environment is modelled, with cloud storage and blockchain orchestrated over the peer-to-peer network. The asynchronous advantage actor-critic algorithm is applied to exploit intelligent agents with the optimal policy for data packing, space reservation, and data replication to achieve an intelligent data management strategy. The experimental analysis reveals that the proposed scheme demonstrates rapid convergence and superior performance in terms of average total reward compared with other typical schemes, resulting in enhanced scalability, security and reliability of blockchain-IoT networks, leading to an intelligent data management strategy.
AB - The emergence of blockchain technology has seen applications increasingly hybridise cloud storage and distributed ledger technology in the Internet of Things (IoT) and cyber-physical systems, complicating data management in decentralised applications (DApps). Because it is inefficient for blockchain technology to handle large amounts of data, effective on-chain and off-chain data management in peer-to-peer networks and cloud storage has drawn considerable attention. Space reservation is a cost-effective approach to managing cloud storage effectively, contrasting with the demand for additional space in real-time. Furthermore, off-chain data replication in the peer-to-peer network can eliminate single points of failure of DApps. However, recent research has rarely discussed optimising on-chain and off-chain data management in the blockchain-enabled IoT (BIoT) environment. In this study, the BIoT environment is modelled, with cloud storage and blockchain orchestrated over the peer-to-peer network. The asynchronous advantage actor-critic algorithm is applied to exploit intelligent agents with the optimal policy for data packing, space reservation, and data replication to achieve an intelligent data management strategy. The experimental analysis reveals that the proposed scheme demonstrates rapid convergence and superior performance in terms of average total reward compared with other typical schemes, resulting in enhanced scalability, security and reliability of blockchain-IoT networks, leading to an intelligent data management strategy.
KW - Asynchronous advantage actor-critic (A3C) algorithm
KW - Blockchain
KW - Data management
KW - Deep reinforcement learning
KW - Internet of Things
UR - http://www.scopus.com/inward/record.url?scp=85182679712&partnerID=8YFLogxK
U2 - 10.1007/s10723-023-09739-x
DO - 10.1007/s10723-023-09739-x
M3 - Journal article
AN - SCOPUS:85182679712
SN - 1570-7873
VL - 22
JO - Journal of Grid Computing
JF - Journal of Grid Computing
IS - 1
M1 - 16
ER -