TY - JOUR
T1 - On-Device Learning Systems for Edge Intelligence: A Software and Hardware Synergy Perspective
AU - Zhou, Qihua
AU - Qu, Zhihao
AU - Guo, Song
AU - Luo, Boyuan
AU - Guo, Jingcai
AU - Xu, Zhenda
AU - Akerkar, Rajendra
N1 - Funding Information:
Manuscript received September 7, 2020; revised November 21, 2020; accepted February 16, 2021. Date of publication March 2, 2021; date of current version July 23, 2021. This work was supported in part by the Hong Kong RGC Research Impact Fund (RIF) under Project R5060-19 and Project R5034-18; in part by the General Research Fund (GRF) under Project 152221/19E and Project 15220320/20E; in part by the Collaborative Research Fund (CRF) under Project C5026-18G; in part by the National Natural Science Foundation of China under Grant 61872310; in part by the Shenzhen Science and Technology Innovation Commission under Grant R2020A045; in part by the Shenzhen Basic Research Funding Scheme under Grant JCYJ20170818103849343; and in part by RCN-Diku INTPART BDEM under Grant 261685. (Corresponding authors: Song Guo; Zhihao Qu.) Qihua Zhou, Boyuan Luo, Jingcai Guo, and Zhenda Xu are with the Department of Computing, The Hong Kong Polytechnic University, Hong Kong (e-mail: csqzhou@comp.polyu.edu.hk; boyuan.luo@connect.polyu.hk; cscjguo@comp.polyu.edu.hk; jackal.xu@connect.polyu.hk).
Publisher Copyright:
© 2014 IEEE.
PY - 2021/8/1
Y1 - 2021/8/1
N2 - Modern machine learning (ML) applications are often deployed in the cloud environment to exploit the computational power of clusters. However, this in-cloud computing scheme cannot satisfy the demands of emerging edge intelligence scenarios, including providing personalized models, protecting user privacy, adapting to real-time tasks, and saving resource cost. In order to conquer the limitations of conventional in-cloud computing, there comes the rise of on-device learning, which makes the end-to-end ML procedure totally on user devices, without unnecessary involvement of the cloud. In spite of the promising advantages of on-device learning, implementing a high-performance on-device learning system still faces with many severe challenges, such as insufficient user training data, backward propagation (BP) blocking, and limited peak processing speed. Observing the substantial improvement space in the implementation and acceleration of on-device learning systems, we intend to present a comprehensive analysis of the latest research progress and point out potential optimization directions from the system perspective. This survey presents a software and hardware synergy of on-device learning techniques, covering the scope of model-level neural network design, algorithm-level training optimization, and hardware-level instruction acceleration. We hope this survey could bring fruitful discussions and inspire the researchers to further promote the field of edge intelligence.
AB - Modern machine learning (ML) applications are often deployed in the cloud environment to exploit the computational power of clusters. However, this in-cloud computing scheme cannot satisfy the demands of emerging edge intelligence scenarios, including providing personalized models, protecting user privacy, adapting to real-time tasks, and saving resource cost. In order to conquer the limitations of conventional in-cloud computing, there comes the rise of on-device learning, which makes the end-to-end ML procedure totally on user devices, without unnecessary involvement of the cloud. In spite of the promising advantages of on-device learning, implementing a high-performance on-device learning system still faces with many severe challenges, such as insufficient user training data, backward propagation (BP) blocking, and limited peak processing speed. Observing the substantial improvement space in the implementation and acceleration of on-device learning systems, we intend to present a comprehensive analysis of the latest research progress and point out potential optimization directions from the system perspective. This survey presents a software and hardware synergy of on-device learning techniques, covering the scope of model-level neural network design, algorithm-level training optimization, and hardware-level instruction acceleration. We hope this survey could bring fruitful discussions and inspire the researchers to further promote the field of edge intelligence.
KW - Hardware acceleration
KW - neural network design
KW - on-device learning
KW - system implementation
KW - training algorithm optimization
UR - http://www.scopus.com/inward/record.url?scp=85102295534&partnerID=8YFLogxK
U2 - 10.1109/JIOT.2021.3063147
DO - 10.1109/JIOT.2021.3063147
M3 - Review article
AN - SCOPUS:85102295534
SN - 2327-4662
VL - 8
SP - 11916
EP - 11934
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
IS - 15
M1 - 9366901
ER -