TY - JOUR
T1 - Communication-Efficient Edge AI: Algorithms and Systems
AU - Shi, Yuanming
AU - Yang, Kai
AU - Jiang, Tao
AU - Zhang, Jun
AU - Letaief, Khaled B.
N1 - Funding Information:
Manuscript received February 22, 2020; revised May 27, 2020; accepted July 1, 2020. Date of publication July 7, 2020; date of current version November 20, 2020. This work was supported by the National Nature Science Foundation of China under Grant 61601290. (Corresponding author: Jun Zhang.) Yuanming Shi and Tao Jiang are with the School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China (e-mail: [email protected]; [email protected]).
Publisher Copyright:
© 1998-2012 IEEE.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2020/10/1
Y1 - 2020/10/1
N2 - Artificial intelligence (AI) has achieved remarkable breakthroughs in a wide range of fields, ranging from speech processing, image classification to drug discovery. This is driven by the explosive growth of data, advances in machine learning (especially deep learning), and the easy access to powerful computing resources. Particularly, the wide scale deployment of edge devices (e.g., IoT devices) generates an unprecedented scale of data, which provides the opportunity to derive accurate models and develop various intelligent applications at the network edge. However, such enormous data cannot all be sent to the cloud for processing, due to the varying channel quality, traffic congestion and/or privacy concerns, and the enormous energy consumption. By pushing inference and training processes of AI models to edge nodes, edge AI has emerged as a promising alternative. AI at the edge requires close cooperation among edge devices, such as smart phones and smart vehicles, and edge servers at the wireless access points and base stations, which however result in heavy communication overheads. In this paper, we present a comprehensive survey of the recent developments in various techniques for overcoming these communication challenges. Specifically, we first identify key communication challenges in edge AI systems. We then introduce communication-efficient techniques, from both algorithmic and system perspectives for training and inference tasks at the network edge. Potential future research directions are also highlighted.
AB - Artificial intelligence (AI) has achieved remarkable breakthroughs in a wide range of fields, ranging from speech processing, image classification to drug discovery. This is driven by the explosive growth of data, advances in machine learning (especially deep learning), and the easy access to powerful computing resources. Particularly, the wide scale deployment of edge devices (e.g., IoT devices) generates an unprecedented scale of data, which provides the opportunity to derive accurate models and develop various intelligent applications at the network edge. However, such enormous data cannot all be sent to the cloud for processing, due to the varying channel quality, traffic congestion and/or privacy concerns, and the enormous energy consumption. By pushing inference and training processes of AI models to edge nodes, edge AI has emerged as a promising alternative. AI at the edge requires close cooperation among edge devices, such as smart phones and smart vehicles, and edge servers at the wireless access points and base stations, which however result in heavy communication overheads. In this paper, we present a comprehensive survey of the recent developments in various techniques for overcoming these communication challenges. Specifically, we first identify key communication challenges in edge AI systems. We then introduce communication-efficient techniques, from both algorithmic and system perspectives for training and inference tasks at the network edge. Potential future research directions are also highlighted.
KW - Artificial intelligence
KW - communication efficiency
KW - edge AI
KW - edge intelligence
UR - http://www.scopus.com/inward/record.url?scp=85095368450&partnerID=8YFLogxK
U2 - 10.1109/COMST.2020.3007787
DO - 10.1109/COMST.2020.3007787
M3 - Journal article
AN - SCOPUS:85095368450
SN - 1553-877X
VL - 22
SP - 2167
EP - 2191
JO - IEEE Communications Surveys and Tutorials
JF - IEEE Communications Surveys and Tutorials
IS - 4
M1 - 9134426
ER -