TY - JOUR
T1 - Efficient On-Chip Training of Optical Neural Networks Using Genetic Algorithm
AU - Zhang, Hui
AU - Thompson, Jayne
AU - Gu, Mile
AU - Jiang, Xu Dong
AU - Cai, Hong
AU - Liu, Patricia Yang
AU - Shi, Yuzhi
AU - Zhang, Yi
AU - Karim, Muhammad Faeyz
AU - Lo, Guo Qiang
AU - Luo, Xianshu
AU - Dong, Bin
AU - Kwek, Leong Chuan
AU - Liu, Ai Qun
N1 - Funding Information:
This work was supported by the Singapore Ministry of Education (MOE) Tier 3 Grant (MOE2017-T3-1-001), the Singapore National Research Foundation (NRF) National Natural Science Foundation of China (NSFC) Joint Grant (NRF2017NRF-NSFC002-014), the Singapore National Research Foundation under the Competitive Research Program (NRF-CRP13-2014-01), and the NRF Fellowship Reference No. NRF-NRFF2016-02. (checked)
Publisher Copyright:
©
PY - 2021/6/16
Y1 - 2021/6/16
N2 - Recent advances in silicon photonic chips have made huge progress in optical computing owing to their flexibility in the reconfiguration of various tasks. Its deployment of neural networks serves as an alternative for mitigating the rapidly increased demand for computing resources in electronic platforms. However, it remains a formidable challenge to train the online programmable optical neural networks efficiently, being restricted by the difficulty in obtaining gradient information on a physical device when executing a gradient descent algorithm. Here, we experimentally demonstrate an efficient, physics-agnostic, and closed-loop protocol for training optical neural networks on chip. A gradient-free algorithm, that is, the genetic algorithm, is adopted. The protocol is on-chip implementable, physical agnostic (no need to rely on characterization and offline modeling), and gradient-free. The protocol works for various types of chip structures and is especially helpful to those that cannot be analytically decomposed and characterized. We confirm its viability using several practical tasks, including the crossbar switch and the Iris classification. Finally, by comparing our physics-agonistic and gradient-free method to the off-chip and gradient-based training methods, we demonstrate the robustness of our system to perturbations such as imperfect phase implementation and photodetection noise. Optical processors with gradient-free genetic algorithms have broad application potentials in pattern recognition, reinforcement learning, quantum computing, and realistic applications (such as facial recognition, natural language processing, and autonomous vehicles).
AB - Recent advances in silicon photonic chips have made huge progress in optical computing owing to their flexibility in the reconfiguration of various tasks. Its deployment of neural networks serves as an alternative for mitigating the rapidly increased demand for computing resources in electronic platforms. However, it remains a formidable challenge to train the online programmable optical neural networks efficiently, being restricted by the difficulty in obtaining gradient information on a physical device when executing a gradient descent algorithm. Here, we experimentally demonstrate an efficient, physics-agnostic, and closed-loop protocol for training optical neural networks on chip. A gradient-free algorithm, that is, the genetic algorithm, is adopted. The protocol is on-chip implementable, physical agnostic (no need to rely on characterization and offline modeling), and gradient-free. The protocol works for various types of chip structures and is especially helpful to those that cannot be analytically decomposed and characterized. We confirm its viability using several practical tasks, including the crossbar switch and the Iris classification. Finally, by comparing our physics-agonistic and gradient-free method to the off-chip and gradient-based training methods, we demonstrate the robustness of our system to perturbations such as imperfect phase implementation and photodetection noise. Optical processors with gradient-free genetic algorithms have broad application potentials in pattern recognition, reinforcement learning, quantum computing, and realistic applications (such as facial recognition, natural language processing, and autonomous vehicles).
KW - deep learning, optical computing
KW - genetic algorithm
KW - gradient-free
KW - on-chip training
KW - optical neural networks
UR - http://www.scopus.com/inward/record.url?scp=85105074519&partnerID=8YFLogxK
U2 - 10.1021/acsphotonics.1c00035
DO - 10.1021/acsphotonics.1c00035
M3 - Journal article
AN - SCOPUS:85105074519
SN - 2330-4022
VL - 8
SP - 1662
EP - 1672
JO - ACS Photonics
JF - ACS Photonics
IS - 6
ER -