TY - GEN
T1 - An optical computing chip executing complex-valued neural network and its on-chip training
AU - Zhang, Hui
AU - Liu, Ai Qun
N1 - Funding Information:
This work was supported by the Singapore Ministry of Education (MOE) Tier 3 grant (MOE2017-T3-1-001), the Singapore National Research Foundation (NRF) National Natural Science Foundation of China (NSFC) joint grant (NRF2017NRF-NSFC002-014).
Publisher Copyright:
© COPYRIGHT SPIE. Downloading of the abstract is permitted for personal use only.
PY - 2021/8
Y1 - 2021/8
N2 - The optical implementation of neural networks is proposed to have advantages over electronic implementations with lower power consumption and higher computation speed. However, most optical neural networks (ONNs) utilize conventional real-valued frameworks that are designed for digital computers, forfeiting many advantages of optical computing such as efficient complex-valued operations. Complex-valued neural networks are advantageous to their real-valued counterparts by offering rich representation space, fast convergence, and strong generalizations. We propose and demonstrate an ONN that implements truly complex-valued neural networks, achieving high accuracy and strong learning capability in many benchmark tasks.1 On the other hand, efficiently training ONNs remains a formidable challenge, due to the difficulty in obtaining gradient information from a physical device. We propose an efficient on-chip training protocol for ONNs and demonstrate it by several practical tasks.2 The protocol is gradient-free and physical agnostic, and is applicable for various types of chip structures, especially those that cannot be analytically decomposed and characterized. The protocol is robust to experimental perturbations like imperfect phase detection and photodetection noise. Our results present a promising avenue towards deep complex networks with smaller chip size, stronger performance, and flexible reconfiguration to realistic applications (e.g., facial recognition, natural language processing, and autonomous vehicles).
AB - The optical implementation of neural networks is proposed to have advantages over electronic implementations with lower power consumption and higher computation speed. However, most optical neural networks (ONNs) utilize conventional real-valued frameworks that are designed for digital computers, forfeiting many advantages of optical computing such as efficient complex-valued operations. Complex-valued neural networks are advantageous to their real-valued counterparts by offering rich representation space, fast convergence, and strong generalizations. We propose and demonstrate an ONN that implements truly complex-valued neural networks, achieving high accuracy and strong learning capability in many benchmark tasks.1 On the other hand, efficiently training ONNs remains a formidable challenge, due to the difficulty in obtaining gradient information from a physical device. We propose an efficient on-chip training protocol for ONNs and demonstrate it by several practical tasks.2 The protocol is gradient-free and physical agnostic, and is applicable for various types of chip structures, especially those that cannot be analytically decomposed and characterized. The protocol is robust to experimental perturbations like imperfect phase detection and photodetection noise. Our results present a promising avenue towards deep complex networks with smaller chip size, stronger performance, and flexible reconfiguration to realistic applications (e.g., facial recognition, natural language processing, and autonomous vehicles).
KW - complex-valued neural network
KW - deep learning
KW - genetic algorithm
KW - on-chip training
KW - optical computing.
KW - Optical neural networks
UR - http://www.scopus.com/inward/record.url?scp=85117733695&partnerID=8YFLogxK
U2 - 10.1117/12.2597553
DO - 10.1117/12.2597553
M3 - Conference article published in proceeding or book
AN - SCOPUS:85117733695
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - ODS 2021
A2 - Katayama, Ryuichi
A2 - Takashima, Yuzuru
PB - SPIE
T2 - 2021 Industrial Optical Devices and Systems, ODS 2021
Y2 - 1 August 2021 through 5 August 2021
ER -