TY - JOUR
T1 - Automated training dataset collection system design for machine learning application in optical networks: an example of quality of transmission estimation
AU - Lu, Jianing
AU - Fan, Qirui
AU - Zhou, Gai
AU - Lu, Linyue
AU - Yu, Changyuan
AU - Lau, Alan Pak Tao
AU - Lu, Chao
N1 - Funding Information:
Funding. National Key Research and Development Program of China (2018YFB1801701); National Natural Science Foundation of China (U1701661); Hong Kong Ph.D. Fellowship Scheme.
Publisher Copyright:
© 2009-2012 OSA.
PY - 2021/11
Y1 - 2021/11
N2 - Applications of machine learning (ML) models in optical communications and networks have been extensively investigated. For an optical wavelength-division-multiplexing (WDM) system, the quality of transmission (QoT) estimation generally depends on many parameters including the number and arrangement of WDM channels; launch power of each channel; number and distribution of fiber spans; attenuation, dispersion, and nonlinearity parameters and length of each fiber span; noise figure; gain and gain tilt of erbium-doped fiber amplifiers; transceiver noise; digital signal processing (DSP) performance; and so on. In recent years, ML-based QoT estimation schemes have gained significant attention. However, nearly all relevant works are conducted through simulations because it is difficult to obtain sufficient and high-quality datasets for training ML models. In this paper, we demonstrate completely automated generation and collection of an ultra-large-scale experimental training dataset for ML-model-based QoT estimation by automation of transceivers and optical link parameters, as well as data transfer and DSP. Implementation details and key codes of automation are presented. Artificial neural network models with one and two hidden layers are trained by the collected dataset, and brief QoT estimation results are evaluated and discussed to verify the performance and stability of the established automated system.
AB - Applications of machine learning (ML) models in optical communications and networks have been extensively investigated. For an optical wavelength-division-multiplexing (WDM) system, the quality of transmission (QoT) estimation generally depends on many parameters including the number and arrangement of WDM channels; launch power of each channel; number and distribution of fiber spans; attenuation, dispersion, and nonlinearity parameters and length of each fiber span; noise figure; gain and gain tilt of erbium-doped fiber amplifiers; transceiver noise; digital signal processing (DSP) performance; and so on. In recent years, ML-based QoT estimation schemes have gained significant attention. However, nearly all relevant works are conducted through simulations because it is difficult to obtain sufficient and high-quality datasets for training ML models. In this paper, we demonstrate completely automated generation and collection of an ultra-large-scale experimental training dataset for ML-model-based QoT estimation by automation of transceivers and optical link parameters, as well as data transfer and DSP. Implementation details and key codes of automation are presented. Artificial neural network models with one and two hidden layers are trained by the collected dataset, and brief QoT estimation results are evaluated and discussed to verify the performance and stability of the established automated system.
UR - http://www.scopus.com/inward/record.url?scp=85114318638&partnerID=8YFLogxK
U2 - 10.1364/JOCN.431780
DO - 10.1364/JOCN.431780
M3 - Journal article
AN - SCOPUS:85114318638
SN - 1943-0620
VL - 13
SP - 289
EP - 300
JO - Journal of Optical Communications and Networking
JF - Journal of Optical Communications and Networking
IS - 11
M1 - 9526654
ER -