TY - JOUR
T1 - Gaussian Process Regression for Transportation System Estimation and Prediction Problems
T2 - The Deformation and a Hat Kernel
AU - Liu, Zhiyuan
AU - Lyu, Cheng
AU - Huo, Jinbiao
AU - Wang, Shuaian
AU - Chen, Jun
N1 - Funding Information:
This work was supported in part by the Distinguished Young Scholar Project under Grant 71922007 and in part by the Key Project of the National Natural Science Foundation of China under Grant 52131203.
Publisher Copyright:
© 2000-2011 IEEE.
PY - 2022/11/1
Y1 - 2022/11/1
N2 - Gaussian process regression (GPR) is an emerging machine learning model with potential in a wide range of transportation system estimation and prediction problems, especially those where the uncertainty of estimation needs to be measured, for instance, traffic flow analysis, the transportation infrastructure performance estimation problems and transportation simulation-based optimization problems. The kernel function is the core component of GPR, and the radial basis function (RBF) kernel is the most commonly used one, suitable for tasks without special knowledge about the patterns of data, like trend and periodicity. However, an inappropriate hyperparameter of the kernel function may lead to over-fitting or under-fitting of GPR. During hyperparameter optimization, the usage of the RBF kernel often suffers from the issue of failing to find the optimal hyperparameter. This paper aims to address this problem by promoting the use of the hat kernel, which can reduce the risk of under-fitting. Moreover, we propose the notion of deformation, corresponding to severe over-fitting of a GPR. To further address this issue, we investigate the connection between deformation and the Bayesian generalization error of GPR. Two lower bounds for the hyperparameter of the hat kernel are also proposed to avoid deformation of GPR.
AB - Gaussian process regression (GPR) is an emerging machine learning model with potential in a wide range of transportation system estimation and prediction problems, especially those where the uncertainty of estimation needs to be measured, for instance, traffic flow analysis, the transportation infrastructure performance estimation problems and transportation simulation-based optimization problems. The kernel function is the core component of GPR, and the radial basis function (RBF) kernel is the most commonly used one, suitable for tasks without special knowledge about the patterns of data, like trend and periodicity. However, an inappropriate hyperparameter of the kernel function may lead to over-fitting or under-fitting of GPR. During hyperparameter optimization, the usage of the RBF kernel often suffers from the issue of failing to find the optimal hyperparameter. This paper aims to address this problem by promoting the use of the hat kernel, which can reduce the risk of under-fitting. Moreover, we propose the notion of deformation, corresponding to severe over-fitting of a GPR. To further address this issue, we investigate the connection between deformation and the Bayesian generalization error of GPR. Two lower bounds for the hyperparameter of the hat kernel are also proposed to avoid deformation of GPR.
KW - Gaussian process
KW - Hat kernel
KW - hyperparameter optimization
KW - kernel machine
KW - lower bound
UR - http://www.scopus.com/inward/record.url?scp=85131734109&partnerID=8YFLogxK
U2 - 10.1109/TITS.2022.3155527
DO - 10.1109/TITS.2022.3155527
M3 - Journal article
AN - SCOPUS:85131734109
SN - 1524-9050
VL - 23
SP - 22331
EP - 22342
JO - IEEE Transactions on Intelligent Transportation Systems
JF - IEEE Transactions on Intelligent Transportation Systems
IS - 11
ER -