TY - JOUR
T1 - Multimodal Spatio-Temporal Prediction with Stochastic Adversarial Networks
AU - Saxena, Divya
AU - Cao, Jiannong
N1 - Funding Information:
This work is supported by RGC Theme-based Research Scheme (Grant no: T41-603/20-R), RGC Collaborative Research Fund (Grant no: C5026-18G), and PolyU Internal Start-up Fund (Grant no: P0038876). Authors’ addresses: D. Saxena, Department of Computing, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong; email: [email protected]; J. CAO, Department of Computing and UBDA, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong; email: [email protected]. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only. © 2022 Association for Computing Machinery. 2157-6904/2022/01-ART18 $15.00 https://doi.org/10.1145/3458025
Publisher Copyright:
© 2022 Association for Computing Machinery.
PY - 2022/4
Y1 - 2022/4
N2 - Spatio-temporal (ST) data is a collection of multiple time series data with different spatial locations and is inherently stochastic and unpredictable. An accurate prediction over such data is an important building block for several urban applications, such as taxi demand prediction, traffic flow prediction, and so on. Existing deep learning based approaches assume that outcome is deterministic and there is only one plausible future; therefore, cannot capture the multimodal nature of future contents and dynamics. In addition, existing approaches learn spatial and temporal data separately as they assume weak correlation between them. To handle these issues, in this article, we propose a stochastic spatio-temporal generative model (named D-GAN) which adopts Generative Adversarial Networks (GANs)-based structure for more accurate ST prediction in multiple time steps. D-GAN consists of two components: (1) spatio-temporal correlation network which models spatio-temporal joint distribution of pixels and supports a stochastic sampling of latent variables for multiple plausible futures; (2) a stochastic adversarial network to jointly learn generation and variational inference of data through implicit distribution modeling. D-GAN also supports fusion of external factors through explicit objective to improve the model learning. Extensive experiments performed on two real-world datasets show that D-GAN achieves significant improvements and outperforms baseline models.
AB - Spatio-temporal (ST) data is a collection of multiple time series data with different spatial locations and is inherently stochastic and unpredictable. An accurate prediction over such data is an important building block for several urban applications, such as taxi demand prediction, traffic flow prediction, and so on. Existing deep learning based approaches assume that outcome is deterministic and there is only one plausible future; therefore, cannot capture the multimodal nature of future contents and dynamics. In addition, existing approaches learn spatial and temporal data separately as they assume weak correlation between them. To handle these issues, in this article, we propose a stochastic spatio-temporal generative model (named D-GAN) which adopts Generative Adversarial Networks (GANs)-based structure for more accurate ST prediction in multiple time steps. D-GAN consists of two components: (1) spatio-temporal correlation network which models spatio-temporal joint distribution of pixels and supports a stochastic sampling of latent variables for multiple plausible futures; (2) a stochastic adversarial network to jointly learn generation and variational inference of data through implicit distribution modeling. D-GAN also supports fusion of external factors through explicit objective to improve the model learning. Extensive experiments performed on two real-world datasets show that D-GAN achieves significant improvements and outperforms baseline models.
KW - deep learning
KW - Generative adversarial networks
KW - spatio-temporal prediction
UR - http://www.scopus.com/inward/record.url?scp=85129511291&partnerID=8YFLogxK
U2 - 10.1145/3458025
DO - 10.1145/3458025
M3 - Journal article
AN - SCOPUS:85129511291
SN - 2157-6904
VL - 13
SP - 1
EP - 22
JO - ACM Transactions on Intelligent Systems and Technology
JF - ACM Transactions on Intelligent Systems and Technology
IS - 2
M1 - 18
ER -