TY - GEN
T1 - Learning multimodal word representation via dynamic fusion methods
AU - Wang, Shaonan
AU - Zhang, Jiajun
AU - Zong, Chengqing
N1 - Publisher Copyright:
Copyright © 2018, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2018/4/26
Y1 - 2018/4/26
N2 - Multimodal models have been proven to outperform text-based models on learning semantic word representations. Almost all previous multimodal models typically treat the representations from different modalities equally. However, it is obvious that information from different modalities contributes differently to the meaning of words. This motivates us to build a multimodal model that can dynamically fuse the semantic representations from different modalities according to different types of words. To that end, we propose three novel dynamic fusion methods to assign importance weights to each modality, in which weights are learned under the weak supervision of word association pairs. The extensive experiments have demonstrated that the proposed methods outperform strong unimodal baselines and state-of-the-art multimodal models.
AB - Multimodal models have been proven to outperform text-based models on learning semantic word representations. Almost all previous multimodal models typically treat the representations from different modalities equally. However, it is obvious that information from different modalities contributes differently to the meaning of words. This motivates us to build a multimodal model that can dynamically fuse the semantic representations from different modalities according to different types of words. To that end, we propose three novel dynamic fusion methods to assign importance weights to each modality, in which weights are learned under the weak supervision of word association pairs. The extensive experiments have demonstrated that the proposed methods outperform strong unimodal baselines and state-of-the-art multimodal models.
UR - https://www.scopus.com/pages/publications/85060470057
U2 - 10.1609/aaai.v32i1.12031
DO - 10.1609/aaai.v32i1.12031
M3 - Conference article published in proceeding or book
AN - SCOPUS:85060470057
VL - 32
T3 - 32nd AAAI Conference on Artificial Intelligence, AAAI 2018
SP - 5973
EP - 5980
BT - Proceedings of the AAAI Conference on Artificial Intelligence
PB - AAAI press
T2 - 32nd AAAI Conference on Artificial Intelligence, AAAI 2018
Y2 - 2 February 2018 through 7 February 2018
ER -