TY - GEN
T1 - AutoEmb: Automated Embedding Dimensionality Search in Streaming Recommendations
AU - Zhaok, Xiangyu
AU - Liu, Haochen
AU - Fan, Wenqi
AU - Liu, Hui
AU - Tang, Jiliang
AU - Wang, Chong
AU - Chen, Ming
AU - Zheng, Xudong
AU - Liu, Xiaobing
AU - Yang, Xiwang
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Deep learning-based recommender systems (DLRSs) often have embedding layers, which are utilized to lessen the dimension of categorical variables (e.g., user/item identifiers) and meaningfully transform them in the low-dimensional space. The majority of existing DLRSs empirically pre-define a fixed and unified dimension for all user/item embeddings. It is evident from recent researches that different embedding sizes are highly desired for different users/items according to their frequency. However, manually selecting embedding sizes in recommender systems can be very challenging due to a large number of users/items and the dynamic nature of their frequency. Thus, in this paper, we propose an AutoML based end-to-end framework (AutoEmb), enabling various embedding dimensions according to the frequency in an automated and dynamic manner. To be specific, we first enhance a typical DLRS to allow various embedding dimensions; then, we propose an end-to-end differentiable framework that can automatically select different embedding dimensions according to user/item frequency; finally, we propose an AutoML based optimization algorithm in a streaming recommendation setting. The experimental results based on widely used benchmark datasets demonstrate the effectiveness of the AutoEmb framework.
AB - Deep learning-based recommender systems (DLRSs) often have embedding layers, which are utilized to lessen the dimension of categorical variables (e.g., user/item identifiers) and meaningfully transform them in the low-dimensional space. The majority of existing DLRSs empirically pre-define a fixed and unified dimension for all user/item embeddings. It is evident from recent researches that different embedding sizes are highly desired for different users/items according to their frequency. However, manually selecting embedding sizes in recommender systems can be very challenging due to a large number of users/items and the dynamic nature of their frequency. Thus, in this paper, we propose an AutoML based end-to-end framework (AutoEmb), enabling various embedding dimensions according to the frequency in an automated and dynamic manner. To be specific, we first enhance a typical DLRS to allow various embedding dimensions; then, we propose an end-to-end differentiable framework that can automatically select different embedding dimensions according to user/item frequency; finally, we propose an AutoML based optimization algorithm in a streaming recommendation setting. The experimental results based on widely used benchmark datasets demonstrate the effectiveness of the AutoEmb framework.
UR - http://www.scopus.com/inward/record.url?scp=85125192643&partnerID=8YFLogxK
U2 - 10.1109/ICDM51629.2021.00101
DO - 10.1109/ICDM51629.2021.00101
M3 - Conference article published in proceeding or book
AN - SCOPUS:85125192643
T3 - Proceedings - IEEE International Conference on Data Mining, ICDM
SP - 896
EP - 905
BT - Proceedings - 21st IEEE International Conference on Data Mining, ICDM 2021
A2 - Bailey, James
A2 - Miettinen, Pauli
A2 - Koh, Yun Sing
A2 - Tao, Dacheng
A2 - Wu, Xindong
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 21st IEEE International Conference on Data Mining, ICDM 2021
Y2 - 7 December 2021 through 10 December 2021
ER -