TY - JOUR
T1 - Network Embedding via Coupled Kernelized Multi-Dimensional Array Factorization
AU - Xu, Linchuan
AU - Cao, Jiannong
AU - Wei, Xiaokai
AU - Yu, Philip S.
N1 - Funding Information:
The work described in this paper was partially supported by the National Key R&D Program of China, Project No: 2018YFB1004801, Hong Kong RGC Collaborative Research Fund (CRF), Project No. C5026-18G, Hong Kong RGC Collaborative Research Fund (CRF), Project No. C6030-18G, and NSF through grants III-1526499, III-1763325, III-1909323, SaTC1930941, and CNS-1626432.
Publisher Copyright:
© 1989-2012 IEEE.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2020/12/1
Y1 - 2020/12/1
N2 - Network embedding has been widely employed in networked data mining applications as it can learn low-dimensional and dense node representations from the high-dimensional and sparse network structure. While most existing network embedding methods only model the proximity between two nodes regardless of the order of the proximity, this paper proposes to explicitly model multi-node proximities which can be widely observed in practice, e.g., multiple researchers coauthor a paper, and multiple genes co-express a protein. Explicitly modeling multi-node proximities is important because some two-node interactions may not come into existence without a third node. By proving that LINE(1st), a recent network embedding method, is equivalent to kernelized matrix factorization, this paper proposes coupled kernelized multi-dimensional array factorization (Cetera) which jointly factorizes multiple multi-dimensional arrays by enforcing a consensus representation for each node. In this way, node representations can be more comprehensive and effective, which is demonstrated on three real-world networks through link prediction and multi-label classification.
AB - Network embedding has been widely employed in networked data mining applications as it can learn low-dimensional and dense node representations from the high-dimensional and sparse network structure. While most existing network embedding methods only model the proximity between two nodes regardless of the order of the proximity, this paper proposes to explicitly model multi-node proximities which can be widely observed in practice, e.g., multiple researchers coauthor a paper, and multiple genes co-express a protein. Explicitly modeling multi-node proximities is important because some two-node interactions may not come into existence without a third node. By proving that LINE(1st), a recent network embedding method, is equivalent to kernelized matrix factorization, this paper proposes coupled kernelized multi-dimensional array factorization (Cetera) which jointly factorizes multiple multi-dimensional arrays by enforcing a consensus representation for each node. In this way, node representations can be more comprehensive and effective, which is demonstrated on three real-world networks through link prediction and multi-label classification.
KW - kernelized array factorization
KW - link prediction
KW - multi-label classification
KW - Network embedding
UR - http://www.scopus.com/inward/record.url?scp=85096175343&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2019.2931833
DO - 10.1109/TKDE.2019.2931833
M3 - Journal article
AN - SCOPUS:85096175343
SN - 1041-4347
VL - 32
SP - 2414
EP - 2425
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 12
M1 - 8781900
ER -