TY - GEN
T1 - Learning word embeddings via context grouping
AU - Ma, Yun
AU - Li, Qing
AU - Yang, Zhenguo
AU - Liu, Wenyin
AU - Chan, Antoni B.
PY - 2017/5/12
Y1 - 2017/5/12
N2 - Recently, neural-network based word embedding models have been shown to produce high-quality distributional representations capturing both semantic and syntactic information. In this paper, we propose a grouping-based context predictive model by considering the interactions of context words, which generalizes the widely used CBOW model and Skip-Gram model. In particular, the words within a context window are split into several groups with a grouping function, where words in the same group are combined while different groups are treated as independent. To determine the grouping function, we propose a relatedness hypothesis stating the relationship among context words and propose several context grouping methods. Experimental results demonstrate better representations can be learned with suitable context groups.
AB - Recently, neural-network based word embedding models have been shown to produce high-quality distributional representations capturing both semantic and syntactic information. In this paper, we propose a grouping-based context predictive model by considering the interactions of context words, which generalizes the widely used CBOW model and Skip-Gram model. In particular, the words within a context window are split into several groups with a grouping function, where words in the same group are combined while different groups are treated as independent. To determine the grouping function, we propose a relatedness hypothesis stating the relationship among context words and propose several context grouping methods. Experimental results demonstrate better representations can be learned with suitable context groups.
KW - Context grouping
KW - Non-parametric clustering
KW - Word embeddings
UR - http://www.scopus.com/inward/record.url?scp=85021202764&partnerID=8YFLogxK
U2 - 10.1145/3063955.3063979
DO - 10.1145/3063955.3063979
M3 - Conference article published in proceeding or book
AN - SCOPUS:85021202764
T3 - ACM International Conference Proceeding Series
BT - Proceedings of the ACM Turing 50th Celebration Conference - China, ACM TUR-C 2017
PB - Association for Computing Machinery
T2 - 50th ACM Turing Conference - China, ACM TUR-C 2017
Y2 - 12 May 2017 through 14 May 2017
ER -