TY - GEN
T1 - Discovering K web user groups with specific aspect interests
AU - Si, Jianfeng
AU - Li, Qing
AU - Qian, Tieyun
AU - Deng, Xiaotie
PY - 2012/8/17
Y1 - 2012/8/17
N2 - Online review analysis becomes a hot research topic recently. Most of the existing works focus on the problems of review summarization, aspect identification or opinion mining from an item's point of view such as the quality and popularity of products. Considering the fact that authors of these review texts may pay different attentions to different domain-based product aspects with respect to their own interests, in this paper, we aim to learn K user groups with specific aspect interests indicated by their review writings. Such K user groups' identification can facilitate better understanding of customers' interests which are crucial for application like product improvement on customer-oriented design or diverse marketing strategies. Instead of using a traditional text clustering approach, we treat the clusterId as a hidden variable and use a permutation-based structural topic model called KMM. Through this model, we infer K groups' distribution by discovering not only the frequency of reviewers' product aspects, but also the occurrence priority of respective aspects. Our experiment on several real-world review datasets demonstrates a competitive solution.
AB - Online review analysis becomes a hot research topic recently. Most of the existing works focus on the problems of review summarization, aspect identification or opinion mining from an item's point of view such as the quality and popularity of products. Considering the fact that authors of these review texts may pay different attentions to different domain-based product aspects with respect to their own interests, in this paper, we aim to learn K user groups with specific aspect interests indicated by their review writings. Such K user groups' identification can facilitate better understanding of customers' interests which are crucial for application like product improvement on customer-oriented design or diverse marketing strategies. Instead of using a traditional text clustering approach, we treat the clusterId as a hidden variable and use a permutation-based structural topic model called KMM. Through this model, we infer K groups' distribution by discovering not only the frequency of reviewers' product aspects, but also the occurrence priority of respective aspects. Our experiment on several real-world review datasets demonstrates a competitive solution.
UR - https://www.scopus.com/pages/publications/84864950167
U2 - 10.1007/978-3-642-31537-4_25
DO - 10.1007/978-3-642-31537-4_25
M3 - Conference article published in proceeding or book
AN - SCOPUS:84864950167
SN - 9783642315367
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 321
EP - 335
BT - Machine Learning and Data Mining in Pattern Recognition - 8th International Conference, MLDM 2012, Proceedings
T2 - 8th International Conference on Machine Learning and Data Mining in Pattern Recognition, MLDM 2012
Y2 - 13 July 2012 through 20 July 2012
ER -