Abstract
For sentiment classification, it is often recognized that embedding based on distributional hypothesis is weak in capturing sentiment contrast-contrasting words may have similar local context. Based on broader context, we propose to incorporate Theta Pure Dependence (TPD) into the Paragraph Vector method to reinforce topical and sentimental information. TPD has a theoretical guarantee that the word dependency is pure, i.e., the dependence pattern has the integral meaning whose underlying distribution can not be conditionally factorized. Our method outperforms the state-of-the-art performance on text classification tasks.
Original language | English |
---|---|
Title of host publication | Conference Proceedings - EMNLP 2015 |
Subtitle of host publication | Conference on Empirical Methods in Natural Language Processing |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 2551-2556 |
Number of pages | 6 |
ISBN (Electronic) | 9781941643327 |
Publication status | Published - 1 Jan 2015 |
Event | Conference on Empirical Methods in Natural Language Processing, EMNLP 2015 - Lisbon, Portugal Duration: 17 Sept 2015 → 21 Sept 2015 |
Conference
Conference | Conference on Empirical Methods in Natural Language Processing, EMNLP 2015 |
---|---|
Country/Territory | Portugal |
City | Lisbon |
Period | 17/09/15 → 21/09/15 |
ASJC Scopus subject areas
- Computational Theory and Mathematics
- Computer Science Applications
- Information Systems