Abstract
Speech acts provide good insights into the communicative behavior of tweeters on Twitter. This paper is mainly concerned with speech act recognition in Twitter as a multi-class classification problem, for which we propose a set of word-based and character-based features. Inexpensive, robust and efficient, our method achieves an average Fl score of nearly 0.7 with the existence of much noise in our annotated Twitter data. In view of the deficiency of training data for the task, we experimented extensively with different configurations of training and test data, leading to empirical findings that may provide valuable reference for building benchmark datasets for sustained research on speech act recognition in Twitter.
Original language | English |
---|---|
Title of host publication | Analyzing Microtext - Papers from the 2011 AAAI Workshop, Technical Report |
Pages | 86-91 |
Number of pages | 6 |
Volume | WS-11-05 |
Publication status | Published - 31 Oct 2011 |
Event | 2011 AAAI Workshop - San Francisco, CA, United States Duration: 8 Aug 2011 → 8 Aug 2011 |
Conference
Conference | 2011 AAAI Workshop |
---|---|
Country/Territory | United States |
City | San Francisco, CA |
Period | 8/08/11 → 8/08/11 |
ASJC Scopus subject areas
- General Engineering