Discovering public sentiment in social media for predicting stock movement of publicly listed companies

Bing Li, Chun Chung Chan, Carol Ou, Sun Ruifeng

Research output: Journal article publicationJournal articleAcademic researchpeer-review

51 Citations (Scopus)


Studies have suggested that public emotions shown through Twitter could be well correlated with the Dow Jones Industrial Average. However, it remains unclear how public sentiment, as reflected on social media, can be used to predict stock price movement of a particular publicly-listed company. In this study, we attempt to fill this research void by proposing a technique, called SMeDA-SA, to mine Twitter data for sentiment analysis and then predict the stock movement of specific listed companies. For the purpose of experimentation, we collected 200 million tweets that mentioned one or more of 30 companies that were listed in NASDAQ or the New York Stock Exchange. SMeDA-SA performs its task by first extracting ambiguous textual messages from these tweets to create a list of words that reflects public sentiment. SMeDA-SA then made use of a data mining algorithm to expand the word list by adding emotional phrases so as to better classify sentiments in the tweets. With SMeDA-SA, we discover that the stock movement of many companies can be predicted rather accurately with an average accuracy over 70%. This paper describes how SMeDA-SA can be used to mine social media date for sentiments. It also presents the key implications of our study.
Original languageEnglish
Pages (from-to)81-92
Number of pages12
JournalInformation Systems
Publication statusPublished - 1 Sep 2017


  • Big data
  • Data mining
  • Parallel architecture
  • Sentiment analysis
  • SMeDA-SA
  • Social media analysis
  • Stock prediction
  • Twitter

ASJC Scopus subject areas

  • Software
  • Information Systems
  • Hardware and Architecture


Dive into the research topics of 'Discovering public sentiment in social media for predicting stock movement of publicly listed companies'. Together they form a unique fingerprint.

Cite this