traders jlbollen #twitter mood predicts the #DJIA FTW!

Similar documents
Twitter mood predicts the stock market

STUDYING MOOD VARIATIONS IN LONGITUDINAL TWITTER TIMELINES

How To Predict Stock Price With Mood Based Models

QUANTIFYING THE EFFECTS OF ONLINE BULLISHNESS ON INTERNATIONAL FINANCIAL MARKETS

Sentiment Analysis. D. Skrepetos 1. University of Waterloo. NLP Presenation, 06/17/2015

Twitter mood predicts the stock market.

Stock Prediction Using Twitter Sentiment Analysis

CSE 598 Project Report: Comparison of Sentiment Aggregation Techniques

Sentiment analysis on tweets in a financial domain

The Influence of Sentimental Analysis on Corporate Event Study

Sentiment Analysis of Twitter Feeds for the Prediction of Stock Market Movement

Can Twitter provide enough information for predicting the stock market?

CS 229, Autumn 2011 Modeling the Stock Market Using Twitter Sentiment Analysis

Using Tweets to Predict the Stock Market

Italian Journal of Accounting and Economia Aziendale. International Area. Year CXIV n. 1, 2 e 3

Tweets Miner for Stock Market Analysis

AT&T Global Network Client for Windows Product Support Matrix January 29, 2015

The Viability of StockTwits and Google Trends to Predict the Stock Market. By Chris Loughlin and Erik Harnisch

Using Twitter as a source of information for stock market prediction

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS

Analysis of Tweets for Prediction of Indian Stock Markets

COMPARISON OF FIXED & VARIABLE RATES (25 YEARS) CHARTERED BANK ADMINISTERED INTEREST RATES - PRIME BUSINESS*

COMPARISON OF FIXED & VARIABLE RATES (25 YEARS) CHARTERED BANK ADMINISTERED INTEREST RATES - PRIME BUSINESS*

The impact of social media is pervasive. It has

The process of gathering and analyzing Twitter data to predict stock returns EC115. Economics

Predicting Stock Market Fluctuations. from Twitter

Sentiment Analysis and Time Series with Twitter Introduction

Big Data and High Quality Sentiment Analysis for Stock Trading and Business Intelligence. Dr. Sulkhan Metreveli Leo Keller

Case 2:08-cv ABC-E Document 1-4 Filed 04/15/2008 Page 1 of 138. Exhibit 8

On the Predictability of Stock Market Behavior using StockTwits Sentiment and Posting Volume

Nowcasting the Bitcoin Market with Twitter Signals

Using Text and Data Mining Techniques to extract Stock Market Sentiment from Live News Streams

Qi Liu Rutgers Business School ISACA New York 2013

Enhanced Vessel Traffic Management System Booking Slots Available and Vessels Booked per Day From 12-JAN-2016 To 30-JUN-2017

Social Intelligence Report ADOBE DIGITAL INDEX Q4 2013

Resource Management Spreadsheet Capabilities. Stuart Dixon Resource Manager

Twitter Analytics for Insider Trading Fraud Detection

Predicting Stock Market Indicators Through Twitter I hope it is not as bad as I fear

Text and data analytics for social network mining

The Hollywood Stock Exchange: Efficiency and The Power of Twitter

Analysis One Code Desc. Transaction Amount. Fiscal Period

BT Retail Social Media making it easy for our customers

Combining Social Data and Semantic Content Analysis for L Aquila Social Urban Network

Measuring the Return on Marketing Investment

Big Data, Official Statistics and Social Science Research: Emerging Data Challenges

A Comparative Study on the Performance of ULIPs Offered by the Selected Insurance Companies-A Study in Indian Capital Markets

Using Data Mining for Mobile Communication Clustering and Characterization

Market will worry about demand later Weekly Corn Review for May 11, 2016 By Bryce Knorr

Sentiment Analysis on Twitter with Stock Price and Significant Keyword Correlation. Abstract

P/T 2B: 2 nd Half of Term (8 weeks) Start: 24-AUG-2015 End: 18-OCT-2015 Start: 19-OCT-2015 End: 13-DEC-2015

Multilanguage sentiment-analysis of Twitter data on the example of Swiss politicians

SEO Presentation. Asenyo Inc.

Forecasting stock markets with Twitter

5 Point Social Media Action Plan.

Prediction of Stock Market Shift using Sentiment Analysis of Twitter Feeds, Clustering and Ranking

ENERGY STAR for Data Centers

Market Assessment & Campaign SLA Calculator LOGO WE OPEN THE DOOR, SO YOU CAN CLOSE IT.

Big Data how it changes the way you treat data

Initial Report. Predicting association football match outcomes using social media and existing knowledge.

Sentiment Analysis on Big Data

the beginner s guide to SOCIAL MEDIA METRICS

International Conference PRSSA meeting

Social Media Analytics

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

Award Winning Education

Overview. October Investment Portfolios & Products. Approved for public distribution. Investment Advisory Services

Ashley Institute of Training Schedule of VET Tuition Fees 2015

28 Creative Ways Teachers Are Using Twitter - Best Colleges O...

CUSTOMER FEEDBACK INDEX

Social Media as a Leading Indicator of Markets and Predictor of Voting Patterns

FOR RELEASE: MONDAY, SEPTEMBER 10 AT 4 PM

Text Analytics The three-minute guide

adjust reports The Undead App Store A 2014 retrospective report on App Store performance

Transcription:

Introduction Methods Results Conclusions traders jlbollen #twitter mood predicts the #DJIA FTW! jbollen@indiana.edu, huinmao@indiana.edu School of Informatics and Computing Center for Complex Networks and Systems Research Indiana University May 24, 2011

Introduction Methods Results Conclusions So what s the story? We upload paper to arxiv in October 2010 Johan Bollen, Huina Mao, and Xiao-Jun Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2(1), March 2011, Pages 1-8 3 days later:

Introduction Methods Results Conclusions So what s the story? We upload paper to arxiv in October 2010 Johan Bollen, Huina Mao, and Xiao-Jun Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2(1), March 2011, Pages 1-8 3 days later: msnbc...

Introduction Methods Results Conclusions So what s the story? We upload paper to arxiv in October 2010 Johan Bollen, Huina Mao, and Xiao-Jun Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2(1), March 2011, Pages 1-8 3 days later: msnbc... bloomberg...

Introduction Methods Results Conclusions So what s the story? We upload paper to arxiv in October 2010 Johan Bollen, Huina Mao, and Xiao-Jun Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2(1), March 2011, Pages 1-8 3 days later: msnbc... bloomberg... cnn...

Introduction Methods Results Conclusions So what s the story? We upload paper to arxiv in October 2010 Johan Bollen, Huina Mao, and Xiao-Jun Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2(1), March 2011, Pages 1-8 3 days later: msnbc... cnn... bloomberg... 2 months later

Introduction Methods Results Conclusions How did we get here from there? Computational social science at Indiana University: Do societies experience varying mood states like individuals? If so, can we assess such mood states from online materials and determine its socio-economic correlates?

Introduction Methods Results Conclusions How did we get here from there? Computational social science at Indiana University: Do societies experience varying mood states like individuals? If so, can we assess such mood states from online materials and determine its socio-economic correlates?

Introduction Methods Results Conclusions How did we get here from there? Computational social science at Indiana University: Do societies experience varying mood states like individuals? If so, can we assess such mood states from online materials and determine its socio-economic correlates?

Introduction Methods Results Conclusions 2009-2011: ongoing research on public sentiment in social media feeds Huina Mao, Alberto Pepe, and Johan Bollen. Structure and evolution of mood contagion in the Twitter social network. Proceedings of the International Sunbelt Social Network Conference XXX, Riva del Garda, Italy, July 2010. Johan Bollen, Huina Mao, and Alberto Pepe. Determining the public mood state by analysis of microblogging posts. Proceedings of the Proc. of the Alife XII Conference, Odense, Denmark, MIT Press, August 2010. (Reviewer-selected for Plenary Presentation) Johan Bollen, Alberto Pepe, and Huina Mao. Modeling public mood and emotion: Twitter sentiment and socio-economic phenomena. ICWSM11, Barcelona, Spain, July 2011 (arxiv: 0911.1583) - poster.

Introduction Methods Results Conclusions Public mood analysis: lots of squiggly lines... Artifact or reality?

Introduction Methods Results Conclusions Public mood analysis: lots of squiggly lines... Artifact or reality? +2sd elated/depressed 2sd +2sd agreeable/hostile 2sd

Introduction Methods Results Conclusions Cross-validating to broad socio-economic indices... DJIA z-score 2 1 0-1 -2 bank bail-out 2 1 0-1 -2 Calm z-score 2 DJIA z-score 1 0-1 -2 2 Calm z-score 1 0-1 -2 Aug 09 Aug 29 Sep 18 Oct 08 Oct 28 Prediction accuracy: 86.7%?!

Introduction Methods Results Conclusions Paul Hawtin of Derwent Capital Markets: For years, investors have widely accepted that financial markets are driven by fear and greed but we ve never before had the technology or data to be able to quantify human emotion. This is the 4th dimension.

Introduction Methods Results Conclusions Outline 1 Introduction Microblogging: canary in a coal mine Sentiment analysis 2 Methods Data Sentiment tracking instrument 3 Results Case-studies DJIA 4 Conclusions Discussion Literature

Introduction Methods Results Conclusions Microblogging: canary in a coal mine Sentiment analysis Outline 1 Introduction Microblogging: canary in a coal mine Sentiment analysis 2 Methods Data Sentiment tracking instrument 3 Results Case-studies DJIA 4 Conclusions Discussion Literature

Introduction Methods Results Conclusions Microblogging: canary in a coal mine Sentiment analysis Microblogging: casu Twitter! tweets and updates users broadcast brief text updates to the public or to a limited group of contacts: 140 characters or less Twitter, Facebook, Myspace Examples Our Rights from Creator (h/t @JLocke). Life, Liberty, PoH FTW! Your transgressions = FAIL. GTFO, @GeorgeIII. -HANCOCK et al. at work feeling lousy

Introduction Methods Results Conclusions Microblogging: canary in a coal mine Sentiment analysis Analyzing the chatter Twitter: +100M tweets per day, +150M users > many industrialized nations Predicting the present Mapping online traffic provides real-time information which translates to real-world outcomes Box office receipts from Twitter chatter: Asur (2010) Google trends: flu (verbal autopsies) Predicting consumer behavior from search query volume (Goel, 2010) Contagion of Loneliness and happiness in social networks (Cacioppo, 2010 - Bollen, 2011)

Introduction Methods Results Conclusions Microblogging: canary in a coal mine Sentiment analysis Extracting sentiment indicators from text Happy tweets. So...nothing quite feels like a good shower, shave and haircut...love it My beautiful friend. i love you sweet smile and your amazing soul i am very happy. People in Chicago loved my conference. Love you, my sweet friends @anonymous thanks for your follow I am following you back, great group amazing people Unhappy tweets. She doesn t deserve the tears but i cry them anyway I m sick and my body decides to attack my face and make me break out!! WTF :( I think my headphones are electrocuting me. My mom almost killed me this morning. I don t know how much longer i can be here. Different Approaches: Natural Language processing (n-grams) for reviews (Nasukawa, 2003), topics (Yi, 2003), Support Vector Machines: text classification (positive vs. negative) using pre-classified learning sets: Gamon (2004), Pang (2008), Blogs, web sites: mixed approaches. Mishne (2006), Balog (2006), Gruhl (2005),...

Introduction Methods Results Conclusions Microblogging: canary in a coal mine Sentiment analysis Sentiment and mood analysis is difficult for tweets Individual tweets Length: 140 characters, lack of text content Diversity: no standardized training sets, dimensions of mood? Lack of topic specificity Public mood from tweet collections and other microblog contents? We Feel Fine http://www.wefeelfine.org/ Moodviews http://moodviews.com Myspace: Thelwall (2009), FB: United States Gross National Happiness http://apps.facebook.com/usa_gnh/, Michael Jackson (Kim, 2009)

Introduction Methods Results Conclusions Microblogging: canary in a coal mine Sentiment analysis Ratio of emotional tweets, over time. ratio of # tweets with mood expressions over all tweets % mood expressions 2 3 4 5 6 7 8 9 residual (%) 1.5 0.0 1.0 probability 0.0 0.4 0.8 Aug 08 Oct 08 Dec 08 1.5 0.5 0.5 residual (%) Aug 08 Sep 08 Oct 08 Nov 08 Dec 08 Ratio of tweets containing mood expressions vs. all tweets on a given day, including residuals from trendline.

Introduction Methods Results Conclusions Microblogging: canary in a coal mine Sentiment analysis What we did: Trends in general public mood from a large-scale collection of tweets Each tweet= patient taking psychometric instrument for mood assessment

Introduction Methods Results Conclusions Microblogging: canary in a coal mine Sentiment analysis What we did: Trends in general public mood from a large-scale collection of tweets Each tweet= patient taking psychometric instrument for mood assessment Large-scale collection of tweets: 10M, 2006-2008

Introduction Methods Results Conclusions Microblogging: canary in a coal mine Sentiment analysis What we did: Trends in general public mood from a large-scale collection of tweets Each tweet= patient taking psychometric instrument for mood assessment Large-scale collection of tweets: 10M, 2006-2008 Daily public mood assessment: Time series depicting fluctuations of public mood

Introduction Methods Results Conclusions Microblogging: canary in a coal mine Sentiment analysis What we did: Trends in general public mood from a large-scale collection of tweets Each tweet= patient taking psychometric instrument for mood assessment Large-scale collection of tweets: 10M, 2006-2008 Daily public mood assessment: Time series depicting fluctuations of public mood Correlations to socio-economic indicators?

Introduction Methods Results Conclusions Data Sentiment tracking instrument Outline 1 Introduction Microblogging: canary in a coal mine Sentiment analysis 2 Methods Data Sentiment tracking instrument 3 Results Case-studies DJIA 4 Conclusions Discussion Literature

Introduction Methods Results Conclusions Data Sentiment tracking instrument Data sets Collection of tweets: April 29, 2006 to December 20, 2008 2.7M users Subset: August 1, 2008 to December 2008-9,664,952 tweets log(n tweets) 2e+02 1e+03 5e+03 2e+04 1e+05 Aug 1 Sep 1 Oct 1 Nov 1 Dec 1 Dec 20 2008

Introduction Methods Results Conclusions Data Sentiment tracking instrument Each tweet: ID date-time type text 1 2008-11-28 web Getting ready for Black Friday. Sleeping out at Circuit City or Walmart not 02:35:48 2 2008-11-28 02:35:48 web sure which. So cold out. @anonymous I didn t know I had an uncle named Bob :-P I am going to be checking out the new Flip sometime soon

Introduction Methods Results Conclusions Data Sentiment tracking instrument Mood assessment tool Definition Uses model derived from existing psychometric instrument (40 years of practice). Maps the content of Tweet to 6 dimensions of human mood. Uses ancient magic (just kidding). calm alert sure vital kind happy Tool built in-house, beyond mere term matching, learns from the web, lots of behind the scenes processing, continuous development.

Introduction Methods Results Conclusions Data Sentiment tracking instrument Tweet: I am so not bored. way too busy! I feel really great!

Introduction Methods Results Conclusions Data Sentiment tracking instrument Tweet: I am so not bored. way too busy! I feel really great! composed/anxious clearheaded/confused confident/unsure energetic/tired agreeable/hostile elated/depressed 0.01725 0.05125 0.725625 0.666625 0.361 0.53175

Introduction Methods Results Conclusions Data Sentiment tracking instrument Aggregating daily tweets into a mood time series Twitter feed text analysis ~ Mood indicators (daily) Calm Happy Confident...

Introduction Methods Results Conclusions Case-studies DJIA Outline 1 Introduction Microblogging: canary in a coal mine Sentiment analysis 2 Methods Data Sentiment tracking instrument 3 Results Case-studies DJIA 4 Conclusions Discussion Literature

Introduction Methods Results Conclusions Case-studies DJIA Public mood trends: overview +2sd elated/depressed 2sd +2sd agreeable/hostile 2sd +2sd energetic/tired 2sd +2sd confident/unsure 2sd +2sd clearheaded/confused 2sd +2sd composed/anxious 2sd Election08 Thanksgiving08 08/01 09/01 10/01 11/01 12/01 12/20

Introduction Methods Results Conclusions Case-studies DJIA Case study 1: November 4th, 2008 - the presidential election +2sd elated/depressed 2sd +2sd agreeable/hostile 2sd +2sd energetic/tired 2sd +2sd confident/unsure 2sd +2sd clearheaded/confused 2sd +2sd composed/anxious 2sd Election08 10/20 11/04 11/19

Introduction Methods Results Conclusions Case-studies DJIA TFIDF scoring of tweet terms 2008 U.S. Presidential Election Nov 03 Nov 04 Nov 05 robocal poll histori business plumber won voter result barack cleanser absente prop grandmoth ballot speech russert turnout result socialist barack president-elect halloween citizen hologram acknowledg joe victori race thoughtfulli ecstat Table: Top 10 TF-IDF ranking terms 1 day before, on and 1 day after election day.

Introduction Methods Results Conclusions Case-studies DJIA Case study 2: November 27th, 2008 - Thanksgiving +2sd elated/depressed 2sd +2sd agreeable/hostile 2sd +2sd energetic/tired 2sd +2sd confident/unsure 2sd +2sd clearheaded/confused 2sd +2sd composed/anxious 2sd Thanksgiving08 11/12 11/27 12/12 Figure: Sparklines Johan Bollen (IU) forand public Huina Mao mood (IU) before, tradersduring jlbollen #twitter and mood afterpredicts Thanksgiving the #DJIA FTW!

Introduction Methods Results Conclusions Case-studies DJIA TerraMood:World Mood Analysis from Twitter

Introduction Methods Results Conclusions Case-studies DJIA Outline 1 Introduction Microblogging: canary in a coal mine Sentiment analysis 2 Methods Data Sentiment tracking instrument 3 Results Case-studies DJIA 4 Conclusions Discussion Literature

Introduction Methods Results Conclusions Case-studies DJIA Comparison to DJIA DJIA daily closing value (March 2008 December 2008 13000 12000 11000 10000 9000 8000 Mar Apr May Jun Jul Aug Sep Oct Nov Dec 2008

Introduction Methods Results Conclusions Case-studies DJIA Comparison to DJIA 1 -n (lag) text analysis Twitter feed ~ Mood indicators (daily) (1) OpinionFinder Granger causality F-statistic p-value 2 (2) G-POMS (6 dim.) (3) DJIA DJIA ~ normalization Stock market (daily) t-1 t-2 t-3 t=0 value SOFNN predicted value MAPE Direction % 3 Figure: Methodological diagram outlining use of Granger causality analysis and Self-Organizing Fuzzy Neural Network to predict daily DJIA values from (1) past DJIA values at t 1, t 2, t 3, and various permutations of Twitter mood values (OpinionFinder and GPOMS).

Introduction Methods Results Conclusions Case-studies DJIA bivariate-causal analysis: DJIA vs. public mood Table: Calm (X 1 ), Alert (X 2 ),Sure (X 3 ), Vital (X 4 ), Kind (X 5 ), Happy (X 6 ) lag X OF X 1 X 2 X 3 X 4 X 5 X 6 1 0.703 0.080 0.521 0.422 0.679 0.712 0.300 2 0.633 0.004 0.777 0.828 0.996 0.935 0.697 3 0.928 0.009 0.920 0.563 0.897 0.995 0.652 4 0.657 0.03 0.54 0.61 0.87 0.78 0.68 5 0.235 0.053 0.753 0.703 0.246 0.837 0.05

Introduction Methods Results Conclusions Case-studies DJIA Calm vs. DJIA DJIA z-score 2 1 0-1 -2 bank bail-out 2 1 0-1 -2 Calm z-score 2 DJIA z-score 1 0-1 -2 2 Calm z-score 1 0-1 -2 Aug 09 Aug 29 Sep 18 Oct 08 Oct 28

Introduction Methods Results Conclusions Case-studies DJIA Table: DJIA Daily Prediction Using SOFNN Evaluation I OF I 0 I 1 I 1,2 I 1,3 I 1,4 I 1,5 I 1,6 MAPE (%) 1.95 1.94 1.83 2.03 2.13 2.05 1.85 1.79 Direction (%) 73.3 73.3 86.7 60.0 46.7 60.0 73.3 80.0

Introduction Methods Results Conclusions Discussion Literature Outline 1 Introduction Microblogging: canary in a coal mine Sentiment analysis 2 Methods Data Sentiment tracking instrument 3 Results Case-studies DJIA 4 Conclusions Discussion Literature

Introduction Methods Results Conclusions Discussion Literature Discussion Fear vs. greed Social media offers window into public mood states Potential to predict market behavior? Results need to be proven in real trading, not just DJIA. Commodities, FOREX?

Introduction Methods Results Conclusions Discussion Literature Discussion Fear vs. greed Social media offers window into public mood states Potential to predict market behavior? Results need to be proven in real trading, not just DJIA. Commodities, FOREX? Some issues: Time period chosen for analysis: Fall 2008 = worst case scenario Short time period for test: subsequent analysis confirms prediction accuracy Lack of historical data: prediction is at scale of 1-4 days, not decades Algorithmic trading vs. hybrid (expert insight)? The crowd is not clairvoyant! Gaming: 100M tweets/day, 150M users!

Introduction Methods Results Conclusions Discussion Literature References Johan Bollen, Huina Mao, and Xiao-Jun Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2(1), March 2011, Pages 1-8, doi:10.1016/j.jocs.2010.12.007, arxiv: abs/1010.3003. Johan Bollen, Alberto Pepe, and Huina Mao. Modeling public mood and emotion: Twitter sentiment and socio-economic phenomena. ICWSM11, Barcelona, Spain, July 2011 (arxiv: 0911.1583) Johan Bollen, Bruno Gonalves, Guangchen Ruan and Huina Mao. Happiness is assortative in online social networks. Artificial Life, In Press, Spring 2011 (arxiv:1103.0784)

Introduction Methods Results Conclusions Discussion Literature THANK YOU! Johan Bollen jbollen@indiana.edu