Twitter mood predicts the stock market.
|
|
|
- Cecily Wood
- 10 years ago
- Views:
Transcription
1 Twitter mood predicts the stock market. Johan Bollen,,Huina Mao,,Xiao-Jun Zeng. : authors made equal contributions. arxiv: v [cs.ce] 4 Oct 00 Abstract Behavioral economics tells us that emotions can profoundly affect individual behavior and decision-making. Does this also apply to societies at large, i.e. can societies experience mood states that affect their collective decision making? By extension is the public mood correlated or even predictive of economic indicators? Here we investigate whether measurements of collective mood states derived from large-scale Twitter feeds are correlated to the value of the Dow Jones Industrial Average (DJIA) over time. We analyze the text content of daily Twitter feeds by two mood tracking tools, namely OpinionFinder that measures positive vs. negative mood and Google-Profile of Mood States (GPOMS) that measures mood in terms of 6 dimensions (Calm, Alert, Sure, Vital, Kind, and Happy). We cross-validate the resulting mood time series by comparing their ability to detect the public s response to the presidential election and Thanksgiving day in 008. A Granger causality analysis and a Self-Organizing Fuzzy Neural Network are then used to investigate the hypothesis that public mood states, as measured by the OpinionFinder and GPOMS mood time series, are predictive of changes in DJIA closing values. Our results indicate that the accuracy of DJIA predictions can be significantly improved by the inclusion of specific public mood dimensions but not others. We find an accuracy of 87.6% in predicting the daily up and down changes in the closing values of the DJIA and a reduction of the Mean Average Percentage Error by more than 6%. Index Terms stock market prediction twitter mood analysis. I. INTRODUCTION STOCK market prediction has attracted much attention from academia as well as business. But can the stock market really be predicted? Early research on stock market prediction [], [], [3] was based on random walk theory and the Efficient Market Hypothesis (EMH) [4]. According to the EMH stock market prices are largely driven by new information, i.e. news, rather than present and past prices. Since news is unpredictable, stock market prices will follow a random walk pattern and cannot be predicted with more than 50 percent accuracy [5]. There are two problems with EMH. First, numerous studies show that stock market prices do not follow a random walk and can indeed to some degree be predicted [5], [6], [7], [8] thereby calling into question EMH s basic assumptions. Second, recent research suggests that news may be unpredictable but that very early indicators can be extracted from online social media (blogs, Twitter feeds, etc) to predict changes in various economic and commercial indicators. This may conceivably also be the case for the stock market. For example, [] shows how online chat activity predicts book sales. [] uses assessments of blog sentiment to predict movie sales. [5] predict future product sales using a Probabilistic Latent Semantic Analysis (PLSA) model to extract indicators of sentiment from blogs. In addition, Google search queries have been shown to provide early indicators of disease infection rates and consumer spending [4]. [9] investigates the relations between breaking financial news and stock price changes. Most recently [3] provide a ground-breaking demonstration of how public sentiment related to movies, as expressed on Twitter, can actually predict box office receipts. Although news most certainly influences stock market prices, public mood states or sentiment may play an equally important role. We know from psychological research that emotions, in addition to information, play an significant role in human decision-making [6], [8], [39]. Behavioral finance has provided further proof that financial decisions are significantly driven by emotion and mood [9]. It is therefore reasonable to assume that the public mood and sentiment can drive stock market values as much as news. This is supported by recent research by [0] who extract an indicator of public anxiety from LiveJournal posts and investigate whether its variations can predict S&P500 values. However, if it is our goal to study how public mood influences the stock markets, we need reliable, scalable and early assessments of the public mood at a time-scale and resolution appropriate for practical stock market prediction. Large surveys of public mood over representative samples of the population are generally expensive and time-consuming to conduct, cf. Gallup s opinion polls and various consumer and well-being indices. Some have therefore proposed indirect assessment of public mood or sentiment from the results of soccer games [0] and from weather conditions []. The accuracy of these methods is however limited by the low degree to which the chosen indicators are expected to be correlated with public mood. Over the past 5 years significant progress has been made in sentiment tracking techniques that extract indicators of public mood directly from social media content such as blog content [0], [], [5], [7] and in particular large-scale Twitter feeds []. Although each so-called tweet, i.e. an individual user post, is limited to only 40 characters, the aggregate of millions of tweets submitted to Twitter at any given time may provide an accurate representation of public mood and sentiment. This has led to the development of realtime sentiment-tracking indicators such as [7] and Pulse of Nation. In this paper we investigate whether public sentiment, as expressed in large-scale collections of daily Twitter posts, can be used to predict the stock market. We use two tools to measure variations in the public mood from tweets submitted
2 to the Twitter service from February 8, 008 to December 9, 008. The first tool, OpinionFinder, analyses the text content of tweets submitted on a given day to provide a positive vs. negative daily time series of public mood. The second tool, GPOMS, similarly analyses the text content of tweets to generate a six-dimensional daily time series of public mood to provide a more detailed view of changes in public along a variety of different mood dimensions. The resulting public mood time series are correlated to the Dow Jones Industrial Average (DJIA) to assess their ability to predict changes in the DJIA over time. Our results indicate that the prediction accuracy of standard stock market prediction models is significantly improved when certain mood dimensions are included, but not others. In particular variations along the public mood dimensions of Calm and Happiness as measured by GPOMS seem to have a predictive effect, but not general happiness as measured by the OpinionFinder tool. II. RESULTS A. Data and methods overview We obtained a collection of public tweets that was recorded from February 8 to December 9th, 008 (9,853,498 tweets posted by approximately.7m users). For each tweet these records provide a tweet identifier, the date-time of the submission (GMT+0), its submission type, and the text content of the Tweet which is by design limited to 40 characters. After removal of stop-words and punctuation, we group all tweets that were submitted on the same date. We only take into account tweets that contain explicit statements of their author s mood states, i.e. those that match the expressions i feel, i am feeling, i m feeling, i dont feel, I m, Im, I am, and makes me. In order to avoid spam messages and other information-oriented tweets, we also filter out tweets that match the regular expressions http: or www. As shown in Fig. we then proceed in three phases. In the first phase, we subject the collections of daily tweets to mood assessment tools: () OpinionFinder which measures positive vs. negative mood from text content, and () GPOMS which measures 6 different mood dimensions from text content. This results in a total of 7 public mood time series, one generated by OpinionFinder and six generated by GPOMS, each representing a potentially different aspect of the public s mood on a given day. In addition, we extract a time series of daily DJIA closing-values from Yahoo! Finance. In the second phase, we investigate the hypothesis that public mood as measured by GPOMS and OpinionFinder is predictive of future DJIA values. We use a Granger causality analysis in which we correlate DJIA values to GPOMs and OF values of the past n days. In the third phase, we deploy a Self- Organizing Fuzzy Neural Network model to test the hypothesis that the prediction accuracy of DJIA prediction models can be improved by including measurements of public mood. We are not interested in proposing an optimal DJIA prediction model, but to assess the effects of including public mood information on the accuracy of a baseline prediction model. Methodology text analysis Twitter feed ~ DJIA ~ Data sets and timeline feb8 008 Mood indicators (daily) () OpinionFinder () G-POMS (6 dim.) (3) DJIA normalization Stock market (daily) t- t- t-3 t=0 value () Granger Causality analysis -n (lag) Granger causality SOFNN F-statistic p-value predicted value MAPE Direction % () OF ~ GPOMS (3) SOFNN training test apr may jun jul aug sep oct nov dec dec0 008 Fig.. Diagram outlining 3 phases of methodology and corresponding data sets: () creation and validation of OpinionFinder and GPOMS public mood time series from October 008 to December 008 (Presidential Election and Thanksgiving), () use of Granger causality analysis to determine correlation between DJIA, OpinionFinder and GPOMS public mood from August 008 to December 008, and (3) training of a Self-Organizing Fuzzy Neural Network to predict DJIA values on the basis of various combinations of past DJIA values and OF and GPOMS public mood data from March 008 to December 008. B. Generating public mood time series: OpinionFinder and GPOMS OpinionFinder (OF) is a publicly available software package for sentiment analysis that can be applied to determine sentence-level subjectivity [5], i.e. to identify the emotional polarity (positive or negative) of sentences. It has been successfully used to analyze the emotional content of large collections of tweets [6] by using the OF lexicon to determine the ratio of positive versus negative tweets on a given day. The resulting time series were shown to correlate with the Consumer Confidence Index from Gallup 3 and the Reuters/University of Michigan Surveys of Consumers 4 over a given period of time. We adopt OF s subjective lexicon that has been established upon previous work [37], [38], [4]. We select positive and negative words that are marked as either weak and strong from the OF sentiment lexicon resulting in a list of 78 positive and 49 negative words. For each tweet we determine whether it contains any number of negative and positive terms from the OF lexicon. For each occurrence we increase the score of either negative or positive tweets by and calculate the ratio of positive vs. negative messages for the tweets posted on the same day t. Like many sentiment analysis tools OF adheres to a unidimensional model of mood, making binary distinctions between positive and negative sentiment [3]. This may however ignore the rich, multi-dimensional structure of human mood. To capture additional dimensions of public mood we created a second mood analysis tools, labeled GPOMS, that can measure human mood states in terms of 6 different mood dimensions, namely Calm, Alert, Sure, Vital, Kind and Happy. GPOMS
3 3 mood dimensions and lexicon are derived from an existing and well-vetted psychometric instrument, namely the Profile of Mood States (POMS-bi)[3], [33]. To make it applicable to Twitter mood analysis we expanded the original 7 terms of the POMS questionnaire to a lexicon of 964 associated terms by analyzing word co-occurrences in a collection of.5 billion 4- and 5-grams 5 computed by Google in 006 from approximately trillion word tokens observed in publicly accessible Webpages [35], [36]. The enlarged lexicon of 964 terms thus allows GPOMS to capture a much wider variety of naturally occurring mood terms in Tweets and map them to their respective POMS mood dimensions. We match the terms used in each tweet against this lexicon. Each tweet term that matches an n-gram term is mapped back to its original POMS terms (in accordance with its co-occurence weight) and via the POMS scoring table to its respective POMS dimension. The score of each POMS mood dimension is thus determined as the weighted sum of the co-occurence weights of each tweet term that matched the GPOMS lexicon. All data sets and methods are available on our project web site 6. To enable the comparison of OF and GPOMS time series we normalize them to z-scores on the basis of a local mean and standard deviation within a sliding window of k days before and after the particular date. For example, the z-score of time series X t, denoted Z Xt, is defined as: Z Xt = X t x(x t±k ) σ(x t±k ) where x(x t±k ) and σ(d t±k ) represent the mean and standard deviation of the time series within the period [t k, t+k]. This normalization causes all time series to fluctuate around a zero mean and be expressed on a scale of standard deviation. C. Cross-validating OF and GPOMS time series against large socio-cultural events We first validate the ability of OF and GPOMS to capture various aspects of public mood. To do so we apply them to tweets posted in a 3-month period from October 5, 008 to December 5, 008. This period was chosen specifically because it includes several socio-cultural events that may have had a unique, significant and complex effect on public mood namely the U.S presidential election (November 4, 008) and Thanksgiving (November 7, 008). The OF and GPOMS measurements can therefore be cross-validated against the expected emotional responses to these events. The resulting mood time series are shown in Fig. and are expressed in z-scores as given by in Eq.. Fig. shows that the OF successfully identifies the public s emotional response to the Presidential election on November 4th and Thanksgiving on November 7th. In both cases OF marks a significant, but short-lived uptick in positive sentiment specific to those days. The GPOMS results reveal a more differentiated public mood response to the events in the three-day period surrounding the election day (November 4, 008). November 3, 008 is 5 n-grams are frequently occurring sequences of terms in text of length n, for example we are the robots could be a frequent 4-gram. 6 () characterized by a significant drop in Calm indicating highly elevated levels of public anxiety. Election Day itself is characterized by a reversal of Calm scores indicating a significant reduction in public anxiety, in conjunction with a significant increases of Vital, Happy as well as Kind scores. The latter indicates a public that is energized, happy and friendly on election day. On November 5, these GPOMS dimensions continue to indicate positive mood levels, in particular high levels of Calm, Sure, Vital and Happy. After November 5, all mood dimensions gradually return to the baseline. The public mood response to Thanksgiving on November 7, 008 provides a counterpart to the differentiated response to the Presidential election. On Thanksgiving day we find a spike in Happy values, indicating high levels of public happiness. However, no other mood dimensions are elevated on November 7. Furthermore, the spike in Happy values is limited to the one day, i.e. we find no significant mood response the day before or after Thanksgiving. z-scores OpinionFinder CALM ALERT SURE VITAL KIND HAPPY day after election pre- election anxiety election results pre! election energy Thanksgiving Thanksgiving happiness Oct Oct 9 Nov 05 Nov Nov 9 Nov 6 Fig.. Tracking public mood states from tweets posted between October 008 to December 008 shows public responses to presidential election and thanksgiving. A visual comparison of Fig. suggests that GPOMS Happy dimension best approximates the mood trend provided by OpinionFinder. To quantitatively determine the relations between GPOMS s mood dimensions and the OF mood trends, we test the correlation between the trend obtained from OF lexicon and the six dimensions of GPOMS using multiple regression. The regression model is shown in Eq.. Y OF = α + n β i X i + t () where X, X, X 3, X 4, X 5 and X 6 represent the mood time series obtained from the 6 GPOMS dimensions, respectively i
4 4 TABLE I MULTIPLE REGRESSION RESULTS FOR OPINIONFINDER VS. 6GPOMS MOOD DIMENSIONS. Parameters Coeff. Std.Err. t p Calm (X ) Alert (X ) Sure (X 3 ) e-08 Vital (X 4 ) Kind (X 5 ) Happy (X 6 ) e-05 Summary Residual Std.Err Adj.R F 6,55 p e-3 (p-value < 0.00:, p-value < 0.05:, p-value < 0.: ) Calm, Alert, Sure, Vital, Kind and Happy. The multiple linear regression results are provided in Table I (coefficient and p-values), and indicate that Y OF is significantly correlated with X 3 (Sure), X 4 (Vital) and X 6 (Happy), but not with X (Calm), X (Alert) and X 5 (Kind). We therefore conclude that certain GPOMS mood dimension partially overlap with the mood values provided by OpinionFinder, but not necessarily all mood dimensions that may be important in describing the various components of public mood e.g. the varied mood response to the Presidential election. The GPOMS thus provides a unique perspective on public mood states not captured by uni-dimensional tools such as OpinionFinder. D. Bivariate Granger Causality Analysis of Mood vs. DJIA prices After establishing that our mood time series responds to significant socio-cultural events such as the Presidential election and Thanksgiving, we are concerned with the question whether other variations of the public s mood state correlate with changes in the stock market, in particular DJIA closing values. To answer this question, we apply the econometric technique of Granger causality analysis to the daily time series produced by GPOMS and OpinionFinder vs. the DJIA. Granger causality analysis rests on the assumption that if a variable X causes Y then changes in X will systematically occur before changes in Y. We will thus find that the lagged values of X will exhibit a statistically significant correlation with Y. Correlation however does not prove causation. We therefore use Granger causality analysis in a similar fashion to [0]; we are not testing actual causation but whether one time series has predictive information about the other or not 7. Our DJIA time series, denoted D t, is defined to reflect daily changes in stock market value, i.e. its values are the delta between day t and day t : D t = DJIA t DJIA t.to test whether our mood time series predicts changes in stock market values we compare the variance explained by two linear models as shown in Eq. 3 and Eq. 4. The first model (L ) uses only n lagged values of D t, i.e. (D t,,d t n ) for prediction, while the second model L uses the n lagged values of both D t and the GPOMS plus the OpinionFinder mood time series denoted X t,,x t n. 7 [0] uses only one mood index, namely Anxiety, but we investigate the relation between DJIA values and all Twitter mood dimensions measured by GPOMS and OpinionFinder We perform the Granger causality analysis according to model L and L shown in Eq. 3 and 4 for the period of time between February 8 to November 3, 008 to exclude the exceptional public mood response to the Presidential Election and Thanksgiving from the comparison. GPOMS and OpinionFinder time series were produced for 34,55 tweets in that period, and the daily Dow Jones Industrial Average (DJIA) was retrieved from Yahoo! Finance for each day 8. L : D t = α + L : D t = α + n β i D t i + t (3) i= n β i D t i + i= n γ i X t i + t (4) i= Based on the results of our Granger causality (shown in Table II), we can reject the null hypothesis that the mood time series do not predict DJIA values, i.e. β {,,,n} =0with a high level of confidence. However, this result only applies to GPOMS mood dimension. We observe that X (i.e. Calm) has the highest Granger causality relation with DJIA for lags ranging from to 6 days (p-values < 0.05). The other four mood dimensions of GPOMS do not have significant causal relations with changes in the stock market, and neither does the OpinionFinder time series. To visualize the correlation between X and the DJIA in more detail, we plot both time series in Fig. 3. To maintain the same scale, we convert the DJIA delta values D t and mood index value X t to z-scores as shown in Eq.. DJIA z-score DJIA z-score Calm z-score bank bail-out Aug 09 Aug 9 Sep 8 Oct 08 Oct 8 Fig. 3. A panel of three graphs. The top graph shows the overlap of the day-to-day difference of DJIA values (blue: Z Dt ) with the GPOMS Calm time series (red: Z Xt ) that has been lagged by 3 days. Where the two graphs overlap the Calm time series predict changes in the DJIA closing values that occur 3 days later. Areas of significant congruence are marked by gray areas. The middle and bottom graphs show the separate DJIA and GPOMS Calm time series. As can be seen in Fig. 3 both time series frequently overlap or point in the same direction. Changes in past values of Calm (t 3 ) predicts a similar rise or fall in DJIA values (t = 8 Our DJIA time series has no values for weekends and holidays because trading is suspended during those days. We do not linearly extropolate to fill the gaps. This results in a time series of 64 days Calm z-score
5 5 TABLE II STATISTICAL SIGNIFICANCE (P-VALUES) OF BIVARIATE GRANGER-CAUSALITY CORRELATION BETWEEN MOODS AND DJIA IN PERIOD FEBRUARY 8, 008 TO NOVEMBER 3, 008. Lag OF Calm Alert Sure Vital Kind Happy day days days days days days days (p-value < 0.05:, p-value < 0.: ) 0. The Calm mood dimension thus has predictive value with regards to the DJIA. In fact the p-value for this shorter period, i.e. August, 008 to October , is significantly lower (lag n =3, p =0.009) than that listed in Table II for the period February 8, 008 to November 3, 008. The cases in which the t 3 mood time series fails to track changes in the DJIA are nearly equally informative as where it doesn t. In particular we point to a significant deviation between the two graphs on October 3th where the DJIA surges by more than 3 standard deviations trough-topeak. The Calm curve however remains relatively flat at that time after which it starts to again track changes in the DJIA again. This discrepancy may be the result of the the Federal Reserve s announcement on October 3th of a major bank bailout initiative which unexpectedly increase DJIA values that day. The deviation between Calm values and the DJIA on that day illustrates that unexpected news is not anticipated by the public mood yet remains a significant factor in modeling the stock market. E. Non-linear models for emotion-based stock prediction Our Granger causality analysis suggests a predictive relation between certain mood dimensions and DJIA. However, Granger causality analysis is based on linear regression whereas the relation between public mood and stock market values is almost certainly non-linear. To better address these non-linear effects and assess the contribution that public mood assessments can make in predictive models of DJIA values, we compare the performance of a Self-organizing Fuzzy Neural Network (SOFNN) model [30] that predicts DJIA values on the basis of two sets of inputs: () the past 3 days of DJIA values, and () the same combined with various permutations of our mood time series (explained below). Statistically significant performance differences will allow us to either confirm or reject the null hypothesis that public mood measurement do not improve predictive models of DJIA values. We use a SOFNN as our prediction model since they have previously been used to decode nonlinear time series data which describe the characteristics of the stock market [8] and predict its values [9]. Our SOFNN in particular is a fivelayer hybrid neural network with the ability to self-organize its own neurons in the learning process. A similar organization has been successfully used for electricial load forecasting in our previous work [3]. To predict the DJIA value on day t, the input attributes of our SOFNN include combinations of DJIA values and mood values of the past n days. We choose n = 3 since the results shown in Table II indicate that past n = 4 the Granger causal relation between Calm and DJIA decreases significantly. All historical load values are linearly scaled to [0,]. This procedure causes every input variable be treated with similar importance since they are processed within a uniform range. SOFNN models require the tuning of a number of parameters that can influence the performance of the model. We maintain the same parameter values across our various input combinations to allow an unbiased comparison of model performance, namely δ = 0.04,σ = 0.0,k rmse = 0.05,k d (i), (i =,...,r)=0. where r is the dimension of input variables and k rmse is the expected training root mean squared error which is a predefined value. To properly evaluate the SOFNN model s ability to predict daily DJIA prices, we extend the period under consideration to February 8, 008 to December 9, 008 for training and testing. February 8, 008 to November 8, 008 is chosen as the longest possible training period while Dec to Dec 9, 008 was chosen as the test period because it was characterized by stabilization of DJIA values after considerable volatility in previous months and the absence of any unusual or significant socio-cultural events. Fig. 4 shows that the Fall of 008 is an unusual period for the DJIA due to a sudden dramatic decline of stock prices. This variability may in fact render stock market prediction more difficult than in other periods DJIA daily closing value (March 008 December 008 Mar Apr May Jun Jul Aug Sep Oct Nov Dec 008 Fig. 4. Daily Dow Jones Industrial Average values between February 8, 008 and December 9, 008. The Granger causality analysis indicates that only Calm (and to some degree Happy) is Granger-causative of DJIA values. However, the other mood dimensions could still contain predictive information of DJIA values when combined
6 6 with other mood dimensions. For example, Happy may not be independently linearly related with DJIA, but it may nevertheless improve the SOFNN prediction accuracy when combined with Calm. To clarify these questions, we investigate seven permutations of input variables to the SOFNN model, the first of which, denoted I 0, represents a naive, baseline model that has been trained to predict DJIA values at time t from the historical values at time {t,t,t 3} : I 0 = {DJIA t 3,, } I = {DJIA t 3,,, X,t 3,, } I, = {DJIA t 3,,, X,t 3,,, X,t 3,, } I,3 = {DJIA t 3,,, X,t 3,,, X 3,t 3,, } DJIA t 3,, represents the DJIA values and X,t 3,, represents the values of the GPOMS mood dimension, at time t 3, t, and t. According to the same notation I,3, I,4, I,5, I,6 represent a combination of historical DJIA with mood dimensions 3, 4, 5 and 6 at time t 3, t, and t. For example, I,6 represents a set of inputs that includes the DJIA values t 3, t, and t, and mood dimensions and 6 at the same times. In order to compare the performance of the GPOMS mood data with the positive/negative sentiment values given by OpinionFinder, we additionally define the input combination: I OF = {DJIA t 3,,, X OF,t 3,, } Forecasting accuracy is measured in terms of the average Mean Absolute Percentage Error (MAPE) and the direction accuracy (up or down) during the test period (December to December 9, 008). The prediction results are shown in Table III. We can draw several conclusions from these results. First, adding positive/negative sentiment obtained from OF (I OF ) has no effect on prediction accuracy compared to using only historical DJIA values(i 0 ). This confirms the results of our Granger causuality analysis. Second, adding Calm, i.e. input I, we find the highest prediction accuracy. Compared to I 0 and all other input combinations, adding input I leads to significant improvements in MAPE values (.83% vs. the maximum of.3% and.95% for I OF ) and direction accuracy (86.7% compared to 73.3% for I OF and 46.7% for I,3 ). Thirdly, I,3 and I,4 actually reduce prediction accuracy significantly both in terms of MAPE and direction %, most likely because X 3 (Sure) and X 4 (Vital) do not contain information that is useful for prediction of DJIA values causing prediction accuracy to regress to chance levels. It is notable that I,6, i.e. a combination of X 6 and X does significantly reduce average MAPE, and provides good direction accuracy (80%). This is surprising since X 6 (i.e. Happy) does not have a good Granger causality relation with DJIA at a lag of n = 3 days (see Table II, p =0.73). However in combination with Calm, it produces a more accurate SOFNN prediction (MAPE=.79%) and direction accuracy (80%). To assess the statistical significance of the SOFNN achieving the above mentioned accuracy of 87.6% in predicting the up and down movement of the DJIA we calculate the odds of this result occurring by chance. The binomial distribution indicates that the probability of achieving exactly 87.6% correct guesses over 5 trials (0 days minus weekends) with a 50% chance of success on each single trial equals 0.3%. Taken over the entire length of our data set (February 8 to December 0, excluding weekends) we find approximately 0.9 of such 0 day periods. The odds that the mentioned probability would hold by chance for a random period of 0 days within that period is then estimated to be ( 0.003) 0.9 = or 3.4%. The SOFNN direction accuracy is thus most likely not the result of chance nor our selecting a specifically favorable test period. In addition, we test the linear effect of both Calm(X ) and Happy (X 6 ) on DJIA, with a nested F test between the full model F and reduced model R shown as follows: F : R : D t = α + n β i D t i + 3 γ i X,t i + 3 γ i X 6,t i + t i= i= i= D t = α + n β i D t i + 3 γ i X,t i + t i= i= We find a p-value of 0.66 and an F-statistic of 0.53 indicating that a linear combination of X and X 6 produces worse results than X alone. Since the SOFNN prediction is more accurate when using a combination of X and X 6, we conclude that this confirms a nonlinear relation among the different dimensions of moods. III. DISCUSSION In this paper, we investigate whether public mood as measured from large-scale collection of tweets posted on twitter.com is correlated or even predictive of DJIA values. Our results show that changes in the public mood state can indeed be tracked from the content of large-scale Twitter feeds by means of rather simple text processing techniques and that such changes respond to a variety of socio-cultural drivers in a highly differentiated manner. Among the 7 observed mood dimensions only some are Granger causative of the DJIA; changes of the public mood along these mood dimensions match shifts in the DJIA values that occur 3 to 4 days later. Surprisingly we do not observe this effect for OpinionFinder s assessment of public mood states in terms of positive vs. negative mood but rather for the GPOMS dimension labeled Calm. The calmness of the public (measured by GPOMS) is thus predictive of the DJIA rather than general levels of positive sentiment as measured by OpinionFinder. A Self- Organizing Fuzzy Neural Network trained on the basis of past DJIA values and our public mood time series furthermore demonstrated the ability of the latter to significantly improve the accuracy of even the most basic models to predict DJIA closing values. Given the performance increase for a relatively basic model such as the SOFNN we are hopeful to find equal or better improvements for more sophisticated market models that may in fact include other information derived from news
7 7 TABLE III DJIA DAILY PREDICTION USING SOFNN Evaluation I OF I 0 I I, I,3 I,4 I,5 I,6 MAPE (%) Direction (%) sources, and a variety of relevant economic indicators. These results have implications for existing sentiment tracking tools as well as surveys of self-reported subjective well-being in which individuals evaluate the extent to which they experience positive and negative affect, happiness, or satisfaction with life [40]. Such surveys are relatively expensive and timeconsuming, and may nevertheless not allow the measurement of public mood along mood dimensions that are relevant to assess particular socio-economic indicators. Public mood analysis from Twitter feeds on the other hand offers an automatic, fast, free and large-scale addition to this toolkit that may in addition be optimized to measure a variety of dimensions of the public mood state. Our analysis does not acknowledge a number of important factors that will form the basis of future research. First, we note that our analysis is not designed to be limited to any particular geographical location nor subset of the world s population. This approach may be appropriate since the US stock markets are affected by individuals worldwide, but for the particular period under observation Twitter.com users were de facto predominantly English speaking and located in the US. As Twitter.com s user base becomes increasingly international and the use of smartphones equipped with geo-location increases, future analysis will have to factor in location and language to avoid geographical and cultural sampling errors. Second, although we have cross-validated the results of different tools to assess public mood states, we have no knowledge of the ground truth for public mood states nor in fact for the particular subsample of the population represented by the community of Twitter.com users. This problem can only be addressed by increased research into direct assessments of public mood states vs. those derived from online communities such as Twitter. Third, these results are strongly indicative of a predictive correlation between measurements of the public mood states from Twitter feeds, but offer no information on the causative mechanisms that may connect public mood states with DJIA values in this manner. One could speculate that the general public is presently as strongly invested in the DJIA as financial experts, and that therefore their mood states will directly affect their investment decisions and thus stock market values, but this too remains an area of future research. ACKNOWLEDGMENT This research was supported by NSF Grant BCS #030. We thank David Crandell and Michael Nelson for their helpful comments on earlier versions of this manuscript. We are also grateful to Daniel Kahneman for his early comments on our work. REFERENCES [] Fama, Eugene F, e. a. (969) International Economic Review 0,. [] Fama, E. F. (99) Journal of Finance 46, [3] H.Cootner, P. (964) The random character of stock market prices. (MIT). [4] Fama, E. F. (965) The Journal of Business 38, [5] Qian, Bo, Rasheed, & Khaled. (007) Applied Intelligence 6, [6] Gallagher, L. A & Taylor, M. P. (00) Southern Economic Journal 69, [7] Kavussanos, M & Dockery, E. (00) Applied Financial Economics, [8] Butler, K. C & Malaikah, S. J. (99) Journal of Banking and Finance 6, [9] Schumaker, R. P & Chen, H. (009) ACM Trans. Inf. Syst. 7, : :9. [0] Gilbert, E & Karahalios, K. (00) Widespread worry and the stock market. [] Gruhl, D, Guha, R, Kumar, R, Novak, J, & Tomkins, A. (005) The predictive power of online chatter. (ACM, New York, NY, USA), pp [] Mishne, G & Glance, N. (006) Predicting Movie Sales from Blogger Sentiment. AAAI 006 Spring Symposium on Computational Approaches to Analysing Weblogs [3] S. Asur and B. A. Huberman 00 Predicting the Future with Social Media arxiv: v [4] Choi, H & Varian, H. (009) Predicting the present with google trends., (Google), Technical report. [5] Liu, Y, Huang, X, An, A, & Yu, X. (007) ARSA: a sentiment-aware model for predicting sales performance using blogs. (ACM, New York, NY, USA), pp [6] Dolan, R. J. (00) Science 98, [7] Dodds, Peter. (009) Journal of Happiness July, doi: 0.007/s [8] Damasio, A. R. (994) Descartes error : emotion, reason, and the human brain. (Putnam), pp. xix, 3 p.+. [9] Nofsinger, J. (005) Journal of Behaviour Finance. 6, [0] Edmans, A, Garca, D, & Norli,. (007) Journal of Finance 6, [] Hirshleifer, D & Shumway, T. (003) Journal of Finance 58, [] Pak, A & Paroubek, P. (00) Twitter as a Corpus for Sentiment Analysis and Opinion Mining. (European Language Resources Association (ELRA), Valletta, Malta). [3] Pang, B & Lee, L. (008) Foundations and Trends in Information Retrieval, 35. [4] Wilson, T, Wiebe, J, & Hoffmann, P. (005) Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis. (Vancouver, CA). [5] Wilson, T, Hoffmann, P, Somasundaran, S, Kessler, J, Wiebe, J, Choi, Y, Cardie, C, Riloff, E, & Patwardhan, S. (005) OpinionFinder: A system for subjectivity analysis. pp [6] O Connor, B, Balasubramanyan, R, Routledge, B. R, & Smith, N. A. (00) From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series. [7] Diener E, Diener M & Diener D (009) Factors Predicting the Subjective Well-Being of Nations. Social Indicators Research Series 38:43-70 [8] Lapedes, A & Farber, R. (987) Nonlinear signal processing using neural network: Prediction and system modeling, (Los Alamos National Lab Technical Report), Technical report. [9] Zhu, X, Wang, H, Xu, L, & Li, H. (008) Expert Syst. Appl. 34, [30] Leng, G, Prasad, G, & McGinnity, T. M. (004) Neural Netw. 7, [3] Mao, H, Zeng, X.-J, Leng, G, Zhai, Y, & Keane, A. J. (009) IEEE Transaction on Power System. 4, [3] Norcross, J. C, Guadagnoli, E, & Prochaska, J. O. (006) Journal of Clinical Psychology 40, [33] McNair, D, Heuchert, J. P, & Shilony, E. (003) Profile of mood states. Bibliography (Multi-Health Systems). [34] Pepe, A & Bollen, J. (008) Between conjecture and memento: shaping a collective emotional perception of the future.
8 8 [35] Brants, T & Franz, A. (006) Web t 5-gram version, (Linguistic Data Consortium, Philadelphia), Technical report. [36] Bergsma, S, Lin, D & Goebel, R (009) IJCAI 09: Proceedings of the st international jont conference on Artifical intelligence, San Francisco, CA., [37] Riloff, E, Wiebe, J, & Wilson, T. (003) Learning subjective nouns using extraction pattern bootstrapping. (Association for Computational Linguistics, Morristown, NJ, USA), 5 3. [38] Riloff, E & Wiebe, J. (003) Learning extraction patterns for subjective expressions. (Association for Computational Linguistics, Morristown, NJ, USA), pp. 05. [39] Kahneman, D & Tversky, Amos (979) Prospect Theory: An Analysis of Decision under Risk. (Econometrica), pp [40] Frey, B. S. (008) Happiness: A Revolution in Economics. (The MIT Press). Johan Bollen School of Informatics and Computing, Indiana Unviersity- Bloomington, United States [email protected] Huina Mao School of Informatics and Computing, Indiana Unviersity- Bloomington, United States [email protected] Xiao-Jun Zeng School of Computer Science, The University of Manchester, United Kingdom [email protected]
How To Predict Stock Price With Mood Based Models
Twitter Mood Predicts the Stock Market Xiao-Jun Zeng School of Computer Science University of Manchester [email protected] Outline Introduction and Motivation Approach Framework Twitter mood model
Sentiment Analysis. D. Skrepetos 1. University of Waterloo. NLP Presenation, 06/17/2015
Sentiment Analysis D. Skrepetos 1 1 Department of Computer Science University of Waterloo NLP Presenation, 06/17/2015 D. Skrepetos (University of Waterloo) Sentiment Analysis NLP Presenation, 06/17/2015
Sentiment analysis on tweets in a financial domain
Sentiment analysis on tweets in a financial domain Jasmina Smailović 1,2, Miha Grčar 1, Martin Žnidaršič 1 1 Dept of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia 2 Jožef Stefan International
QUANTIFYING THE EFFECTS OF ONLINE BULLISHNESS ON INTERNATIONAL FINANCIAL MARKETS
QUANTIFYING THE EFFECTS OF ONLINE BULLISHNESS ON INTERNATIONAL FINANCIAL MARKETS Huina Mao School of Informatics and Computing Indiana University, Bloomington, USA ECB Workshop on Using Big Data for Forecasting
Twitter mood predicts the stock market
Introduction Methods Results Conclusions [email protected], [email protected] School of Informatics and Computing Center for Complex Networks and Systems Research Indiana University April 9, 2011 Introduction
Stock Prediction Using Twitter Sentiment Analysis
Stock Prediction Using Twitter Sentiment Analysis Anshul Mittal Stanford University [email protected] Arpit Goel Stanford University [email protected] ABSTRACT In this paper, we apply sentiment analysis
traders jlbollen #twitter mood predicts the #DJIA FTW!
Introduction Methods Results Conclusions traders jlbollen #twitter mood predicts the #DJIA FTW! [email protected], [email protected] School of Informatics and Computing Center for Complex Networks
The Viability of StockTwits and Google Trends to Predict the Stock Market. By Chris Loughlin and Erik Harnisch
The Viability of StockTwits and Google Trends to Predict the Stock Market By Chris Loughlin and Erik Harnisch Spring 2013 Introduction Investors are always looking to gain an edge on the rest of the market.
CS 229, Autumn 2011 Modeling the Stock Market Using Twitter Sentiment Analysis
CS 229, Autumn 2011 Modeling the Stock Market Using Twitter Sentiment Analysis Team members: Daniel Debbini, Philippe Estin, Maxime Goutagny Supervisor: Mihai Surdeanu (with John Bauer) 1 Introduction
Using Tweets to Predict the Stock Market
1. Abstract Using Tweets to Predict the Stock Market Zhiang Hu, Jian Jiao, Jialu Zhu In this project we would like to find the relationship between tweets of one important Twitter user and the corresponding
Using Text and Data Mining Techniques to extract Stock Market Sentiment from Live News Streams
2012 International Conference on Computer Technology and Science (ICCTS 2012) IPCSIT vol. XX (2012) (2012) IACSIT Press, Singapore Using Text and Data Mining Techniques to extract Stock Market Sentiment
The Influence of Sentimental Analysis on Corporate Event Study
Volume-4, Issue-4, August-2014, ISSN No.: 2250-0758 International Journal of Engineering and Management Research Available at: www.ijemr.net Page Number: 10-16 The Influence of Sentimental Analysis on
Tweets Miner for Stock Market Analysis
Tweets Miner for Stock Market Analysis Bohdan Pavlyshenko Electronics department, Ivan Franko Lviv National University,Ukraine, Drahomanov Str. 50, Lviv, 79005, Ukraine, e-mail: [email protected]
The process of gathering and analyzing Twitter data to predict stock returns EC115. Economics
The process of gathering and analyzing Twitter data to predict stock returns EC115 Economics Purpose Many Americans save for retirement through plans such as 401k s and IRA s and these retirement plans
Analysis of Tweets for Prediction of Indian Stock Markets
Analysis of Tweets for Prediction of Indian Stock Markets Phillip Tichaona Sumbureru Department of Computer Science and Engineering, JNTU College of Engineering Hyderabad, Kukatpally, Hyderabad-500 085,
CSE 598 Project Report: Comparison of Sentiment Aggregation Techniques
CSE 598 Project Report: Comparison of Sentiment Aggregation Techniques Chris MacLellan [email protected] May 3, 2012 Abstract Different methods for aggregating twitter sentiment data are proposed and three
Using Twitter as a source of information for stock market prediction
Using Twitter as a source of information for stock market prediction Ramon Xuriguera ([email protected]) Joint work with Marta Arias and Argimiro Arratia ERCIM 2011, 17-19 Dec. 2011, University of
Sentiment Analysis of Twitter Feeds for the Prediction of Stock Market Movement
Sentiment Analysis of Twitter Feeds for the Prediction of Stock Market Movement Ray Chen, Marius Lazer Abstract In this paper, we investigate the relationship between Twitter feed content and stock market
Big Data and High Quality Sentiment Analysis for Stock Trading and Business Intelligence. Dr. Sulkhan Metreveli Leo Keller
Big Data and High Quality Sentiment Analysis for Stock Trading and Business Intelligence Dr. Sulkhan Metreveli Leo Keller The greed https://www.youtube.com/watch?v=r8y6djaeolo The money https://www.youtube.com/watch?v=x_6oogojnaw
Predicting Stock Market Fluctuations. from Twitter
Predicting Stock Market Fluctuations from Twitter An analysis of the predictive powers of real-time social media Sang Chung & Sandy Liu Stat 157 Professor ALdous Dec 12, 2011 Chung & Liu 2 1. Introduction
A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS
A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS Stacey Franklin Jones, D.Sc. ProTech Global Solutions Annapolis, MD Abstract The use of Social Media as a resource to characterize
News Sentiment Analysis Using R to Predict Stock Market Trends
News Sentiment Analysis Using R to Predict Stock Market Trends Anurag Nagar and Michael Hahsler Computer Science Southern Methodist University Dallas, TX Topics Motivation Gathering News Creating News
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
Forecasting stock markets with Twitter
Forecasting stock markets with Twitter Argimiro Arratia [email protected] Joint work with Marta Arias and Ramón Xuriguera To appear in: ACM Transactions on Intelligent Systems and Technology, 2013,
Predicting Stock Market Indicators Through Twitter I hope it is not as bad as I fear
Available online at www.sciencedirect.com Procedia Social and Behavioral Sciences Procedia - Social and Behavioral Sciences 00 (2009) 000 000 www.elsevier.com/locate/procedia COINs2010 Predicting Stock
Industry Environment and Concepts for Forecasting 1
Table of Contents Industry Environment and Concepts for Forecasting 1 Forecasting Methods Overview...2 Multilevel Forecasting...3 Demand Forecasting...4 Integrating Information...5 Simplifying the Forecast...6
Financial Trading System using Combination of Textual and Numerical Data
Financial Trading System using Combination of Textual and Numerical Data Shital N. Dange Computer Science Department, Walchand Institute of Rajesh V. Argiddi Assistant Prof. Computer Science Department,
Can Twitter provide enough information for predicting the stock market?
Can Twitter provide enough information for predicting the stock market? Maria Dolores Priego Porcuna Introduction Nowadays a huge percentage of financial companies are investing a lot of money on Social
INCREASING FORECASTING ACCURACY OF TREND DEMAND BY NON-LINEAR OPTIMIZATION OF THE SMOOTHING CONSTANT
58 INCREASING FORECASTING ACCURACY OF TREND DEMAND BY NON-LINEAR OPTIMIZATION OF THE SMOOTHING CONSTANT Sudipa Sarker 1 * and Mahbub Hossain 2 1 Department of Industrial and Production Engineering Bangladesh
OBJECTIVE ASSESSMENT OF FORECASTING ASSIGNMENTS USING SOME FUNCTION OF PREDICTION ERRORS
OBJECTIVE ASSESSMENT OF FORECASTING ASSIGNMENTS USING SOME FUNCTION OF PREDICTION ERRORS CLARKE, Stephen R. Swinburne University of Technology Australia One way of examining forecasting methods via assignments
STUDYING MOOD VARIATIONS IN LONGITUDINAL TWITTER TIMELINES
STUDYING MOOD VARIATIONS IN LONGITUDINAL TWITTER TIMELINES APPLICATIONS TO THE DETECTION OF PSYCHOLOGICAL TRANSITIONS JOHAN BOLLEN [email protected] INDIANA UNIVERSITY SCHOOL OF INFORMATICS AND COMPUTING
MGT 267 PROJECT. Forecasting the United States Retail Sales of the Pharmacies and Drug Stores. Done by: Shunwei Wang & Mohammad Zainal
MGT 267 PROJECT Forecasting the United States Retail Sales of the Pharmacies and Drug Stores Done by: Shunwei Wang & Mohammad Zainal Dec. 2002 The retail sale (Million) ABSTRACT The present study aims
Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate?
Is the Forward Exchange Rate a Useful Indicator of the Future Exchange Rate? Emily Polito, Trinity College In the past two decades, there have been many empirical studies both in support of and opposing
Sensex Realized Volatility Index
Sensex Realized Volatility Index Introduction: Volatility modelling has traditionally relied on complex econometric procedures in order to accommodate the inherent latent character of volatility. Realized
Italian Journal of Accounting and Economia Aziendale. International Area. Year CXIV - 2014 - n. 1, 2 e 3
Italian Journal of Accounting and Economia Aziendale International Area Year CXIV - 2014 - n. 1, 2 e 3 Could we make better prediction of stock market indicators through Twitter sentiment analysis? ALEXANDER
Testing for Granger causality between stock prices and economic growth
MPRA Munich Personal RePEc Archive Testing for Granger causality between stock prices and economic growth Pasquale Foresti 2006 Online at http://mpra.ub.uni-muenchen.de/2962/ MPRA Paper No. 2962, posted
Module 6: Introduction to Time Series Forecasting
Using Statistical Data to Make Decisions Module 6: Introduction to Time Series Forecasting Titus Awokuse and Tom Ilvento, University of Delaware, College of Agriculture and Natural Resources, Food and
Text Opinion Mining to Analyze News for Stock Market Prediction
Int. J. Advance. Soft Comput. Appl., Vol. 6, No. 1, March 2014 ISSN 2074-8523; Copyright SCRG Publication, 2014 Text Opinion Mining to Analyze News for Stock Market Prediction Yoosin Kim 1, Seung Ryul
Simple Linear Regression Inference
Simple Linear Regression Inference 1 Inference requirements The Normality assumption of the stochastic term e is needed for inference even if it is not a OLS requirement. Therefore we have: Interpretation
MARKETS, INFORMATION AND THEIR FRACTAL ANALYSIS. Mária Bohdalová and Michal Greguš Comenius University, Faculty of Management Slovak republic
MARKETS, INFORMATION AND THEIR FRACTAL ANALYSIS Mária Bohdalová and Michal Greguš Comenius University, Faculty of Management Slovak republic Abstract: We will summarize the impact of the conflict between
Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.
Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative
Twitter sentiment vs. Stock price!
Twitter sentiment vs. Stock price! Background! On April 24 th 2013, the Twitter account belonging to Associated Press was hacked. Fake posts about the Whitehouse being bombed and the President being injured
The relation between news events and stock price jump: an analysis based on neural network
20th International Congress on Modelling and Simulation, Adelaide, Australia, 1 6 December 2013 www.mssanz.org.au/modsim2013 The relation between news events and stock price jump: an analysis based on
The Use of Twitter Activity as a Stock Market Predictor
National College of Ireland Higher Diploma in Science in Data Analytics 2013/2014 Robert Coyle X13109278 [email protected] The Use of Twitter Activity as a Stock Market Predictor Table of Contents
Twitter Volume Spikes: Analysis and Application in Stock Trading
Twitter Volume Spikes: Analysis and Application in Stock Trading Yuexin Mao University of Connecticut [email protected] Wei Wei FinStats.com [email protected] Bing Wang University of Connecticut [email protected]
Purchase Conversions and Attribution Modeling in Online Advertising: An Empirical Investigation
Purchase Conversions and Attribution Modeling in Online Advertising: An Empirical Investigation Author: TAHIR NISAR - Email: [email protected] University: SOUTHAMPTON UNIVERSITY BUSINESS SCHOOL Track:
Twitter Stock Bot. John Matthew Fong The University of Texas at Austin [email protected]
Twitter Stock Bot John Matthew Fong The University of Texas at Austin [email protected] Hassaan Markhiani The University of Texas at Austin [email protected] Abstract The stock market is influenced
II. RELATED WORK. Sentiment Mining
Sentiment Mining Using Ensemble Classification Models Matthew Whitehead and Larry Yaeger Indiana University School of Informatics 901 E. 10th St. Bloomington, IN 47408 {mewhiteh, larryy}@indiana.edu Abstract
Stock Price Forecasting Using Information from Yahoo Finance and Google Trend
Stock Price Forecasting Using Information from Yahoo Finance and Google Trend Selene Yue Xu (UC Berkeley) Abstract: Stock price forecasting is a popular and important topic in financial and academic studies.
High Frequency Equity Pairs Trading: Transaction Costs, Speed of Execution and Patterns in Returns
High Frequency Equity Pairs Trading: Transaction Costs, Speed of Execution and Patterns in Returns David Bowen a Centre for Investment Research, UCC Mark C. Hutchinson b Department of Accounting, Finance
Smoothing methods. Marzena Narodzonek-Karpowska. Prof. Dr. W. Toporowski Institut für Marketing & Handel Abteilung Handel
Smoothing methods Marzena Narodzonek-Karpowska Prof. Dr. W. Toporowski Institut für Marketing & Handel Abteilung Handel What Is Forecasting? Process of predicting a future event Underlying basis of all
Prediction of Stock Market Shift using Sentiment Analysis of Twitter Feeds, Clustering and Ranking
382 Prediction of Stock Market Shift using Sentiment Analysis of Twitter Feeds, Clustering and Ranking 1 Tejas Sathe, 2 Siddhartha Gupta, 3 Shreya Nair, 4 Sukhada Bhingarkar 1,2,3,4 Dept. of Computer Engineering
The Hollywood Stock Exchange: Efficiency and The Power of Twitter
The Hollywood Stock Exchange: Efficiency and The Power of Twitter by Nathaniel Harley A special thanks to Professor Richard Walker for advising on this thesis. Also, thanks to Professor Joseph Ferrie,
Nowcasting the Bitcoin Market with Twitter Signals
1 Nowcasting the Bitcoin Market with Twitter Signals JERMAIN KAMINSKI, MIT Media Lab & Witten/Herdecke University 1 PETER A. GLOOR, MIT Center for Collective Intelligence 1. INTRODUCTION Bitcoin is a peer-to-peer
JetBlue Airways Stock Price Analysis and Prediction
JetBlue Airways Stock Price Analysis and Prediction Team Member: Lulu Liu, Jiaojiao Liu DSO530 Final Project JETBLUE AIRWAYS STOCK PRICE ANALYSIS AND PREDICTION 1 Motivation Started in February 2000, JetBlue
Data Mining Yelp Data - Predicting rating stars from review text
Data Mining Yelp Data - Predicting rating stars from review text Rakesh Chada Stony Brook University [email protected] Chetan Naik Stony Brook University [email protected] ABSTRACT The majority
Optimization of technical trading strategies and the profitability in security markets
Economics Letters 59 (1998) 249 254 Optimization of technical trading strategies and the profitability in security markets Ramazan Gençay 1, * University of Windsor, Department of Economics, 401 Sunset,
IT S ALL ABOUT THE CUSTOMER FORECASTING 101
IT S ALL ABOUT THE CUSTOMER FORECASTING 101 Ed White CPIM, CIRM, CSCP, CPF, LSSBB Chief Value Officer Jade Trillium Consulting April 01, 2015 Biography Ed White CPIM CIRM CSCP CPF LSSBB is the founder
Neural Networks for Sentiment Detection in Financial Text
Neural Networks for Sentiment Detection in Financial Text Caslav Bozic* and Detlef Seese* With a rise of algorithmic trading volume in recent years, the need for automatic analysis of financial news emerged.
On the Predictability of Stock Market Behavior using StockTwits Sentiment and Posting Volume
On the Predictability of Stock Market Behavior using StockTwits Sentiment and Posting Volume Abstract. In this study, we explored data from StockTwits, a microblogging platform exclusively dedicated to
SSgA CAPITAL INSIGHTS
SSgA CAPITAL INSIGHTS viewpoints Part of State Street s Vision thought leadership series A Stratified Sampling Approach to Generating Fixed Income Beta PHOTO by Mathias Marta Senior Investment Manager,
A Regime-Switching Model for Electricity Spot Prices. Gero Schindlmayr EnBW Trading GmbH [email protected]
A Regime-Switching Model for Electricity Spot Prices Gero Schindlmayr EnBW Trading GmbH [email protected] May 31, 25 A Regime-Switching Model for Electricity Spot Prices Abstract Electricity markets
FORECASTING. Operations Management
2013 FORECASTING Brad Fink CIT 492 Operations Management Executive Summary Woodlawn hospital needs to forecast type A blood so there is no shortage for the week of 12 October, to correctly forecast, a
Using Data Mining for Mobile Communication Clustering and Characterization
Using Data Mining for Mobile Communication Clustering and Characterization A. Bascacov *, C. Cernazanu ** and M. Marcu ** * Lasting Software, Timisoara, Romania ** Politehnica University of Timisoara/Computer
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression
Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression Objectives: To perform a hypothesis test concerning the slope of a least squares line To recognize that testing for a
Ch.3 Demand Forecasting.
Part 3 : Acquisition & Production Support. Ch.3 Demand Forecasting. Edited by Dr. Seung Hyun Lee (Ph.D., CPL) IEMS Research Center, E-mail : [email protected] Demand Forecasting. Definition. An estimate
Time series Forecasting using Holt-Winters Exponential Smoothing
Time series Forecasting using Holt-Winters Exponential Smoothing Prajakta S. Kalekar(04329008) Kanwal Rekhi School of Information Technology Under the guidance of Prof. Bernard December 6, 2004 Abstract
Big Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index
Big Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index Rickard Nyman *, Paul Ormerod Centre for the Study of Decision Making Under Uncertainty,
COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.
277 CHAPTER VI COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES. This chapter contains a full discussion of customer loyalty comparisons between private and public insurance companies
Analysis of Load Frequency Control Performance Assessment Criteria
520 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 16, NO. 3, AUGUST 2001 Analysis of Load Frequency Control Performance Assessment Criteria George Gross, Fellow, IEEE and Jeong Woo Lee Abstract This paper presents
Sentiment Analysis and Time Series with Twitter Introduction
Sentiment Analysis and Time Series with Twitter Mike Thelwall, School of Technology, University of Wolverhampton, Wulfruna Street, Wolverhampton WV1 1LY, UK. E-mail: [email protected]. Tel: +44 1902
17. SIMPLE LINEAR REGRESSION II
17. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.
MULTIPLE REGRESSIONS ON SOME SELECTED MACROECONOMIC VARIABLES ON STOCK MARKET RETURNS FROM 1986-2010
Advances in Economics and International Finance AEIF Vol. 1(1), pp. 1-11, December 2014 Available online at http://www.academiaresearch.org Copyright 2014 Academia Research Full Length Research Paper MULTIPLE
International Statistical Institute, 56th Session, 2007: Phil Everson
Teaching Regression using American Football Scores Everson, Phil Swarthmore College Department of Mathematics and Statistics 5 College Avenue Swarthmore, PA198, USA E-mail: [email protected] 1. Introduction
Financial Market Efficiency and Its Implications
Financial Market Efficiency: The Efficient Market Hypothesis (EMH) Financial Market Efficiency and Its Implications Financial markets are efficient if current asset prices fully reflect all currently available
The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network
, pp.67-76 http://dx.doi.org/10.14257/ijdta.2016.9.1.06 The Combination Forecasting Model of Auto Sales Based on Seasonal Index and RBF Neural Network Lihua Yang and Baolin Li* School of Economics and
Using News Articles to Predict Stock Price Movements
Using News Articles to Predict Stock Price Movements Győző Gidófalvi Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 9237 [email protected] 21, June 15,
HIGH DIVIDEND STOCKS IN RISING INTEREST RATE ENVIRONMENTS. September 2015
HIGH DIVIDEND STOCKS IN RISING INTEREST RATE ENVIRONMENTS September 2015 Disclosure: This research is provided for educational purposes only and is not intended to provide investment or tax advice. All
Forecasting in supply chains
1 Forecasting in supply chains Role of demand forecasting Effective transportation system or supply chain design is predicated on the availability of accurate inputs to the modeling process. One of the
Executive Summary. Abstract. Heitman Analytics Conclusions:
Prepared By: Adam Petranovich, Economic Analyst [email protected] 541 868 2788 Executive Summary Abstract The purpose of this study is to provide the most accurate estimate of historical
Organizing Your Approach to a Data Analysis
Biost/Stat 578 B: Data Analysis Emerson, September 29, 2003 Handout #1 Organizing Your Approach to a Data Analysis The general theme should be to maximize thinking about the data analysis and to minimize
Demand forecasting & Aggregate planning in a Supply chain. Session Speaker Prof.P.S.Satish
Demand forecasting & Aggregate planning in a Supply chain Session Speaker Prof.P.S.Satish 1 Introduction PEMP-EMM2506 Forecasting provides an estimate of future demand Factors that influence demand and
Indian School of Business Forecasting Sales for Dairy Products
Indian School of Business Forecasting Sales for Dairy Products Contents EXECUTIVE SUMMARY... 3 Data Analysis... 3 Forecast Horizon:... 4 Forecasting Models:... 4 Fresh milk - AmulTaaza (500 ml)... 4 Dahi/
How To Forecast Solar Power
Forecasting Solar Power with Adaptive Models A Pilot Study Dr. James W. Hall 1. Introduction Expanding the use of renewable energy sources, primarily wind and solar, has become a US national priority.
