Trading Strategies To Exploit Blog and News Sentiment

Size: px
Start display at page:

Download "Trading Strategies To Exploit Blog and News Sentiment"

Transcription

1 Trading Strategies To Exploit Blog and News Sentiment Wenbin Zhang and Steven Skiena and Department of Computer Science, Stony Brook University Stony Brook, NY USA Abstract We use quantitative media (blogs, and news as a comparison) data generated by a large-scale natural language processing (NLP) text analysis system to perform a comprehensive and comparative study on how a company s reported media frequency, sentiment polarity and subjectivity anticipates or reflects its stock trading volumes and financial returns. Our analysis provides concrete evidence that media data is highly informative, as previously suggested in the literature but never studied on our scale of several large collections of blogs and news for over five years. Building on our findings, we give a sentiment-based market-neutral trading strategy which gives consistently favorable returns with low volatility over a five year period ( ). Our results are significant in confirming the performance of general blog and news sentiment analysis methods over broad domains and sources. Moreover, several remarkable differences between news and blogs are also identified in this paper. Introduction The efficient market hypothesis asserts that financial markets are informationally efficient, which means current stock prices already reflect all known information and all occurred facts. Moreover, prices in finance markets are unbiased and contain all the wisdom or future forecasts from investors. Therefore, investors cannot make excess profits from the market if their trading strategies are based on known information, because market prices are efficiently collecting and aggregating various information and keep changing without delay. However, a large and growing literature documents that movements of financial indicators are not always consistent with the quantitative measures of firms fundamentals (e.g. (Cutler, Poterba, and Summers 1989; Roll 1988; Tetlock, Saar-Tsechansky, and Macskassy 2007)). This mandates a rethinking of the fluctuation of stock prices to seek other evidence to explain it. Some encouraging results prove the conditional usage of the efficient market hypothesis. Particularly, (Chan 2003) shows that stock prices appear to drift after important corporate events for up to several months. This suggests that some of the drift is caused by the price s under-reaction to information. News data thus Copyright c 2010, Association for the Advancement of Artificial Intelligence ( All rights reserved. could provide a feasible and useful way to analyze financial markets. Our primary goal is to study the relationship between stock market data and linguistic media data, both blogs and news, and to illustrate the extent to which they can contribute to the design of investment strategies. Our main contributions in this paper are: Comparative Study of Blogs and News We conduct a thoughtful comparative study of four different linguistic sources, i.e., Twitter, Spinn3r RSS blogs, LiveJournal blogs, and Dailies news as a comparison. We compare their sentiments with corresponding stocks and evaluate the equity trading performance with using the four sources respectively. Our analysis also discovers many distinct properties between blogs and news. For example, news information could be incorporated into stock prices instantly (almost within 1 day) after release, while blog information like Twitter will be absorbed by stock market with a longer time period (around 2 to 3 days). Large-Scale Analysis We give comprehensive results of analyzing stock market using roughly one terabyte of blog and news data and thousands of different companies. This scale of analysis has never been previously attempted in the literature, and enables us to identify short-term but statistically significant correlations between media volume / sentiment and financial returns / trading volumes. Corpus Size Matters Previous work on sentiment-based financial analysis (e.g. (Tetlock, Saar-Tsechansky, and Macskassy 2007)) focus explicitly on national financial newspapers, namely the Dow Jones News Service and the Wall Street Journal. However, we demonstrate that a more significant, reliable sentiment signal comes from analyzing a full corpus of blogs and news than just reading the DJNS or WSJ. Sentiment-oriented Equity Trading We propose a market-neutral stock trading strategy, based completely on sentiment data drawn from published blog or news sources. Through careful experimentation over five full years of news/price ( ) data, we demonstrate that our strategy provides intriguing returns with low variance (ignoring both transaction costs and the timing resolution discussed in Section Media Timing Issues).

2 Validation of Sentiment Analysis Methods Perhaps another important contribution of our paper is the strongest validation to date of the accuracy of our media sentiment analysis methodology of Lydia. Proper validation is impossible in the absence of any agreed upon gold standard for entity-level sentiment analysis (Pang and Lee 2008). But our ability to extract a sufficiently reliable sentiment signal for successfully trading upon (regardless of timing resolution) provides rigorous evidence that our sentiment methods accurately reflect real changes in response to linguistic information. This paper is organized as follows. First we review related work. We then describe the origin and characteristics of the media and financial data we work with. After that, we give a complete analysis of the correlation between major stock market variables and major media variables, which is the most important part of this paper. Finally, we propose and evaluate a market-neutral trading strategy based on media data. We conclude that financial prices are significantly correlated with quantitative media data and can be used to formulate interesting trading strategies. Related Work Previous work is divided between the finance and computer science academic communities. We first survey research from the financial realm. Tetlock (Tetlock, Saar-Tsechansky, and Macskassy 2007) investigates whether the occurrence of negative words in firm-specific news articles can help us predict firms cash flows and whether firms stock market prices incorporate linguistic information efficiently. They claim that firms stock prices under-react to the underlying negative information of news articles. More specifically, negative information in news articles are reflected in stock market prices with roughly one-day delay. Chan (Chan 2003) examines monthly returns to a subset of stocks after public news about them is released and finds that investors react slowly to information, especially after bad news. Another important finding is that stocks tend to reverse in the subsequent month after extreme price movements unaccompanied by public news. In addition, these patterns are statistically significant. One limitation of this study uses coarse, monthly granularity. In our paper, we provide analysis of daily news and price movements. Antweiler and Frank (Antweiler and Frank 2004) study more than 1.5 million messages from Yahoo! Finance and Raging Bull, which are the two most popular Internet Stock Message Boards. They employed Naive Bayes and Support Vector Machine classifiers to assess bullishness content of these stock messages. They show these message boards are quite informative, and further that bullishness is positively and significantly associated with returns. In terms of trading volume, the paper shows controversial opinions are associated with more trades. From the Computer Science side, intense researches are delivered by text mining or machine learning communities. Their basic idea is to quantify linguistic information with text mining techniques, get the predefined set of features of the training data, and then build various models with classical statistical approaches or statistical learning algorithms. A detailed survey of the text mining for market response to news can be found in (Mittermayer and Knolmayer 2006a). In particular, the 3-category model is widely used to label documents or words. The first category (positive sentiment) consists of news articles or words that make the associated financial variables increase to a certain degree in a certain time period, for example, a news event makes the price of the single stock IBM increase 0.5% in the following day. Similarly, the second category (negative sentiment) is defined accordingly. The third category consists of neutral news articles or words. Research which can characterized under this model includes (Fung, Yu, and Lam 2002; Mittermayer and Knolmayer 2006b; Thomas 2003; Wuthrich, Cho, and etc. 1998). There has also been substantial interest in the opinion mining and NLP community on using financial text streams as a domain to test sentiment analysis methods, including (Chaovalit and Zhou 2005; Pang and Lee 2002). Broadly speaking, they apply information retrieval or machine learning techniques to classify text streams into some categories and hope to produce better classification accuracy than human being, and thus the underlying opinion could be discovered. Pang and Lee (Pang and Lee 2008) gave a detail review in this domain. Stock and Media Data Here we describe the stock and media data sources which is the basis for our analysis in this paper. Stock Data Our stock price and volume data is obtained from Thomson Datastream Services (Datastream ), a comprehensive database with time series on more than two million instruments. Here we only consider the stocks listed in the New York Stock Exchange because those stocks have more intensive blogging coverage than stocks in other markets. We downloaded the data of all 3238 stocks within the period from 2005 to 2009, for their daily open, close, high, low prices, turnover volumes, and monthly market capitalizations. Media Data Company-related blog and news data was generated using the Lydia ((Lloyd, Kechagias, and Skiena 2005), a high-speed text processing system, which reduces large text streams to time series data on the frequency of sentiment of underlying media entities. In this paper, we compare four different collections of blog/news sources as follows. 1. Dailies, which includes the coverage of over 500 nationwide and local newspapers. 2. Twitter, which is a free social networking and microblogging service that enables its users to send and read messages known as tweets.

3 Depositories Dailies Twitter Spinn3r LiveJournal Table 1: Time-range coverage of different depositories (sources) in our media database. 3. Spinn3r RSS Feeds, which is a collection of blogs worldwide. 4. LiveJournal, which includes all the blogs provided by LiveJournal. Table 1 shows the corresponding time range for all the four media sources. Because Dailies depository has the biggest data volume among all above four sources, analysis based on Dailies should be more credible and we will regard it as the benchmark for blog analysis. Media Timing Issues Proper interpretation of our results requires careful attention to the timing of our news spidering (text retrieval) agents. For the Dailies news corpus we employ, the spidering program begins to download news at 11pm EST every day, a process which can take up to 12 hours. All of these articles are credited to the day when the spider program began running. Thus while the bulk of our news was certainly retrieved before the 9:30AM opening of the NYSE each day, we cannot guarantee that it is unpolluted by news reporting events after the market opening. The three blog-type medias (Twitter, Spinn3r, and Live- Journal) are different, because accurate time stamp will be provided while blogs are published. The general consistency of our results across all four corpores (under different timing models) lends support to our conclusions, but the degree to which we follow rather than anticipate price movements cannot be conclusively answered by this study. Lydia Sentiment Analysis The Lydia sentiment data consists of time series of favorable (positive) and unfavorable (negative) words co-referenced with occurrences of each named entity (here denoting companies). With using a graphtheoretic approach described in (Godbole, Srinivasaiah, and Skiena 2007), the lexicon of almost 5,000 sentiment-laden words was constructed by expanding synonyms/antonyms from small sets of seed words associated with Business, Crime, Health, Politics, Sports, and Media domains. A General sentiment index aggregates the lexicons from all of these domains. Further details of the Lydia sentiment analysis methods and their validation are reported in (Bautin, Vijayarenu, and Skiena 2008; Godbole, Srinivasaiah, and Skiena 2007). For financial market analysis, we were most interested in General, Business, and Media categories. After initial correlation analysis, we identified the General category is the most relevant one, and thus we only use the General sentiment in the subsequent analysis. Let p and n denote the number of raw positive and negative references to a given entity, which occurs a total of N times in the corpus (including neutral references). Then we derive the following natural sentiment/subjectivity measures from these raw counts: polarity = (p n)/(p + n) subjectivity = (n + p)/n pos refs per ref = p/n neg refs per ref = n/n senti diffs per ref = (p n)/n These derived measures are not highly correlated with raw sentiment counts and they can provide additional information that raw data cannot. Therefore, with them we will be able to avoid multicollinearity during linear analysis. Matching Stock / Media Entities An important technical problem concerns matching the stock and news entity names. For example, the NYSE-listed First Commonwealth Financial is associated with three entities in our database: First Commonwealth Financial, First Commonwealth Financial Corporation, and First Commonwealth Financial Corp. We aggregate the time series of all the three media entities to define the media time series for this specific company. Our matching algorithm yields around 700 to 1300 matched stock/entity pairs for the four media sources we examined. Intercorrelationship Among Media Sources Here we investigate the relationship between any two of the four media corpuses we study. Our analysis shows data of one source is significantly correlated with data of any other sources. For example, some major observations between Spinn3r and Dailies are: Reference Frequency The correlation between the monthly (yearly) normalized article counts of Spinn3r vs. Dailies is (0.6257). The corpuses share a higher correlation with frequency than with sentiment variables, reflecting both the greater variance in editorial outlook and the difficulty of detecting sentiment precisely using algorithmic methods. Sentiment Polarity The monthly sentiment polarity correlation across the two corpuses is , and is statistically significant. Our experimental results show sentiment differences per reference is a better measure than raw polarity because it eliminates some extreme values (+1, 0, or -1) of raw polarity, and thus we will use it to measure general polarity in the following sections. Subjectivity The correlation coefficient of subjectivity between the two corpuses is We can get consistent intercorrelation result for other source pairs. These substantial correlations indicates blogs and news share similar opinions to some extent over the same entities, and explains why we obtain qualitatively similar results for all corpuses.

4 Figure 1: Logged Monthly Normalized Article Counts vs. Logged Stock Trading Volume for Dailies news, broken down by market capitalization. Correlation of Media / Stock Data In this section, we analyze the correlations between media and stock variables on a large scale. Here large scale means analysis for all media vs. NYSE matched name pairs. Media Frequency vs. Trading Volume The first problem we study is the relationship between media references and stock trading volume. Intuitively, more media coverage should lead to more trades. To compensate for technical variations in spidering efficacy, we use normalized article counts instead of raw article counts to correct for fluctuations in the total volume of news spidered each day. In particular, we will always use logged normalized article counts as our standard measure of entity news frequency, which follows a Gaussian distribution. Some significant observations are: Strength of correlation For all four depositories, the correlation coefficient between logged normalized article counts and logged stock trading volume are more than 0.4. Daily, monthly, and yearly analysis For Dailies, the correlation coefficient of logged normalized article counts versus logged normalized stock trading volume for daily, monthly, and yearly analysis are , , and respectively. Therefore, monthly analysis is a proper time scale for analysis. Persistence over time How well does today s article counts correlate with yesterday s or tomorrow s trading volumes? For Dailies, a one day lag generates the highest correlation (0.74), but the correlation coefficients persist (between 0.64 and 0.68) for periods up to ten days in the future. This is due to the high auto-correlation for both article counts and stock trading volumes. Influence of Market Sectors The correlation analysis could be further broken down by market sectors. We find that for sectors Pharmaceuticals & Biotechnolog and Aerospace & Defense, intensive news references are more likely to cause more trades (corr > 0.7). By contrast, the trading volumes for sectors Electronic & Electrical Equipments and Software & Computer Services are less sensitive to media exposure (corr < 0.2). Breakdown by Market Capitalization Figure 1 shows the breaking down analysis for different market capitalizations. This figure indicates that the correlation coefficients between article counts and stock trading volume become stronger and stronger with the increasing of market capitalization. For large enough companies, their news coverage reflects relative importance more than distinctive newsworthiness. Although most of above provided data focus on Dailies news, the corresponding results are suitable for other three blog sources as well because of the significant intercorrelationship among difference sources. Frequency vs. Capitalization The second problem we studied is the relationship between firms media references and their corresponding market capitalizations. Usually bigger firms receive more media coverage. Indeed, in our media database, the logged monthly normalized article count is also positively correlated with the logged market capitalizations with a correlation coefficient of 0.42, and it is statistically significant. Media Polarity vs. Stock Returns A more interesting question is the return of stocks. We believe the return of stocks are relevant to the public opinion of corresponding firms, say, how good or how bad people think about these firms. If people think a firm is good, more likely its stock price will raise, and vice versa. From the previous section, we know that polarity is a quantitative term to describe how good a firm is. Variable Selections First we will figure out the right variables to study this problem. We consider three different performance measures for a given stock s: change of stock prices, stock returns (R(s)), and abnormal return. The abnormal return R (s) is calculated by R (s) = R(s) R(NY SE) In our correlation analysis we correlate each news variable from [polarity, change of polarity, percentage change of polarity], to each stock variable from [change of stock prices, stock return, stock abnormal return]. This gives six combination pairs for testing. Our experiments show (polarity, stock return) pair has the most significant correlations among all the combinations, so we only give the analysis results for polarity versus stock returns in the following parts. Actually, the (polarity, stock abnormal return) pair achieves very similar performance with (polarity, stock return). Correlation Analysis with Shifting of Time Figure 2 examines how much today s polarity is correlated with stock returns on proximate days. For Dailies, we see that (1) the correlation coefficient of today s polarity versus previous return decrease gradually, and (2) for days 1 and later, all the correlation coefficients are almost zero, and all those correlations are not statistically significant. This proves that today s news almost have no predictive power for the return of tomorrow or later days. We also notice that the return of day 0 has the best correlation with

5 Figure 2: Correlation analysis of Polarity vs. Daily Return for Dailies, Twitter, and Spinn3r respectively. The correlation coefficients are calculated with time lags from -5 to 5 days. Please note: we do not show LiveJournal result here because its data volume is relative small and thus the correlation between polarity and stock return is not significant. polarity. In fact, most of our daily news are published in the early morning each day, and thus it is reasonable to infer they have some predictive power for current day s return. In the other word, today s news has significant relationship with the current day s return, has some relationship with yesterday s return, but almost has no relationship with tomorrow s return. The efficient market hypothesis states that the market reflects public information in the stock price within a very short time. Therefore, Dailies polarity shown in Figure 2 illustrated this theory perfectly, i.e., the correlation between news polarity and stock returns disappear after 1 day. Blogs vs. News Figure 2 also shows Spinn3r data are very similar to Dailies data. However, Twitter is somewhat different in that its polarity also has some corrrelaionship with tomorrow s or the day after tomorrow s return. That is, stock market incorporates Twitter sentiment slower than news, may need 2 or 3 days. In our opinion, there are two possible reasons (1) Our Twitter database only contains data for 0.5 year, and datasize is small and thus the result is less accurate, and (2) Twitter data has autocorrelationship between neighbor days and the sentiment of Twitter is more persistent. To dig this problem further, we show polarity s Lag-k-Day autocorrelation for the four media sources in Figure 3. This figure indicates Dailies and Spinn3r have the similar autocorrelation levels of polarity, while Twitter and LiveJournal have the lower levels in terms of the strength of autocorrelations. One surprising fact is that all those three blog-type sources have more moderate slopes than Dailies, especially the curves of Twitter and LiveJournal are near flat. This is a very important difference between news and blogs, i.e., the sentiment conveyed by blog can last longer than news. This phenomenon is understandable because news has more significant recency effect. Strengthening the Correlation The sentiment-return correlation can be improved by removing companies with the weakest detected polarity to focus on those showing significant sentiment signal. For example, we can try to remove α% neutral data of polarity. Actualy, once 80% of the neutral sentiment are removed, the correlation coefficients will become as strong as We can also break down the correlation analysis by market sector. Particularly in our Dailies source, the Household Goods & Home Constructions, Life Insurance, and Financial Services sectors are most strongly affected by news sentiment (corr > 0.17); all of which are strongly associated with the subprime mortgage crisis. By contrast, returns from the staid Fixed Line Telecommunications, Industrial Transportation, and Beverages sectors have near zero correlation with news sentiment. Figure 4: Polarity vs. Monthly Stock Close Return for Dailies. The analysis are broken down by the scale of market capitalizations. Figure 3: Polarity s Lag-k-Day autocorrelation for Dailies, Twitter, Spinn3r, and LiveJournal respectively. From Figure 4, we can see that correlation coefficient for bigger firms is much stronger than those for smaller firms, especially, for firms who have more than 50-billion market capitalization. The result makes sense, because large firms

6 generate more intensive media coverage and thus the collective information can better indicate the firms situation. Subjectivity vs. Trading Volume Now we consider the relationship between blog/news subjectivity and stock trading volume. Subjectivity means the amount of sentiment references among total references. Within the sentiment analysis community, subjectivity is considered a more robust measure than polarity (Pang and Lee 2008). In all blog and news sources, subjectivity is positively and significantly correlated with stock trading volume. This conclusion coincides with the result from Antweiler and Frank (Antweiler and Frank 2004) that controversial opinions are associated with more trades. A Sentiment-Based Trading Agent We have demonstrated significant correlations between media data and financial market indicators. In this section, we design a market-neutral trading agent to demonstrate the predictive power of news data. A market-neutral strategy seeks to profit from both increasing and decreasing prices in a single or numerous markets by taking matching long and short positions in different stocks. We propose our market-neutral algorithm and backtest it using real market data from 2004 to The results suggest that blog/news sentiment analysis should be employed as an informative component of trading agents. The Market-neutral Strategy Our market-neutral strategy first ranks companies by their reported sentiment each day, then goes long (short) on equal amounts of positive (negative) sentiment stocks. The yearly or monthly returns generated by such a trading agent will be used for performance evaluation. Our initial investment is M and backtesting period is from start date D s to end date D e. We identify four key tunable parameters in this strategy: n: The number of stocks selected from the top and bottom of the firm list (sorted by sentiment). s: The number of historical days used for sentiment calculation. If s = 1, only the current day s sentiment is considered. h: Holding days, which means how many days we will hold for the current portfolio. C l and C u : The lower bound and upper bound of firms market capitalization. We only consider the stocks whose market capitalizations are in range [C l, C u ]. The four parameters impact our stock returns substantially, and the details will be given in the following sections. The specific algorithm is described in Algorithm 1. Because we always long stocks with the best sentiment, and short stocks with the worst sentiment, this algorithm is also called the best-sentiment strategy. Algorithm 1 A sentiment-based market-neutral strategy Require: n, s, h, [C l, C u], M, [D s, D e]. 1: Get a list of matched pairs of NYSE stocks and firm entities in blogs/news. 2: For each matched pairs, get the stock price and firm polarity time series. 3: for each day D i from D s to D e do 4: if D i is one of the first h trading days in [D s, D e] then 5: Sort all stocks based on their polarity of day D i, with filtering out stocks if their corresponding market capitalization is not in [C l, C u]. 6: For each stock in top/bottom list, invest M/(2nh) with open prices. 7: else if D i is one of the final h trading days in [D s, D e] then 8: Sell stocks bought at trading day (D i h) with open prices. 9: else 10: Sort all stocks based on their polarity of day D i, with filtering out stocks if their corresponding market capitalization is not in [C l, C u]. 11: Sell stocks bought at trading day (D i h) with open prices, get bank roll M Di. 12: For each stock in top/bottom list, invest M Di /(2n) with open prices. 13: end if 14: end for 15: return Final bank roll M, and yearly/monthly return R. Figure 5: Yearly return vs. number of selected top and bottom stocks for Dailies depository. We tune n from 1 to 20, and fix parameters: s = 1, h = 2, C l = 10 billions, C u = 600 billions. Performance Evaluation In this section, we backtest our market-neutral strategy with real blog/news and stock data over a period from 2005 to There are four key parameters (n, s, h, C l and C u ) that contribute to the final returns. We performed experiments to isolate one parameter while fixing the other three: Diversification Figure 5 shows the impact of the number of selected stocks for Dailies. As we can see, with the increasing of the number of selected stocks, the yearly return decreases. The reason is straightforward the stocks with the highest (lowest) sentiment should have the biggest price movement. If we select fewer stocks, the average sentiment of selected stocks will be higher, and the expected return will be higher as well. Blogs vs. News Comparing with news result in Figure 5, Figure 6 shows the performance of blog-type sources, running the same experiments. A big different between news and blogs is that simulation with news uses current day s sentiment, while simulation with blogs uses yesterday s sentiment because current day s blogs are probably published after the open of stock market. Spinn3r has similar performance with Dailies, while Twitter and Live-

7 Figure 6: Yearly return vs. number of selected top and bottom stocks for blog-type depositories. Figure 8: Portfolio Landscape of Monthly Return vs. Volatility analysis for experiments shown in Figure 5. greater volume of media coverage. Figure 7: Yearly return vs. stock holding days. In this Figure, Dailies uses data of Journal have much lower performance, basically because Twitter and LiveJournal do not have plenty of data volume. However, all the blog-type sources also show the same performance trend with that of news. Sentiment Analysis Period For four of the five years studied (except 2008), yearly returns decrease with the increasing of sentiment analysis period s. This is consistent with the efficient market hypothesis, since longer periods dilute the freshness of the news. Holding Period Another tunable parameter is length of time we hold the stock. For all the five years, longer holding time leads to lower returns. Again, the market will quickly reflect all the news information, and thus we will not benefit from extra holding days. Moreover, quickly redeeming the investment frees up capital to invest in more recently reported-on stocks. Blogs vs. News Figure 7 give the comparison of Dailies news and blogs in terms of the influence of stock holding days. We notice that with news data, the performance monotonously and gradually decrease with the increase of holding days, but with blogs data, there are much more fluctuations in the performance curves. Market Capitalization Our experiments showed an interesting influence of market capitalizations. Both large and small firms showed greater returns than medium-size firms. The return for small firms is enhanced because their price is more affected by news events/sentiment. For large firms, we more accurately measure sentiment due to the From these experiments, we conclude that our agent should hold small numbers of selected stocks, use short sentiment-calculation and stock holding periods, and avoid holding medium-sized firms. To further validate the correctness of our blog/news sentiment analysis, we design other two strategies - Worstsentiment Strategy and Random-selection Strategy, the former does the opposite of Best-sentiment Strategy, i.e., long bottom stocks and short top stocks, while the latter just randomly picks stocks to be long and short. Our simulation results show that best-sentimen strategy always returns positive gains, worst-sentiment strategy always produces negative returns, while random-selection strategy oscillates about zero return. Returns and Volatility Returns only capture part of investment performance. The degree of risk (volatility) taken on to achieve these returns determines to amount of leverage which can safely be employed to exploit the agent, and the overall desirability of a given portfolio in the risk-return horizon. Here we analyze the monthly returns and volatility. The monthly return is taken to be the mean value of returns for all 60 months; the volatility is the standard deviation of these monthly returns. Figure 8 demonstrates the tradeoff between risk and return, with a scatter plot of performance vs. volatility for strategies differing only in the number of stocks held in each period. Increased diversification reduces risk. The result is consistent with modern portfolio theory regarding risk and return. If we assume monthly return follows standard Gaussian distribution, the 95% confidence interval of monthly return could be constructed by two standard deviations from the mean, and according to this, investors could choose a diversification level to balance return and the risk he can afford. An interesting observation is that selecting only one stock does not always yield the highest returns, just as the result shown in Figures 5 and 6. Actually n = 1 is very risky, and usually n = 2 or 3 are much better. It n becomes larger and larger, it will make the investment less risky, but simultaneously it will reduce return.

8 Figure 9: Returns for long vs. short in the experiments of Figure 5. Long vs. Short An interesting question in any market-neutral strategy is the relative contribution of returns going long vs. going short. To answer this, we partition the experiments in Figure 5 into long and short components in Figure 9. Figure 9(a) shows long returns positive in 2005 to 2007, turning negative in 2008, but turning positive again in By contrast, Figure 9(b) shows near zero short returns in 2005 to 2007, but very high positive short returns in 2008, and very high negative short return in That is, most profits come from shorting stocks in 2008 and from longing stocks in 2009, which due to the collapse of the broad market in 2008 and recovery in 2009, and thus this result perfectly validates the market-neutrality of our strategy. Conclusions We have shown that raw or derived blog/news variables are significantly correlated with some indicators in stock markets, e.g., media references versus stock trading volume, media references versus market capitalization, media polarity versus stock returns, media subjectivity versus stock trading volume, and the opinions from one media source can reflect those from another media source. Based on blog/news sentiment data, we design a market-neural strategy, which is able to generate consistent returns for investors. There are several tunable parameters in this strategy, and thus investors need to carefully tune them to balance risk and return. Our analysis also reveals many similar and distinct properties between news and blog sources. For example, both blog and news show similar sentiment vs. stock return correlationship, and we can get similar performance trends if we design trading strategies with these sources. However, opinions in blogs are more persistent and decay more gradually over time than news. In addition, comparing with blogs, news information takes a much shorter time period (within 1 day) to be incorporated into stock market completely. References Antweiler, W., and Frank, M. Z Is all that talk just noise? the information content of internet stock message boards. Journal of Finance 3: Bautin, M.; Vijayarenu, L.; and Skiena, S International sentiment analysis for news and blogs. In Second Int. Conf. on Weblogs and Social Media (ICWSM 2008). Chan, W. S Stock price reaction to news and nonews: Drift and reversal after headlines. Journal of Financial Economics 70: Chaovalit, P., and Zhou, L Movie review mining: a comparison between supervised and unsupervised classification approaches. In Proceedings of the Hawaii International Conference on System Sciences (HICSS). Cutler, D. M.; Poterba, J. M.; and Summers, L. H What moves stock prices? Journal of Portfolio Management 15:4 12. Datastream, T. Fung, G.; Yu, J.; and Lam, W News sensitive stock trend prediction. Proceedings 6th Pacific-Asia Conference on Knowledge Discovery and Data Mining Godbole, N.; Srinivasaiah, M.; and Skiena, S Largescale sentiment analysis for news and blogs. In Proceedings of the First International Conference on Weblogs and Social Media, Lloyd, L.; Kechagias, D.; and Skiena, S Lydia: A system for large-scale news analysis. In Proc. 12th String Processing and Information Retrieval (SPIRE 2005), volume LNCS 3772, Mittermayer, M., and Knolmayer, G. F. 2006a. Text mining system for market response to news: A survey. Working Paper No 184. Mittermayer, M.-A., and Knolmayer, G. 2006b. Newscats: A news categorization and trading system. In Proceedings of the International Conference in Data Mining (ICDM06). Pang, B., and Lee, L Thumbs up? sentiment classification using machine learning techniques. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Pang, B., and Lee, L Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval Vol. 2, No 1-2: Roll, R. W R-squared. Journal of Finance Tetlock, P. C.; Saar-Tsechansky, M.; and Macskassy, S More than words: Quantifying language to measure firms fundamentals. In Proceedings of 9th Annual Texas Finance Festival. Thomas, J News and trading rules. Dissertation of Carnegie Mellon University, Pittsburgh. Wuthrich, B.; Cho, V.; and etc Daily prediction of major stock indices from textual www data. In Proceedings of 4th ACM SIGKDD Int. Conference on Knowledge Discovery and Data Mining,

Using Text and Data Mining Techniques to extract Stock Market Sentiment from Live News Streams

Using Text and Data Mining Techniques to extract Stock Market Sentiment from Live News Streams 2012 International Conference on Computer Technology and Science (ICCTS 2012) IPCSIT vol. XX (2012) (2012) IACSIT Press, Singapore Using Text and Data Mining Techniques to extract Stock Market Sentiment

More information

Twitter sentiment vs. Stock price!

Twitter sentiment vs. Stock price! Twitter sentiment vs. Stock price! Background! On April 24 th 2013, the Twitter account belonging to Associated Press was hacked. Fake posts about the Whitehouse being bombed and the President being injured

More information

Neural Networks for Sentiment Detection in Financial Text

Neural Networks for Sentiment Detection in Financial Text Neural Networks for Sentiment Detection in Financial Text Caslav Bozic* and Detlef Seese* With a rise of algorithmic trading volume in recent years, the need for automatic analysis of financial news emerged.

More information

News Sentiment Analysis Using R to Predict Stock Market Trends

News Sentiment Analysis Using R to Predict Stock Market Trends News Sentiment Analysis Using R to Predict Stock Market Trends Anurag Nagar and Michael Hahsler Computer Science Southern Methodist University Dallas, TX Topics Motivation Gathering News Creating News

More information

Predicting the Stock Market with News Articles

Predicting the Stock Market with News Articles Predicting the Stock Market with News Articles Kari Lee and Ryan Timmons CS224N Final Project Introduction Stock market prediction is an area of extreme importance to an entire industry. Stock price is

More information

ARTIFICIAL INTELLIGENCE METHODS IN STOCK INDEX PREDICTION WITH THE USE OF NEWSPAPER ARTICLES

ARTIFICIAL INTELLIGENCE METHODS IN STOCK INDEX PREDICTION WITH THE USE OF NEWSPAPER ARTICLES FOUNDATION OF CONTROL AND MANAGEMENT SCIENCES No Year Manuscripts Mateusz, KOBOS * Jacek, MAŃDZIUK ** ARTIFICIAL INTELLIGENCE METHODS IN STOCK INDEX PREDICTION WITH THE USE OF NEWSPAPER ARTICLES Analysis

More information

Financial Text Mining

Financial Text Mining Enabling Sophisticated Financial Text Mining Calum Robertson Research Analyst, Sirca Background Data Research Strategies Obstacles Conclusions Overview Background Efficient Market Hypothesis Asset Price

More information

Sentiment analysis on tweets in a financial domain

Sentiment analysis on tweets in a financial domain Sentiment analysis on tweets in a financial domain Jasmina Smailović 1,2, Miha Grčar 1, Martin Žnidaršič 1 1 Dept of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia 2 Jožef Stefan International

More information

Algorithmic Trading Session 6 Trade Signal Generation IV Momentum Strategies. Oliver Steinki, CFA, FRM

Algorithmic Trading Session 6 Trade Signal Generation IV Momentum Strategies. Oliver Steinki, CFA, FRM Algorithmic Trading Session 6 Trade Signal Generation IV Momentum Strategies Oliver Steinki, CFA, FRM Outline Introduction What is Momentum? Tests to Discover Momentum Interday Momentum Strategies Intraday

More information

QUANTIFYING THE EFFECTS OF ONLINE BULLISHNESS ON INTERNATIONAL FINANCIAL MARKETS

QUANTIFYING THE EFFECTS OF ONLINE BULLISHNESS ON INTERNATIONAL FINANCIAL MARKETS QUANTIFYING THE EFFECTS OF ONLINE BULLISHNESS ON INTERNATIONAL FINANCIAL MARKETS Huina Mao School of Informatics and Computing Indiana University, Bloomington, USA ECB Workshop on Using Big Data for Forecasting

More information

The Viability of StockTwits and Google Trends to Predict the Stock Market. By Chris Loughlin and Erik Harnisch

The Viability of StockTwits and Google Trends to Predict the Stock Market. By Chris Loughlin and Erik Harnisch The Viability of StockTwits and Google Trends to Predict the Stock Market By Chris Loughlin and Erik Harnisch Spring 2013 Introduction Investors are always looking to gain an edge on the rest of the market.

More information

Data Integration: Financial Domain-Driven Approach

Data Integration: Financial Domain-Driven Approach 70 Data Integration: Financial Domain-Driven Approach Caslav Bozic 1, Detlef Seese 2, and Christof Weinhardt 3 1 IME Graduate School, Karlsruhe Institute of Technology (KIT) bozic@kit.edu 2 Institute AIFB,

More information

Sentiment Analysis. D. Skrepetos 1. University of Waterloo. NLP Presenation, 06/17/2015

Sentiment Analysis. D. Skrepetos 1. University of Waterloo. NLP Presenation, 06/17/2015 Sentiment Analysis D. Skrepetos 1 1 Department of Computer Science University of Waterloo NLP Presenation, 06/17/2015 D. Skrepetos (University of Waterloo) Sentiment Analysis NLP Presenation, 06/17/2015

More information

by Maria Heiden, Berenberg Bank

by Maria Heiden, Berenberg Bank Dynamic hedging of equity price risk with an equity protect overlay: reduce losses and exploit opportunities by Maria Heiden, Berenberg Bank As part of the distortions on the international stock markets

More information

Pattern Recognition and Prediction in Equity Market

Pattern Recognition and Prediction in Equity Market Pattern Recognition and Prediction in Equity Market Lang Lang, Kai Wang 1. Introduction In finance, technical analysis is a security analysis discipline used for forecasting the direction of prices through

More information

Stock Market Forecasting Using Machine Learning Algorithms

Stock Market Forecasting Using Machine Learning Algorithms Stock Market Forecasting Using Machine Learning Algorithms Shunrong Shen, Haomiao Jiang Department of Electrical Engineering Stanford University {conank,hjiang36}@stanford.edu Tongda Zhang Department of

More information

How to Win the Stock Market Game

How to Win the Stock Market Game How to Win the Stock Market Game 1 Developing Short-Term Stock Trading Strategies by Vladimir Daragan PART 1 Table of Contents 1. Introduction 2. Comparison of trading strategies 3. Return per trade 4.

More information

Stock Market Prediction Using Data Mining

Stock Market Prediction Using Data Mining Stock Market Prediction Using Data Mining 1 Ruchi Desai, 2 Prof.Snehal Gandhi 1 M.E., 2 M.Tech. 1 Computer Department 1 Sarvajanik College of Engineering and Technology, Surat, Gujarat, India Abstract

More information

INCORPORATION OF LIQUIDITY RISKS INTO EQUITY PORTFOLIO RISK ESTIMATES. Dan dibartolomeo September 2010

INCORPORATION OF LIQUIDITY RISKS INTO EQUITY PORTFOLIO RISK ESTIMATES. Dan dibartolomeo September 2010 INCORPORATION OF LIQUIDITY RISKS INTO EQUITY PORTFOLIO RISK ESTIMATES Dan dibartolomeo September 2010 GOALS FOR THIS TALK Assert that liquidity of a stock is properly measured as the expected price change,

More information

Sentiment Analysis of Twitter Feeds for the Prediction of Stock Market Movement

Sentiment Analysis of Twitter Feeds for the Prediction of Stock Market Movement Sentiment Analysis of Twitter Feeds for the Prediction of Stock Market Movement Ray Chen, Marius Lazer Abstract In this paper, we investigate the relationship between Twitter feed content and stock market

More information

Execution Costs. Post-trade reporting. December 17, 2008 Robert Almgren / Encyclopedia of Quantitative Finance Execution Costs 1

Execution Costs. Post-trade reporting. December 17, 2008 Robert Almgren / Encyclopedia of Quantitative Finance Execution Costs 1 December 17, 2008 Robert Almgren / Encyclopedia of Quantitative Finance Execution Costs 1 Execution Costs Execution costs are the difference in value between an ideal trade and what was actually done.

More information

An Automated Guided Model For Integrating News Into Stock Trading Strategies Pallavi Parshuram Katke 1, Ass.Prof. B.R.Solunke 2

An Automated Guided Model For Integrating News Into Stock Trading Strategies Pallavi Parshuram Katke 1, Ass.Prof. B.R.Solunke 2 www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 4 Issue - 12 December, 2015 Page No. 15312-15316 An Automated Guided Model For Integrating News Into Stock Trading

More information

How can we discover stocks that will

How can we discover stocks that will Algorithmic Trading Strategy Based On Massive Data Mining Haoming Li, Zhijun Yang and Tianlun Li Stanford University Abstract We believe that there is useful information hiding behind the noisy and massive

More information

Social Media Mining. Data Mining Essentials

Social Media Mining. Data Mining Essentials Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers

More information

Italian Journal of Accounting and Economia Aziendale. International Area. Year CXIV - 2014 - n. 1, 2 e 3

Italian Journal of Accounting and Economia Aziendale. International Area. Year CXIV - 2014 - n. 1, 2 e 3 Italian Journal of Accounting and Economia Aziendale International Area Year CXIV - 2014 - n. 1, 2 e 3 Could we make better prediction of stock market indicators through Twitter sentiment analysis? ALEXANDER

More information

Stock Market Predictability Using News Indexes

Stock Market Predictability Using News Indexes This article was translated by the author and reprinted from the April 204 issue of the Securities Analysts Journal with the permission of the Securities Analysts Association of Japan (SAAJ). Stock Market

More information

CHAPTER 11: THE EFFICIENT MARKET HYPOTHESIS

CHAPTER 11: THE EFFICIENT MARKET HYPOTHESIS CHAPTER 11: THE EFFICIENT MARKET HYPOTHESIS PROBLEM SETS 1. The correlation coefficient between stock returns for two non-overlapping periods should be zero. If not, one could use returns from one period

More information

Predicting IMDB Movie Ratings Using Social Media

Predicting IMDB Movie Ratings Using Social Media Predicting IMDB Movie Ratings Using Social Media Andrei Oghina, Mathias Breuss, Manos Tsagkias, and Maarten de Rijke ISLA, University of Amsterdam, Science Park 904, 1098 XH Amsterdam, The Netherlands

More information

The Equity Evaluations In. Standard & Poor s. Stock Reports

The Equity Evaluations In. Standard & Poor s. Stock Reports The Equity Evaluations In Standard & Poor s Stock Reports The Equity Evaluations in Standard & Poor s Stock Reports Standard & Poor's Stock Reports present an in-depth picture of each company's activities,

More information

Technical analysis is one of the most popular methods

Technical analysis is one of the most popular methods Comparing Profitability of Day Trading Using ORB Strategies on Index Futures Markets in Taiwan, Hong-Kong, and USA Yi-Cheng Tsai, Mu-En Wu, Chin-Laung Lei, Chung-Shu Wu, and Jan-Ming Ho Abstract In literature,

More information

Data Mining Yelp Data - Predicting rating stars from review text

Data Mining Yelp Data - Predicting rating stars from review text Data Mining Yelp Data - Predicting rating stars from review text Rakesh Chada Stony Brook University rchada@cs.stonybrook.edu Chetan Naik Stony Brook University cnaik@cs.stonybrook.edu ABSTRACT The majority

More information

Text Opinion Mining to Analyze News for Stock Market Prediction

Text Opinion Mining to Analyze News for Stock Market Prediction Int. J. Advance. Soft Comput. Appl., Vol. 6, No. 1, March 2014 ISSN 2074-8523; Copyright SCRG Publication, 2014 Text Opinion Mining to Analyze News for Stock Market Prediction Yoosin Kim 1, Seung Ryul

More information

A Description of Consumer Activity in Twitter

A Description of Consumer Activity in Twitter Justin Stewart A Description of Consumer Activity in Twitter At least for the astute economist, the introduction of techniques from computational science into economics has and is continuing to change

More information

EXTRAPOLATION BIAS: INSIDER TRADING IMPROVEMENT SIGNAL

EXTRAPOLATION BIAS: INSIDER TRADING IMPROVEMENT SIGNAL EXTRAPOLATION BIAS: INSIDER TRADING IMPROVEMENT SIGNAL HIGHLIGHTS Consistent with previous studies, we find that knowledge of insider trading is valuable to non-insider investors. We find that the change

More information

Financial Market Efficiency and Its Implications

Financial Market Efficiency and Its Implications Financial Market Efficiency: The Efficient Market Hypothesis (EMH) Financial Market Efficiency and Its Implications Financial markets are efficient if current asset prices fully reflect all currently available

More information

Financial Trading System using Combination of Textual and Numerical Data

Financial Trading System using Combination of Textual and Numerical Data Financial Trading System using Combination of Textual and Numerical Data Shital N. Dange Computer Science Department, Walchand Institute of Rajesh V. Argiddi Assistant Prof. Computer Science Department,

More information

Target Strategy: a practical application to ETFs and ETCs

Target Strategy: a practical application to ETFs and ETCs Target Strategy: a practical application to ETFs and ETCs Abstract During the last 20 years, many asset/fund managers proposed different absolute return strategies to gain a positive return in any financial

More information

When to Refinance Mortgage Loans in a Stochastic Interest Rate Environment

When to Refinance Mortgage Loans in a Stochastic Interest Rate Environment When to Refinance Mortgage Loans in a Stochastic Interest Rate Environment Siwei Gan, Jin Zheng, Xiaoxia Feng, and Dejun Xie Abstract Refinancing refers to the replacement of an existing debt obligation

More information

I.e., the return per dollar from investing in the shares from time 0 to time 1,

I.e., the return per dollar from investing in the shares from time 0 to time 1, XVII. SECURITY PRICING AND SECURITY ANALYSIS IN AN EFFICIENT MARKET Consider the following somewhat simplified description of a typical analyst-investor's actions in making an investment decision. First,

More information

CHAPTER 7: OPTIMAL RISKY PORTFOLIOS

CHAPTER 7: OPTIMAL RISKY PORTFOLIOS CHAPTER 7: OPTIMAL RIKY PORTFOLIO PROLEM ET 1. (a) and (e).. (a) and (c). After real estate is added to the portfolio, there are four asset classes in the portfolio: stocks, bonds, cash and real estate.

More information

Challenges of Cloud Scale Natural Language Processing

Challenges of Cloud Scale Natural Language Processing Challenges of Cloud Scale Natural Language Processing Mark Dredze Johns Hopkins University My Interests? Information Expressed in Human Language Machine Learning Natural Language Processing Intelligent

More information

A Review of Cross Sectional Regression for Financial Data You should already know this material from previous study

A Review of Cross Sectional Regression for Financial Data You should already know this material from previous study A Review of Cross Sectional Regression for Financial Data You should already know this material from previous study But I will offer a review, with a focus on issues which arise in finance 1 TYPES OF FINANCIAL

More information

Can the Content of Public News be used to Forecast Abnormal Stock Market Behaviour?

Can the Content of Public News be used to Forecast Abnormal Stock Market Behaviour? Seventh IEEE International Conference on Data Mining Can the Content of Public News be used to Forecast Abnormal Stock Market Behaviour? Calum Robertson Information Research Group Queensland University

More information

Designing Ranking Systems for Consumer Reviews: The Impact of Review Subjectivity on Product Sales and Review Quality

Designing Ranking Systems for Consumer Reviews: The Impact of Review Subjectivity on Product Sales and Review Quality Designing Ranking Systems for Consumer Reviews: The Impact of Review Subjectivity on Product Sales and Review Quality Anindya Ghose, Panagiotis G. Ipeirotis {aghose, panos}@stern.nyu.edu Department of

More information

Social Market Analytics, Inc.

Social Market Analytics, Inc. S-Factors : Definition, Use, and Significance Social Market Analytics, Inc. Harness the Power of Social Media Intelligence January 2014 P a g e 2 Introduction Social Market Analytics, Inc., (SMA) produces

More information

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS Stacey Franklin Jones, D.Sc. ProTech Global Solutions Annapolis, MD Abstract The use of Social Media as a resource to characterize

More information

The relation between the trading activity of financial agents and stock price dynamics

The relation between the trading activity of financial agents and stock price dynamics The relation between the trading activity of financial agents and stock price dynamics Fabrizio Lillo Scuola Normale Superiore di Pisa, University of Palermo (Italy) and Santa Fe Institute (USA) Econophysics

More information

Machine Learning in Stock Price Trend Forecasting

Machine Learning in Stock Price Trend Forecasting Machine Learning in Stock Price Trend Forecasting Yuqing Dai, Yuning Zhang yuqingd@stanford.edu, zyn@stanford.edu I. INTRODUCTION Predicting the stock price trend by interpreting the seemly chaotic market

More information

News Trading and Speed

News Trading and Speed News Trading and Speed Thierry Foucault, Johan Hombert, and Ioanid Rosu (HEC) High Frequency Trading Conference Plan Plan 1. Introduction - Research questions 2. Model 3. Is news trading different? 4.

More information

Twitter Stock Bot. John Matthew Fong The University of Texas at Austin jmfong@cs.utexas.edu

Twitter Stock Bot. John Matthew Fong The University of Texas at Austin jmfong@cs.utexas.edu Twitter Stock Bot John Matthew Fong The University of Texas at Austin jmfong@cs.utexas.edu Hassaan Markhiani The University of Texas at Austin hassaan@cs.utexas.edu Abstract The stock market is influenced

More information

ETF Total Cost Analysis in Action

ETF Total Cost Analysis in Action Morningstar ETF Research ETF Total Cost Analysis in Action Authors: Paul Justice, CFA, Director of ETF Research, North America Michael Rawson, CFA, ETF Analyst 2 ETF Total Cost Analysis in Action Exchange

More information

Financial Evolution and Stability The Case of Hedge Funds

Financial Evolution and Stability The Case of Hedge Funds Financial Evolution and Stability The Case of Hedge Funds KENT JANÉR MD of Nektar Asset Management, a market-neutral hedge fund that works with a large element of macroeconomic assessment. Hedge funds

More information

Reverse spreads: theoretical foundations and trading model

Reverse spreads: theoretical foundations and trading model Reverse spreads: theoretical foundations and trading model Introduction Those who already faced the Trend following concept know that effective strategies of following the trend suggest dynamic stop loss

More information

Prediction of Stock Performance Using Analytical Techniques

Prediction of Stock Performance Using Analytical Techniques 136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University

More information

Lecture 23: Pairs Trading Steven Skiena. http://www.cs.sunysb.edu/ skiena

Lecture 23: Pairs Trading Steven Skiena. http://www.cs.sunysb.edu/ skiena Lecture 23: Pairs Trading Steven Skiena Department of Computer Science State University of New York Stony Brook, NY 11794 4400 http://www.cs.sunysb.edu/ skiena Pairs Trading This strategy was pioneered

More information

INVESTING & TRADING WITH THE JSE PRICE/EARNINGS (P/E) RATIO

INVESTING & TRADING WITH THE JSE PRICE/EARNINGS (P/E) RATIO INVESTING & TRADING WITH THE JSE PRICE/EARNINGS (P/E) RATIO "At the end of the day, all stock price movements can be traced back to earnings." Page 1 INTRODUCTION This e-book shows how to use the daily

More information

Peer Reviewed. Abstract

Peer Reviewed. Abstract Peer Reviewed William J. Trainor, Jr.(trainor@etsu.edu) is an Associate Professor of Finance, Department of Economics and Finance, College of Business and Technology, East Tennessee State University. Abstract

More information

Lecture 8: Stock market reaction to accounting data

Lecture 8: Stock market reaction to accounting data Lecture 8: Stock market reaction to accounting data In this lecture we will focus on how the market appears to evaluate accounting disclosures. For most of the time, we shall be examining the results of

More information

Using Macro News Events in Automated FX Trading Strategy

Using Macro News Events in Automated FX Trading Strategy Using Macro News Events in Automated FX Trading Strategy The Main Thesis The arrival of macroeconomic news from the world s largest economies brings additional volatility to the market. 2 Overall methodology

More information

Rethinking Fixed Income

Rethinking Fixed Income Rethinking Fixed Income Challenging Conventional Wisdom May 2013 Risk. Reinsurance. Human Resources. Rethinking Fixed Income: Challenging Conventional Wisdom With US Treasury interest rates at, or near,

More information

Whether you re new to trading or an experienced investor, listed stock

Whether you re new to trading or an experienced investor, listed stock Chapter 1 Options Trading and Investing In This Chapter Developing an appreciation for options Using option analysis with any market approach Focusing on limiting risk Capitalizing on advanced techniques

More information

Using Currency Futures to Hedge Currency Risk

Using Currency Futures to Hedge Currency Risk Using Currency Futures to Hedge Currency Risk By Sayee Srinivasan & Steven Youngren Product Research & Development Chicago Mercantile Exchange Inc. Introduction Investment professionals face a tough climate.

More information

Lecture 6: Portfolios with Stock Options Steven Skiena. http://www.cs.sunysb.edu/ skiena

Lecture 6: Portfolios with Stock Options Steven Skiena. http://www.cs.sunysb.edu/ skiena Lecture 6: Portfolios with Stock Options Steven Skiena Department of Computer Science State University of New York Stony Brook, NY 11794 4400 http://www.cs.sunysb.edu/ skiena Portfolios with Options The

More information

Big Data and High Quality Sentiment Analysis for Stock Trading and Business Intelligence. Dr. Sulkhan Metreveli Leo Keller

Big Data and High Quality Sentiment Analysis for Stock Trading and Business Intelligence. Dr. Sulkhan Metreveli Leo Keller Big Data and High Quality Sentiment Analysis for Stock Trading and Business Intelligence Dr. Sulkhan Metreveli Leo Keller The greed https://www.youtube.com/watch?v=r8y6djaeolo The money https://www.youtube.com/watch?v=x_6oogojnaw

More information

Market Efficiency: Definitions and Tests. Aswath Damodaran

Market Efficiency: Definitions and Tests. Aswath Damodaran Market Efficiency: Definitions and Tests 1 Why market efficiency matters.. Question of whether markets are efficient, and if not, where the inefficiencies lie, is central to investment valuation. If markets

More information

Equity forecast: Predicting long term stock price movement using machine learning

Equity forecast: Predicting long term stock price movement using machine learning Equity forecast: Predicting long term stock price movement using machine learning Nikola Milosevic School of Computer Science, University of Manchester, UK Nikola.milosevic@manchester.ac.uk Abstract Long

More information

Implementing Point and Figure RS Signals

Implementing Point and Figure RS Signals Dorsey Wright Money Management 790 E. Colorado Blvd, Suite 808 Pasadena, CA 91101 626-535-0630 John Lewis, CMT August, 2014 Implementing Point and Figure RS Signals Relative Strength, also known as Momentum,

More information

ALGORITHMIC TRADING USING MACHINE LEARNING TECH-

ALGORITHMIC TRADING USING MACHINE LEARNING TECH- ALGORITHMIC TRADING USING MACHINE LEARNING TECH- NIQUES: FINAL REPORT Chenxu Shao, Zheming Zheng Department of Management Science and Engineering December 12, 2013 ABSTRACT In this report, we present an

More information

Do Tweets Matter for Shareholders? An Empirical Analysis

Do Tweets Matter for Shareholders? An Empirical Analysis Do Tweets Matter for Shareholders? An Empirical Analysis Brittany Cole University of Mississippi Jonathan Daigle University of Mississippi Bonnie F. Van Ness University of Mississippi We identify the 215

More information

Prediction of Stock Market Shift using Sentiment Analysis of Twitter Feeds, Clustering and Ranking

Prediction of Stock Market Shift using Sentiment Analysis of Twitter Feeds, Clustering and Ranking 382 Prediction of Stock Market Shift using Sentiment Analysis of Twitter Feeds, Clustering and Ranking 1 Tejas Sathe, 2 Siddhartha Gupta, 3 Shreya Nair, 4 Sukhada Bhingarkar 1,2,3,4 Dept. of Computer Engineering

More information

The relation between news events and stock price jump: an analysis based on neural network

The relation between news events and stock price jump: an analysis based on neural network 20th International Congress on Modelling and Simulation, Adelaide, Australia, 1 6 December 2013 www.mssanz.org.au/modsim2013 The relation between news events and stock price jump: an analysis based on

More information

Discussion of Momentum and Autocorrelation in Stock Returns

Discussion of Momentum and Autocorrelation in Stock Returns Discussion of Momentum and Autocorrelation in Stock Returns Joseph Chen University of Southern California Harrison Hong Stanford University Jegadeesh and Titman (1993) document individual stock momentum:

More information

A Proposed Prediction Model for Forecasting the Financial Market Value According to Diversity in Factor

A Proposed Prediction Model for Forecasting the Financial Market Value According to Diversity in Factor A Proposed Prediction Model for Forecasting the Financial Market Value According to Diversity in Factor Ms. Hiral R. Patel, Mr. Amit B. Suthar, Dr. Satyen M. Parikh Assistant Professor, DCS, Ganpat University,

More information

A CRF-based approach to find stock price correlation with company-related Twitter sentiment

A CRF-based approach to find stock price correlation with company-related Twitter sentiment POLITECNICO DI MILANO Scuola di Ingegneria dell Informazione POLO TERRITORIALE DI COMO Master of Science in Computer Engineering A CRF-based approach to find stock price correlation with company-related

More information

Exploring the use of Big Data techniques for simulating Algorithmic Trading Strategies

Exploring the use of Big Data techniques for simulating Algorithmic Trading Strategies Exploring the use of Big Data techniques for simulating Algorithmic Trading Strategies Nishith Tirpankar, Jiten Thakkar tirpankar.n@gmail.com, jitenmt@gmail.com December 20, 2015 Abstract In the world

More information

The Scientific Data Mining Process

The Scientific Data Mining Process Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In

More information

OUTLIER ANALYSIS. Data Mining 1

OUTLIER ANALYSIS. Data Mining 1 OUTLIER ANALYSIS Data Mining 1 What Are Outliers? Outlier: A data object that deviates significantly from the normal objects as if it were generated by a different mechanism Ex.: Unusual credit card purchase,

More information

Domain Classification of Technical Terms Using the Web

Domain Classification of Technical Terms Using the Web Systems and Computers in Japan, Vol. 38, No. 14, 2007 Translated from Denshi Joho Tsushin Gakkai Ronbunshi, Vol. J89-D, No. 11, November 2006, pp. 2470 2482 Domain Classification of Technical Terms Using

More information

A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier

A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier G.T. Prasanna Kumari Associate Professor, Dept of Computer Science and Engineering, Gokula Krishna College of Engg, Sullurpet-524121,

More information

Risk and Return in the Canadian Bond Market

Risk and Return in the Canadian Bond Market Risk and Return in the Canadian Bond Market Beyond yield and duration. Ronald N. Kahn and Deepak Gulrajani (Reprinted with permission from The Journal of Portfolio Management ) RONALD N. KAHN is Director

More information

Perspective. Economic and Market. Has Stock Market Stability Increased Vulnerability?

Perspective. Economic and Market. Has Stock Market Stability Increased Vulnerability? James W. Paulsen, Ph.D. Perspective Bringing you national and global economic trends for more than 30 years Economic and Market April 6, 2015 Has Stock Market Stability Increased Vulnerability? The emotional

More information

8.1 Summary and conclusions 8.2 Implications

8.1 Summary and conclusions 8.2 Implications Conclusion and Implication V{tÑàxÜ CONCLUSION AND IMPLICATION 8 Contents 8.1 Summary and conclusions 8.2 Implications Having done the selection of macroeconomic variables, forecasting the series and construction

More information

Chapter 20: Data Analysis

Chapter 20: Data Analysis Chapter 20: Data Analysis Database System Concepts, 6 th Ed. See www.db-book.com for conditions on re-use Chapter 20: Data Analysis Decision Support Systems Data Warehousing Data Mining Classification

More information

Study on the Volatility Smile of EUR/USD Currency Options and Trading Strategies

Study on the Volatility Smile of EUR/USD Currency Options and Trading Strategies Prof. Joseph Fung, FDS Study on the Volatility Smile of EUR/USD Currency Options and Trading Strategies BY CHEN Duyi 11050098 Finance Concentration LI Ronggang 11050527 Finance Concentration An Honors

More information

On the Predictability of Stock Market Behavior using StockTwits Sentiment and Posting Volume

On the Predictability of Stock Market Behavior using StockTwits Sentiment and Posting Volume On the Predictability of Stock Market Behavior using StockTwits Sentiment and Posting Volume Abstract. In this study, we explored data from StockTwits, a microblogging platform exclusively dedicated to

More information

Bank Profitability: The Impact of Foreign Currency Fluctuations

Bank Profitability: The Impact of Foreign Currency Fluctuations Bank Profitability: The Impact of Foreign Currency Fluctuations Ling T. He University of Central Arkansas Alex Fayman University of Central Arkansas K. Michael Casey University of Central Arkansas Given

More information

Better decision making under uncertain conditions using Monte Carlo Simulation

Better decision making under uncertain conditions using Monte Carlo Simulation IBM Software Business Analytics IBM SPSS Statistics Better decision making under uncertain conditions using Monte Carlo Simulation Monte Carlo simulation and risk analysis techniques in IBM SPSS Statistics

More information

Understanding Fixed Income

Understanding Fixed Income Understanding Fixed Income 2014 AMP Capital Investors Limited ABN 59 001 777 591 AFSL 232497 Understanding Fixed Income About fixed income at AMP Capital Our global presence helps us deliver outstanding

More information

A Comparative Study on Sentiment Classification and Ranking on Product Reviews

A Comparative Study on Sentiment Classification and Ranking on Product Reviews A Comparative Study on Sentiment Classification and Ranking on Product Reviews C.EMELDA Research Scholar, PG and Research Department of Computer Science, Nehru Memorial College, Putthanampatti, Bharathidasan

More information

VOLATILITY AND DEVIATION OF DISTRIBUTED SOLAR

VOLATILITY AND DEVIATION OF DISTRIBUTED SOLAR VOLATILITY AND DEVIATION OF DISTRIBUTED SOLAR Andrew Goldstein Yale University 68 High Street New Haven, CT 06511 andrew.goldstein@yale.edu Alexander Thornton Shawn Kerrigan Locus Energy 657 Mission St.

More information

Robust Sentiment Detection on Twitter from Biased and Noisy Data

Robust Sentiment Detection on Twitter from Biased and Noisy Data Robust Sentiment Detection on Twitter from Biased and Noisy Data Luciano Barbosa AT&T Labs - Research lbarbosa@research.att.com Junlan Feng AT&T Labs - Research junlan@research.att.com Abstract In this

More information

Using News Articles to Predict Stock Price Movements

Using News Articles to Predict Stock Price Movements Using News Articles to Predict Stock Price Movements Győző Gidófalvi Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 9237 gyozo@cs.ucsd.edu 21, June 15,

More information

Sentiment Analysis on Big Data

Sentiment Analysis on Big Data SPAN White Paper!? Sentiment Analysis on Big Data Machine Learning Approach Several sources on the web provide deep insight about people s opinions on the products and services of various companies. Social

More information

Real estate: The impact of rising interest rates

Real estate: The impact of rising interest rates Fall 015 TIAA-CREF Asset Management Real estate: The impact of rising interest rates Overview TIAA-CREF Global Real Estate Strategy & Research Martha Peyton, Ph.D. Managing Director Edward F. Pierzak,

More information

The Best of Both Worlds:

The Best of Both Worlds: The Best of Both Worlds: A Hybrid Approach to Calculating Value at Risk Jacob Boudoukh 1, Matthew Richardson and Robert F. Whitelaw Stern School of Business, NYU The hybrid approach combines the two most

More information

UBS Global Asset Management has

UBS Global Asset Management has IIJ-130-STAUB.qxp 4/17/08 4:45 PM Page 1 RENATO STAUB is a senior assest allocation and risk analyst at UBS Global Asset Management in Zurich. renato.staub@ubs.com Deploying Alpha: A Strategy to Capture

More information

Review for Exam 2. Instructions: Please read carefully

Review for Exam 2. Instructions: Please read carefully Review for Exam 2 Instructions: Please read carefully The exam will have 25 multiple choice questions and 5 work problems You are not responsible for any topics that are not covered in the lecture note

More information

IMPACT OF SOCIAL MEDIA ON THE STOCK MARKET: EVIDENCE FROM TWEETS

IMPACT OF SOCIAL MEDIA ON THE STOCK MARKET: EVIDENCE FROM TWEETS IMPACT OF SOCIAL MEDIA ON THE STOCK MARKET: EVIDENCE FROM TWEETS Vojtěch Fiala 1, Svatopluk Kapounek 1, Ondřej Veselý 1 1 Mendel University in Brno Volume 1 Issue 1 ISSN 2336-6494 www.ejobsat.com ABSTRACT

More information

Performing equity investment simulations with the portfoliosim package

Performing equity investment simulations with the portfoliosim package Performing equity investment simulations with the portfoliosim package Kyle Campbell, Jeff Enos, and David Kane February 18, 2010 Abstract The portfoliosim package provides a flexible system for back-testing

More information