Financial Text Mining



Similar documents
Neural Networks for Sentiment Detection in Financial Text

ARTIFICIAL INTELLIGENCE METHODS IN STOCK INDEX PREDICTION WITH THE USE OF NEWSPAPER ARTICLES

Using Text and Data Mining Techniques to extract Stock Market Sentiment from Live News Streams

The relation between news events and stock price jump: an analysis based on neural network

HIGH FREQUENCY TRADING, ALGORITHMIC BUY-SIDE EXECUTION AND LINGUISTIC SYNTAX. Dan dibartolomeo Northfield Information Services 2011

Pattern Recognition and Prediction in Equity Market

The Handbook of News Analytics \ in Finance

Data Integration: Financial Domain-Driven Approach

ECON4510 Finance Theory Lecture 7

Trading for News: an Examination of Intraday Trading Behaviour of Australian Treasury-Bond Futures Markets

Investor Performance in ASX shares; contrasting individual investors to foreign and domestic. institutions. 1

Trading Strategies To Exploit Blog and News Sentiment

INCORPORATION OF LIQUIDITY RISKS INTO EQUITY PORTFOLIO RISK ESTIMATES. Dan dibartolomeo September 2010

Algorithmic Trading Session 6 Trade Signal Generation IV Momentum Strategies. Oliver Steinki, CFA, FRM

Big Data to trade bonds/fx & Python demo on FX intraday vol

Journal Of Financial And Strategic Decisions Volume 9 Number 3 Fall 1996 STOCK PRICES AND THE BARRON S RESEARCH REPORTS COLUMN

Equities Dealing, Brokerage and Market Making

Financial Trading System using Combination of Textual and Numerical Data

SAIF-2011 Report. Rami Reddy, SOA, UW_P

Stock Market Prediction Using Data Mining

Financial Market Efficiency and Its Implications

Event driven trading new studies on innovative way. of trading in Forex market. Michał Osmoła INIME live 23 February 2016

Finanzdienstleistungen (Praxis) Algorithmic Trading

Text Opinion Mining to Analyze News for Stock Market Prediction

Technical Accounting Alert

Need for Speed: An Empirical Analysis of Hard and Soft Information in a High Frequency World

Advanced Big Data Analytics with R and Hadoop

Earnings Announcement and Abnormal Return of S&P 500 Companies. Luke Qiu Washington University in St. Louis Economics Department Honors Thesis

Overlapping ETF: Pair trading between two gold stocks

Paper 2. Derivatives Investment Consultant Examination. Thailand Securities Institute November 2014

Technical analysis is one of the most popular methods

Asset Management Solutions for Research Analysts THE CHALLENGE YOUR END-TO-END RESEARCH SOLUTION

Market Microstructure: An Interactive Exercise

Leveraging Big Data. A case study from Thomson Reuters

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology

Market Efficiency: Definitions and Tests. Aswath Damodaran

Online Appendix for. On the determinants of pairs trading profitability

The Evolution of Price Discovery in US Equity and Derivatives Markets

ASX 30-Day Interbank Futures

Chapter 2.2. Company Fundamentals

Stock Returns Following Profit Warnings: A Test of Models of Behavioural Finance.

Tweets Miner for Stock Market Analysis

Social Market Analytics, Inc.

Sentiment Score based Algorithmic Trading

THOMSON REUTERS STARMINE QUANTITATIVE ANALYTICS

A Proposed Prediction Model for Forecasting the Financial Market Value According to Diversity in Factor

CS224N Final Project: Sentiment analysis of news articles for financial signal prediction

The Institute of Trading and Finance. The Master in Trading Course (For futures)

Exploring the use of Big Data techniques for simulating Algorithmic Trading Strategies

Chapter 4.3. News Analysis

White Paper Electronic Trading- Algorithmic & High Frequency Trading. PENINSULA STRATEGY, Namir Hamid

EVIDENCE IN FAVOR OF MARKET EFFICIENCY

Prediction of Stock Performance Using Analytical Techniques

For more than 35 years the world s investment professionals have trusted FactSet to power their research, analytics, and data integration.

CHAPTER 11: THE EFFICIENT MARKET HYPOTHESIS

FOR FOREIGN EXCHANGE PROFESSIONALS

MARKETS, INFORMATION AND THEIR FRACTAL ANALYSIS. Mária Bohdalová and Michal Greguš Comenius University, Faculty of Management Slovak republic

Understanding the Equity Summary Score Methodology

Equity forecast: Predicting long term stock price movement using machine learning

Applying Machine Learning to Stock Market Trading Bryce Taylor

International Academy of Exchange Trading. Lesson 7: Fundamental Analysis

GLOBAL INVESTMENT PORTFOLIO CONSTRUCTION AND BIBF DEALING ROOM

Digitization of Financial Markets: Impact and Future

Department of Accounting and Finance

The European Central Bank s Minimum Bid Rate and Its Effect on Major Currency Pairs

ROYAL LONDON ABSOLUTE RETURN GOVERNMENT BOND FUND

PRODUCT KEY FACTS Samsung TOPIX Daily (2x) Leveraged Product

News Trading and Speed

Factors Influencing Retail Investors Attitude Towards Investing in Equity Stocks: A Study in Tamil Nadu

STRATEGY OF STOCK VALUATION BY FUNDAMENTAL ANALYSIS

Big Data and High Quality Sentiment Analysis for Stock Trading and Business Intelligence. Dr. Sulkhan Metreveli Leo Keller

Understanding the Technical Market Indicators

Introduction... 4 A look at Binary Options Who Trades Binary Options? Binary Option Brokers... 5 Individual Investors...

From Saving to Investing: An Examination of Risk in Companies with Direct Stock Purchase Plans that Pay Dividends

EUR/USD Trading Strategy

Algorithmic trading - Overview. Views expressed herein are personal views of the author

Good News: Using News Feeds with Genetic Programming to Predict Stock Prices

Execution Costs of Exchange Traded Funds (ETFs)

ANALYSIS AND MANAGEMENT

Transcription:

Enabling Sophisticated Financial Text Mining Calum Robertson Research Analyst, Sirca Background Data Research Strategies Obstacles Conclusions Overview

Background Efficient Market Hypothesis Asset Price reflects all available information Sources of Information Trading Data Numerical (i.e., Price, Volume, etc.) News Textual (i.e., Macroeconomic Announcements, Earnings Announcements, Press Announcements) Background Purpose of Market Efficiency Research Academic Evaluate efficiency of market Speculate reasons for supposed inefficiencies (Behavioural Finance) Shiller, R J (2003) From Efficient Markets Theory to Behavioral Finance. Journal of Economic Perspectives 17, 83-104. Commercial Exploit market inefficiencies for profit (Algorithmic traders) Manage Risk (Fund Managers) Market Regulation (i.e., prevent insider trading)

Background Limitations of Market Efficiency Research Many factors can contribute to the efficiency of a market Some unmeasurable Data Availability Cost Reliability High Frequency Research Noise is common Computationally expensive to identify patterns Data Sources of High Frequency Trading Data Financial Data Providers Thomson Reuters Bloomberg Sources of News Newspapers Websites Governments (Macroeconomic Announcements) Financial Data Providers Thomson Reuters Bloomberg

Sirca Data Thomson Reuters Integrated Data Network (IDN) 1996 present Total of over 400 TB of compressed data Currently receive ~ 50 GB of data per day Includes over 130 GB of compressed news Currently 19 supported languages g Currently receive up to 200,000 stories per day Event Studies Research Strategies Identify abnormalities in the Time Series and attempt to find Factors which h correlate E.g., the effect of Macroeconomic news on Foreign exchange rates Ederington, L and J Lee (1995) The Short-Run Dynamics of the Price Adjustment to New Information. Journal of Financial & Quantitative Analysis 30, 117-34. Identify Factors suspected to affect Efficiency (e.g., news) and Evaluate the Time Series before and after E.g., the effect of a different types of news on stock prices Kalev, P S, et al. (2004) Public Information Arrival and Volatility of Intraday Stock Returns. Journal of Banking and Finance 28, 1441-67.

Research Strategies Text Classification Categorise News Based on abnormalities in Time Series Train Classifier based on News Content and Document Category Predict Time Series abnormality after news E.g., Predict 3% returns within 60 minutes of Press Announcement Mittermayer, M-A (2004) Forecasting Intraday Stock Price Trends with Text Mining Techniques. In Proceedings of 37th Annual Hawaii International Conference on System Sciences (HICSS'04). E.g., Predict Volatility of returns with various time windows exceeding several standard deviations from the mean Robertson, C S, et al. (2007) News Aware Volatility Forecasting: Is the Content of News Important. In Proceedings of Australasian Data Mining Conference, 157-66. Research Strategies Sentiment Analysis Analyse the frequency of Positive and Negative terms/phrases in News to Predict Market Movement E.g., Sentiment Analysis of a popular column to evaluate Market Wide movement on following day. Tetlock, P C (2007) Giving Content to Investor Sentiment: The Role of Media in the Stock Market. Journal of Finance 62, 1139-68.

Data Volume Obstacles Problem: Too much information for most users Solution: Allow users to acquire aggregated time series of trading data (e.g., minute by minute, hourly, daily) Sirca has successfully provided this service for years Allow users to perform full text search to accurately identify the type of news they require. Sirca has developed services to allow customers to perform searches using any combination of» Topic Codes» Company Codes» Full Text Language Obstacles Problem: News is delivered in 19 different languages Solution: Utilise IBM s International Components of Unicode (ICU) tool to parse text in any language. Text Classification & Sentiment Analysis Problem: Large number of features in the text make if difficult to perform large scale text classification and sentiment analysis Solution: Calculate Term Counts for every news article Produce Report of term frequencies within documents matching user search criteria

Usability Obstacles Problem: Many complex, though routine, operations required to perform sophisticated t financial i text t mining. i Solution: Developing a suite of Web Services to allow users to easily combine our data and standard techniques to conduct their own novel research. Conclusions Many Obstacles to Large Scale Financial Text Mining Sirca is actively working to promote research within the field.

Questions?