A Review of Artificial Intelligence and Biologically Inspired Computational Approaches to Solving Issues in Narrative Financial Disclosure

Size: px
Start display at page:

Download "A Review of Artificial Intelligence and Biologically Inspired Computational Approaches to Solving Issues in Narrative Financial Disclosure"

Transcription

1 A Review of Artificial Intelligence and Biologically Inspired Computational Approaches to Solving Issues in Narrative Financial Disclosure Saliha Minhas 1, Soujanya Poria 2, Prof. Amir Hussain 1, Prof Khalid Hussainey 1, 1 Dept. of Computing Science, Univerity of Stirling, FK9 4LA, Scotland 2 Dept. of Computing Science, Jadavpur University, West Bengal, India szm@stir.ac.uk, ahu@cs.stir.ac.uk, soujanya.poria@ieee.org, khaled.hussainey@stir.ac.uk Abstract. Indisputably, financial reporting has a key role to play in the efficient workings of capitalist economies. Problems related to agency and asymmetric information (Jensen and Meckling, 1976) would abound and cripple financial markets, as it has done when left unchecked (Enron, WorldCom and Tyco). However for too long, quantitative data has monopolised the assessment and prediction role within this arena and this has contributed to the failures, borne out by research (Kumar & Ravi, 2007). As qualitative data proliferates, containing value relevant information it needs to be factored into the analysis. This paper reviews work on financial narrative disclosures and looks at conventional artificial intelligence and more recent biologically inspired computational approaches to catapult the domain to more progressive methods of using linguistic data in evaluations. Keywords: Narrative Financial Disclosure, Biologically Inspired, SenticNet

2 1 Introduction Efficient allocation of resources in an economy, capital market development, liquidity in the market, decreased cost of capital, lower return volatility, and high analyst forecast accuracy (Kothari, Li and Short, 2009) To reach this aim, narrative disclosures can play a significant role, especially given the deluge of unstructured content that abounds and could contain value relevant information. However, despite this potential by far analysis and prediction has been predominantly performed using quantitative data. There is a reason why the saying number s don t lie prevails and not the text doesn t lie. Numerical data is just easier to deal with, whereas text is riddled with ambiguities and difficult to decipher for humans, let alone computers. However, given the deluge, taming text must be done. Within the Financial Reporting domain it would significantly enhance decision making powers of stakeholders. The next section reviews research work in narrative financial reporting to appreciate the task at hand. Thereafter, significant artificial intelligence/biologically inspired approaches are examined to assess their capabilities to decipher content. In particular, the potential of Sentic Computing (Cambria and Hussain, 2012) cyrstalized into SenticNet, an opinion mining engine will be examined. This is built on ontological affective common sense representation and seeks to mimic cognitive processes in handling linguistic data. Preliminary results from this engine run on extracts of company annual reports will be given. 2 Literature Review - Narrative Disclosures Despite being overshadowed by the reams of work in numerical analysis, deciphering narrative disclosure in accounting documents has had its proponents. A catalogue of significant research in this field is given in appendix one. This research can be examined from two angles, delineated below:

3 2.1. A review of methodology applied to understand the narrative content. Merkl-Davies et al (2013) developed a taxonomy to give structure to the relationship between research paradigms and text analysis approaches. They correctly point out that content analysis is the main textual analysis approach used within accounting narrative domain. It involves draw[ing] inferences from data by systematically identifying characteristics within the data (Clatworthy et al, 2001). Within content analysis, two approaches are typically taken: form orientated analysis, which involves routine counting of words or concrete references, and meaning orientated'' analysis, which focuses on analysis of the underlying themes in the texts under investigation (Smith and Taffler, 2000). Form orientated can be quantitative, use of proxies is common to enable statistical analysis or it can be qualitative, searching for occurrence of predefined content categories within texts. As Davies et al (2013) argue what methodology is chosen depends on the aims of the research. Other studies (Li, 2008; Jones and Shoemaker, 1994) have looked at readability of narratives to typically check if the management obfuscation hypothesis is being played out. This is assessed by a readability formula that counts language variables in a text in order to provide a measure of reading difficulty for readers. Still others (Demers and Vega, 2008; Brennan et al, 2008) have looked at rhetorical manipulation, use of certain linguistic choices to influence meaning. This would again be relevant in the area of impression management and fraudulent activity. Another nuance within content analysis work is that it can be performed either with a dictionary or a statistical based approach. In the former, dictionaries are employed to pick up the meaning and polarity of the text. For example (Kothari, Li and Short, 2009) put words into categories through use of a word list and then ascertained polarity (positive/negative) using the General Inquirer dictionary. Whereas, (Feldman et al., 2010) used Loughran and Macdonald s (2011) financial word list with polarity for detecting (positive/negative) tone. This has been touted as being more relevant for the financial domain. Typically once polarity of text has been determined, the results are quantified and used within regression analysis. There is a number of significant studies performed in this manner (Tetlock, 2007, Kothari, Li and Short, 2009, Huang, 2011). The use of dictionaries can be cumbersome as they need to be updated and require expert manual effort. Statistical approaches use a variety of generative, probabilistic models based on bag of words approach and maximizing the likelihood of the observed word being in a class to gauge meaning (Manning and Shutze, 1999). The main criticism is the lack of lexical analysis done, therefore poor capture of semantics.

4 Smith and Taffler (2000) used OCP and SPSS to perform content analysis. They attempted to capture both themes and words. They looked for certain keywords that denote profitability, dividends, resignations etc. They also captured themes by examining combinations of words that would fit into designated thematic components. Keywords and captured themes are given numerical scores to correlate with bankruptcy (tested using failed firms). The scores are fed into linear discriminant models which show that chairman s statement in annual reports highly correlative with bankruptcy. Beattie et al (2004) use content analysis software to code each sentence into one of ten categories and into four dimensions. Coders go through the software to aid in this process. Although methodology is comprehensive in the review of annual report narratives, it was labour intensive. Hussainey (2003) attempted to move away from manual analysis, with a view to capturing forward looking statements in narratives by drawing up keyword list that were indicative of this class. There are a number of other similar manual/semi-automated content analysis performed (Botosan, 1997; Ahmed et al.,1999; Kothari, Li and Short, 2009; Li, 2011). The main problem with these approaches is that it requires human interaction and judgement and is thus subject to reliability issues (such as inter coder agreement on terms), it is error prone and expensive. Therefore samples sizes tend to be small. There has been some recent work in the accounting narrative domain using a more complete computerised content analysis and is now delineated. Li (2010) categorised 30,000 sentences containing Forward Looking Statement (FLS) into two dimension (positive/negative) and content (profitability, operations, liquidity). This training data is fed into a Naïve Bayes algorithm which is used to categorize the tone and content of about 13 million FLS. The problem with this approach is that no lexical analysis is done, each word is deemed independent of any other word and categorisation is often done using frequencies computed as prior/conditional probabilities. This is simplistic/naïve from a linguistic point of view. Demers and Vega (2008) use dictionaries to measure linguistic tone to determine optimism and certainty in narratives. This is then build into a Bayesian learning model which predicts that sentiments expressed as linked to stock returns. Similarly, Tetlock (2007) and Feldman (2008) detect the relationship between sentiment expressed in narratives and stock market reactions. The former looks at frequencies of negative words, whereas the latter uses dictionaries to determine polarity.

5 Brown and Tucker (2010) used the vector space model of information retrieval to determine similarities in accounting statements. The similarity of any two documents is measured by the angle between the two vectors representing the documents, a smaller angle indicates more similar documents. TF-IDF weighting function was used within the vectors, this picks up words that are more salient in document as compared to the corpus. This vector space comparison can also be done at sentence level. Qiu et al (2004) build a Support Vector Machine (SVM) based predictive model based on textual content and experimented with different feature selection methods. Balakrishnan et al (2010) again use a bag of word approach with TF/IDF scores to develop a predictive model where linguistic data predicts performance. Detection of financial misreporting has attracted research work, given the rise in fraudulent activity. Hobson et al (2012) build on cognitive dissonance theory (Festinger, 1957) - psychological discomfort when one s actions and beliefs are discrepant. They look for emotions such as fear, uncertainty, guilt, shame which give a high cognitive dissonance marker score that are indicative of deception. They apply this approach to non-verbal cues to CEO speech samples. Goel (2008) build an SVM classifier used to detect fraud. Linguistic features such as active/passive voice (look at personal pronouns used), uncertainty markers, readability index, tone, usage of proper nouns, type-token ratio etc were used. This helped to distinguish fraudulent annual reports from non-fraudulent reports. Generally researchers perform linguistic analysis identifying a set of lying words that pertain to deceptive language (Larcker and Zakolyukina, 2011; Newman et al., 2003; Loughran and McDonald, 2009). The above research has still failed to capture the true potential of natural language processing capabilities. In particular, to guage tone, sentiment in narratives, to distinguish it from factual statement falls short of opinion mining and sentiment analysis potential. Some of these technologies will be examined in the Section 3. In general, there is still a paucity of research that uses Biologically Inspired/Artificial Intelligence approaches that can utilise the knowledge gained from the research into accounting narratives (see appendix one) and that can extract meaning from text.

6 2.2 A review of the findings of methodology applied. Appendix one, gives an overview of salient research in the accounting narrative research. As said by Qui et al (2004) these studies emanate from the intuitive recognition of a link between the textual report content and corporate performance. This indeed is characteristic of the findings outlined in appendix one, the narratives are correlated with variables, some of which are proxied eg Kothari et al (2009) used the cost of capital, stock return volatility and analyst forecast dispersion to proxy for firm risk. In the majority of cases it is found that the narratives have information content of a predictive nature (see appendix one) or that it sheds light on management actions or explains industry specific disclosure practices. In all the text analysis of narratives conducted the researchers were keen to pick up forward looking information and tone as it relays messages that can be significant for stock markets and industry analysts. This was primarily done using keyword searches, which can be improved upon by concept based opinion mining tools (Cambria and Hussain, 2012). Recently the literature has been dominated by looking for linguistic cues that are indicative of impression management or fraud. Of primary importance in any textual analysis is the picking out of the most representative features. This has been more of a challenge given the ambiguous nature of these phenomena. Merkl-Davies and Brennan (2007) identified six strategies used for concealment. Two of these obfuscate bad news by manipulating verbal information and four strategies emphasize good news by manipulating verbal and/or numerical information. The challenge is to develop features that would pick up such semantics. For example Goal (2008), used word frequencies, syntactic and surface level features for fraud detection. The research delineated in appendix one also indicates that agency and information asymmetry, proprietary cost, self-attribution bias are all found to be empirically linked to disclosure. One of the main ways that researchers have used (Feldman, 2010; Tetlock, 2007; Li, 2010) to capture firm intention is tone. Therefore any analysis and prediction that is done using narrative content needs to take into account the industry background, management motivations and biases. For example we know that in highly competitive industries or firms operating in a highly litigious environment, poor performance are all likely to lead to less disclosure or attempts to blame external circumstances for unfavourable results. Any approach that can represent such knowledge and correctly capture tone would improve predictive modelling that seeks to guage future company performance.

7 In sum, it can be said that voluntary disclosure literature appears to offer an opportunity to increase understanding of the role of accounting information in firm valuation and corporate finance. This knowledge could be represented within biologically inspired systems through the use of an ontology or expert system, which with the semantics extracted from the actual linguistic data could lead to improve predictive capabilities of financial models. 3. Artificial Intelligence (AI) - Biologically Inspired Approaches Ekbia (2010) outlined some of the main approaches in AI and their basic understanding of intelligence. They are outlined below with brief reviews on their applicability to the financial domain:- Knowledge-Intensive AI is motivated by the idea that a vast amount of knowledge is the key to intelligence, in particular machines need to be given common sense knowledge for it to understand the meaning of concepts. SenticNet would fall under this umbrella, as would expert systems. These are programs that use information in a knowledge base, using a set of inference procedures, to solve problems that require significant human expertise for their solution. Expert systems (ES) lend themselves to dealing with qualitative data, as the knowledge base can be updated and inference mechanism adjusted. The information is used to mimic the decision making of experts. Expert Sytems have been used in forecasting for example FINEVA (FINancian AVAaluation) is a multicriteria knowledge-based ES to assess firm performance and viability. The output of this system demonstrates the ranking of analyzed firms based on class of risk. Within narratives the determinant of disclosure could be added to the knowledge base eg poor readability could be resultant from poor performance, fraud or that prevalence of good news and self references are indicative of good firm prospects or higher number of forward looking statements given to counteract stock prices which poorly reflect future earnings. In financial prediction and planning domain, several ES have been compared to statistical methods. Results given by Bahrammirzaee (2010) indicate that Experts systems outperform. However ES provide a prescription and not a prediction. That means that if a goal is given then, a knowledge based ES suggests a course of action, while a simulation model predicts the consequences of a selected course of action under some experimental conditions (Bahrammirzaee, 2010). Case-Based AI. The premise of this approach is that people reason and learn from experience and that the knowledge acquired from experience, captured in cases, is central to their understanding of the world. This would translate into AI as indexing cases as they occur so that, much later, they can be remembered and efficiently retrieved in similar situations. The algorithm would use such experience to reason about the task at hand. Sheng-Li et al (2009) argue that Case Based Reasoning (CBR) has good predictive capabilities, it is non parametric method which does not require any data distribution assumptions, an incremental learning technique that can retain new cases without reprocessing. However the determination of similar cases can be problematic. The authors propose a hybrid model for the financial prediction task:

8 CBR augmented with Genetic Algorithm (GA) and the fuzzy K nearest neighbour (KNN) method. GAs are used to compute the optimal weight vectors of financial variables and KNN are used to compute distances between cases. The authors in their conclusion contend that financial ratios extracted for the knowledge elicitation task can be extended to pick up more features of interest with which to compare cases. This is again where the technique could be augmented with narrative content that would aid the prediction task. Both CBR and Artificial Neural Networks (ANN) have been intensively used in the finance domain. The evidence indicates that CBR can be better in some areas for example (Behbood, 2011) conclude that it gives better results when markets deviate from a stable equilibrium. Other researchers have also found it to have superior performance that ANN (Slonim, 2001; Shen 2012). This approach utilises one of the fundamentals of brain like activity, learning. It learns from experience and uses that knowledge to make new deductions Biologically Inspired or Artificial Intelligence. These are models and techniques inspired by natural mechanisms, such as those studied in Biology. Examples would include Artificial Neural Networks (ANN), Machine Learning (ML), and Genetic Algorithms (GA). In Finance, they have been used in areas such as, business failure prediction, debt risk assessment, security market applications, financial forecasting, portfolio management, fraud detection and insurance. For example ANNs have been designed using financial data of banking customers as the input vector and the actual decisions of the credit analyst as the desired output vector. The objective of the system would be to imitate the human expert in granting credit and setting credit limits. Bahrammirzaee (2010) in his comprehensive review of ANN, finds that they outperform traditional statistical approaches but questions their accuracy. This he concludes is due to the difficulty of determining the proper size and structure of a neural net for a given problem, poor pattern matching capabilities and prediction based on past events which may not be repeated, especially within a financial setting. Within forecasting, typically financial ratios are used to feed into these models, accuracy could be significantly enhanced by using data captured from narratives. For example sentiment polarity extracted by SenticNet or building up features from grammar rules, context analysis at the sentence level to capture company strategic intentions or likelihood of bankruptcy. This could be quantified and fed to the ANN. Furthermore, within the realm of machine learning Chi et al (2011) use both financial and non-financial features with SVM model for financial distress prediction and get improved results as compared with approaches that use only financial features. SVM overcome the weakness ANN which tend to converge on local optimum with respect to the training data and generalise better with respect to the test data. SVM are known as maximal margin classifier in which the classification problem can be represented as an optimization process. Genetic Algorithms have been used to find parameters for an SVM for diagnosing business crises (Chih-Hung W and Gwo-Hshiung Tzeng, 2007). Genetic algorithms find the optimal set of features which are then added to the feature set, boosting performance of the SVM classifiers.

9 Outwith these 3 main approaches, hybrid systems are worth mentioning because hybridization (e.g., mixing different functions in order to perform a complex task) would also fall into the realm of Artificial Intelligence (AI). For example, Kuo et al (1996) designed a system for stock market forecasting that concerned qualitative and quantitative factors simultaneously. This system was composed of integrating NN (Neural Net) for quantitative factors and fuzzy Delphi model for qualitative factors and got acceptable results. Hybrid systems typically combine the power of statistical models with neural nets and get superior result (Bahrammirzaee, 2010). Neural Nets are merged with Fuzzy Logic mechanisms or probabilistic methods. These tools can also be used on their own and fall under the soft computing category. They are widely used in financial applications as are tolerant to imprecision and uncertainty, which a human expert would deal with regularly. For example, Tan et al (2005) propose Genetic complementary learning system (a combination of the Genetic Algorithm and Neural Nets). It is based on hippocampus-inspired learning that is believed to be a mechanism underlying pattern recognition in human. This coupled with GA the authors contend offer good solutions to stock price and bank failure prediction task. The authors themselves in the conclusion advocate that its predictive capabilities would be enhanced with more narrative content inclusion. To date the success of any prediction and modelling techniques employed using narratives have rested on the correct identification of features that capture semantics of the data in question. Within financial narratives for example a number or researchers have tried to pick up tone/sentiments expressed to link it to performance, strategic intentions, potential bankruptcy [see Appendix One]. However keyword searches are limited to syntactic analysis and fail to capture meaning which can be dispersed within natural language. 4. Preliminary Results Sentic Net To correct this deficiency, Cambria et al (2012) propose a concept based technology (SenticNet) using an ontology ConceptNet (a common sense knowledge base) tuned to picking up emotive content, using biologically-inspired affective categorisation model that can potentially describe the full range of emotional experiences. Dimensionality reduction techniques are employed to cluster similar concepts together. This when deployed within a financial narrative setting could pick up tone with polarity rating, more accurately. Loughran and Macdonald, 2012 (will be referred to as L&M)) build up a dictionary of words that denote sentiment in Finance, these could be used to build up the concepts within ConceptNet. For example L&M argue that words such as (tax, costs, loss, capital, cost, expense, and expenses) denoted as negative words generally. Within a financial setting, firm costs, sources of capital, or the amount of tax paid are neutral in nature; managers using this language are merely describing their operations. Once such relationships are captured within ConceptNet, SenticNet with its polarity detection and clustering techniques can classify sentences by tone. Thereafter once a sentence has been correctly tagged for its polarity, it can be used as training data to aid classification (this was done by Cambria et al, 2013 using Artificial Neural Network, which was fed affectively annotated text using SenticNet [43][44]). The above approach was followed (L&M s

10 financial wordlist was incorporated into SenticNet) and the algorithms were run over extracts (the management discussion and analysis section) from ten company annual reports within the UK Food and Beverage sector. Some of the sentences passed to SenticNet with results are shown in appendix two. SenticNet was able to pick up the correct tone in all sentences that were put through it. This would be highly beneficial as tone within a financial context is taken as indictor of company performance, likely stock markets returns, used in fraud detection, early warning of company distress and impression management. Plethora of researchers have used tone to guage firm behaviour [see Appendix One]. Diction (the words that the author uses) are one of the main ways to determine tone (Nordquist, 2012) and this is what SenticNet is performing, examining words and the concepts that are enveloped within these words, demystified using an affective ConceptNet ontology.

11 5. Discussion and Conclusion It is generally accepted that outbreak of corporate financial crises have been preceded by symptoms and alarms. Predictive modelling within finance has missed out on forecasting some big blunders. A significant contributing factor is that it has failed to take into account the abounding narrative content which many have touted to be key to knowledge discovery. The focus solely on non-text derived quantitative data is limiting as it is mostly backward-looking, point-in-time measures and therefore leads to inaccurate prediction. Research done on narrative disclosure strongly indicate that value relevant information is contained in documents such as annual reports, business news, analyst reviews. The question boils down to how to extract the relevant semantic information contained in textual format for use in analysis and prediction tasks? SenticNet for opinion mining was mentioned as a way forward to capture semantics in a robust and complete manner. SenticNet correctly picked up tone from extracts of company annual reports. This would greatly aid predictive capabilities of financial models as tone from extensive research in narratives has been shown to be an accurate indicator of firm intention and prospects. To build Biologically Inspired or Cognitive Systems a vital component is learning and using it to improve its performance. This is where SenticNet can be developed further to completely build computational model that mimic the human brain. Within finance analysis and prediction could be done using the captured semantics through cognitive systems like SenticNet and then results fed into Machine Learning algorithms, as shown by Cambria et al (2013). Similarly, semantics of non-opinionated data could be captured through ontologies, probabilities, lexical analysis as part of the building blocks for cognitive systems. What techniques are chosen depends on the task at hand. The question that needs to be asked is What data structures are useful for representing knowledge and what algorithms operate on those knowledge structures to produce intelligent behaviour? (Lee and Ho, 2009). Bahrammirzaee, (2010) concludes after reviewing previous research that the accuracy of these artificial intelligent methods is superior to that of statistical methods in dealing with financial problems, especially with regard to nonlinear patterns. References 1. Ahmed, K., & Courtis, J. K. (1999) Associations between corporate characteristics and disclosure levels in annual reports: a meta-analysis. British Accounting Review, 31(1), Bahrammirzaee, A. (2010) A comparative survey of artificial intelligence applications in finance: artificial neural networks, expert system and hybrid intelligent systems Neural Computing and Applications, Vol 19, Issue 8.

12 3. Balakrishnan, R., X. Y. Qiu, and P. Srinivasan. (2010) On the predictive ability of narrative disclosures in annual reports. European Journal of Operational Research Beattie, V., McInnes, W., & Fearnley, S. (2004) A methodology for analysing and evaluating narratives in annual reports: a comprehensive descriptive profile and metrics for disclosure quality attributes. Accounting Forum, 28 (3), Behbood, V. (2011) Intelligent financial warning model using Fuzzy Neural Network and case-based reasoning Computational Intelligence for Financial Engineering and Economics (CIFEr), 2011 IEEE Symposium). 6. Botosan, C. A. (1997) Disclosure level and the cost of equity capital The Accounting Review, 72 (3), Brennan et al (2007) Impression management: developing and illustrating a scheme of analysis for narrative disclosures a methodological note Accounting, Auditing and Accountability Journal, 22 (5): Brown and Tucker (2010) Large-Sample Evidence on Firms Year-over-Year MD&A Modifications Journal of Accounting research, Volume 49, Issue Cambria E. and Hussain A. (2012) Sentic Computing: Techniques, Tools, and Applications. Dordrecht, Netherlands: Springer, ISBN: Cambria, E. Mazzocco, T. and Hussain A. (2013) Application of multi-dimensional scaling and artificial neural networks for biologically inspired opinion mining Biologically Inspired Cognitive Architectures, doi: /j.bica Chi X. et al. (2011). Financial distress prediction based on SVM and MDA methods: the case of Chinese listed companies, Quality and Quantity, Volume 45, Number 3, pp (16). 12. Chih-Hung W, Gwo-Hshiung Tzeng (2007) A real-valued genetic algorithm to optimize the parameters of support vector machine for predicting bankruptcy Expert Systems with Applications Clatworthy, M. and Jones, M.J. (2001), The effect of thematic structure on the variability of annual report readability, Accounting, Auditing and Accountability Journal, Vol. 14 No. 3, pp Demers and Vega (2008) Linguistic Tone in Earnings Announcements: News or Noise?Linguistic Tone in Earnings Announcements: News or Noise? FRB International Finance Discussion Paper 15. Ekbia H.(2010) Fifty years of research in artificial intelligence Annual Review of Information Science and Technology, Volume 44 Issue 1.

13 16. Feng L. (2010) The Information Content of Forward-Looking Statements in Corporate Filings A Naïve Bayesian Machine Learning Approach Journal of Accounting research, Volume 48, Issue Feldman, R., Govindaraj, S., Livnat, J. and Segal, B. (2010), Management tone change, post earnings announcement drift and accruals, Review of Accounting Studies, Vol. 5 No. 4, pp Festinger, L. (1985. (first published 1957)), A theory of cognitive dissonance, Stanford, CA: Stanford University Press 19. Goel S., Gangolly J. (2010) Can Linguistic Predictors Detect Fraudulent Financial Filings? Journal of Emerging Technologies in Accounting, Vol. 7, No. 1, pp Hobson, J. L et al. (2012) Analysing Speech to Detect Financial Misreporting. Journal of Accounting Research Hussainey, K., Schleicher, T. and Walker, M. (2003). Undertaking Large-scale Disclosure Studies when AIMR-FAF Ratings are not available: the Case of Prices Leading Earnings. Accounting and Business Research, 33(4): Huang X. et al (2011), Tone Management Social Science Research Network 23. Jensen, M. and Meckling, W. (1976). Theory of the Firm: Managerial Behavior, Agency Costs and Ownership Structure Foundations of Organizational Strategy. 24. Jones, M.J. and Shoemaker, P.A. (1994). Accounting Narratives: A Review of Empirical Studies of Content and Readability Journal of Accounting Literature, 13: Kothari S., Li X., and Short J. (2009). The Effect of Disclosures by Management, Analysts, and Business Press on Cost of Capital, Return Volatility, and Analyst Forecasts: A Study using Content Analysis. The Accounting Review. September Kumar R. and Ravi V. (2007) Bankruptcy prediction in banks and firms via statistical and intelligent techniques A review, European Journal of Operational Research. 27. Kuo, R. J (1996). "Integration of artificial neural networks and fuzzy Delphi for stock market forecasting Systems, Man, and Cybernetics, IEEE International Conference 28. Larcker D. and Zakolyukina A. (2011) Detecting Deceptive Discussions in Conference Calls Journal of Accounting Research Volume 50, Issue Li, F. (2008) Annual report readability, current earnings, and earnings persistence Journal of Accounting and Economics 45: Li, F. (2010) The Information Content of Forward Looking Statements in Corporate Filings A Naïve Bayesian Machine Learning Approach Journal of Accounting Research Vol. 48 No. 5 December 2010

14 31. Loughran, T. and B. McDonald (2009) Barron s Red Flangs: Do They Actually Work? Working Paper,University of Notre Dame. 32. Merkl-Davies, D.M., Brennan, N.M and Vourvachis, P. (2013) A taxonomy of text analysis approaches in corporate narrative reporting research, working paper, Centre for Impression Management in Accounting Communication, Bangor Business School, UK. 33. Merkl-Davies, D. and Brennan, N. (2007) Discretionary Disclosure Strategies in Corporate Narratives: Incremental Information or Impression Management? Journal of Accounting Literature, 26: Manning and Shutze (1999), Foundations of Statistical Natural Language Processing (Book). 35. Newman M, J. Pennebaker, D. Berry S. (2003) Lying words: Predicting deception from linguistic styles Personality and Social Psychology Bulletin 29 (5): Nordquist R. (2012) Grammar & Composition Guide Book. 37. Shen Q. et al (2012) CBR case retrieval model research in business financial distress warning based on gray relation Proceedings - IEEE Symposium on Robotics and Applications, ISRA Sheng-Tun, Li and Hei-Fong Ho (2009) Predicting financial activity with evolutionary fuzzy case-based reasoning Expert Systems with Applications,January 2009, Pages Slonim T. (2001) Design issues in fuzzy case-based reasoning Fuzzy Sets and Systems, Vol 117, Issue Smith, M. and Taffler, R.J. (2000), The chairman s statement: a content analysis of discretionary narrative disclosures, Accounting, Auditing and Accountability Journal Vol. 13 No. 5, pp Tetlock P. (2007) Giving Content to Investor Sentiment: The Role of Media in the Stock Market The Journal of Finance, Volume 62, Issue 3, pages , June Tan, T. et al (2005) Brain-inspired Genetic Complementary Learning for Stock Market Prediction Evolutionary Computation, The 2005 IEEE Congress.

Ph.D. Program in Economics and Management. Accounting and governace The role of Corporate Disclosure 3 rd Term

Ph.D. Program in Economics and Management. Accounting and governace The role of Corporate Disclosure 3 rd Term Ph.D. Program in Economics and Management Accounting and governace The role of Corporate Disclosure 3 rd Term Saverio Bozzolan Department of Economics and Management Via del Santo, 33 E-mail: Saverio.bozzolan@unipd.it

More information

Sentiment analysis for news articles

Sentiment analysis for news articles Prashant Raina Sentiment analysis for news articles Wide range of applications in business and public policy Especially relevant given the popularity of online media Previous work Machine learning based

More information

Business Intelligence and Decision Support Systems

Business Intelligence and Decision Support Systems Chapter 12 Business Intelligence and Decision Support Systems Information Technology For Management 7 th Edition Turban & Volonino Based on lecture slides by L. Beaubien, Providence College John Wiley

More information

DATA MINING TECHNIQUES AND APPLICATIONS

DATA MINING TECHNIQUES AND APPLICATIONS DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,

More information

Neural Network Applications in Stock Market Predictions - A Methodology Analysis

Neural Network Applications in Stock Market Predictions - A Methodology Analysis Neural Network Applications in Stock Market Predictions - A Methodology Analysis Marijana Zekic, MS University of Josip Juraj Strossmayer in Osijek Faculty of Economics Osijek Gajev trg 7, 31000 Osijek

More information

Neural Networks for Sentiment Detection in Financial Text

Neural Networks for Sentiment Detection in Financial Text Neural Networks for Sentiment Detection in Financial Text Caslav Bozic* and Detlef Seese* With a rise of algorithmic trading volume in recent years, the need for automatic analysis of financial news emerged.

More information

Intrusion Detection via Machine Learning for SCADA System Protection

Intrusion Detection via Machine Learning for SCADA System Protection Intrusion Detection via Machine Learning for SCADA System Protection S.L.P. Yasakethu Department of Computing, University of Surrey, Guildford, GU2 7XH, UK. s.l.yasakethu@surrey.ac.uk J. Jiang Department

More information

KNOWLEDGE-BASED IN MEDICAL DECISION SUPPORT SYSTEM BASED ON SUBJECTIVE INTELLIGENCE

KNOWLEDGE-BASED IN MEDICAL DECISION SUPPORT SYSTEM BASED ON SUBJECTIVE INTELLIGENCE JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 22/2013, ISSN 1642-6037 medical diagnosis, ontology, subjective intelligence, reasoning, fuzzy rules Hamido FUJITA 1 KNOWLEDGE-BASED IN MEDICAL DECISION

More information

Sentiment analysis on tweets in a financial domain

Sentiment analysis on tweets in a financial domain Sentiment analysis on tweets in a financial domain Jasmina Smailović 1,2, Miha Grčar 1, Martin Žnidaršič 1 1 Dept of Knowledge Technologies, Jožef Stefan Institute, Ljubljana, Slovenia 2 Jožef Stefan International

More information

Social media based analytical framework for financial fraud detection

Social media based analytical framework for financial fraud detection Social media based analytical framework for financial fraud detection Abstract: With more and more companies go public nowadays, increasingly number of financial fraud are exposed. Conventional auditing

More information

Prediction of Stock Performance Using Analytical Techniques

Prediction of Stock Performance Using Analytical Techniques 136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University

More information

Marketing Mix Modelling and Big Data P. M Cain

Marketing Mix Modelling and Big Data P. M Cain 1) Introduction Marketing Mix Modelling and Big Data P. M Cain Big data is generally defined in terms of the volume and variety of structured and unstructured information. Whereas structured data is stored

More information

How To Use Neural Networks In Data Mining

How To Use Neural Networks In Data Mining International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and

More information

Learning is a very general term denoting the way in which agents:

Learning is a very general term denoting the way in which agents: What is learning? Learning is a very general term denoting the way in which agents: Acquire and organize knowledge (by building, modifying and organizing internal representations of some external reality);

More information

Positive Versus Negative News: Readability and Obfuscation in Financial Reports

Positive Versus Negative News: Readability and Obfuscation in Financial Reports Business, Economics and Public Policy Working Papers 2010-5 Positive Versus Negative News: Readability and Obfuscation in Financial Reports Leopold Bayerlein School of Business, Economics and Public Policy

More information

Facilitating Business Process Discovery using Email Analysis

Facilitating Business Process Discovery using Email Analysis Facilitating Business Process Discovery using Email Analysis Matin Mavaddat Matin.Mavaddat@live.uwe.ac.uk Stewart Green Stewart.Green Ian Beeson Ian.Beeson Jin Sa Jin.Sa Abstract Extracting business process

More information

Knowledge Discovery from patents using KMX Text Analytics

Knowledge Discovery from patents using KMX Text Analytics Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs anton.heijs@treparel.com Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers

More information

Data Mining and Soft Computing. Francisco Herrera

Data Mining and Soft Computing. Francisco Herrera Francisco Herrera Research Group on Soft Computing and Information Intelligent Systems (SCI 2 S) Dept. of Computer Science and A.I. University of Granada, Spain Email: herrera@decsai.ugr.es http://sci2s.ugr.es

More information

Equity forecast: Predicting long term stock price movement using machine learning

Equity forecast: Predicting long term stock price movement using machine learning Equity forecast: Predicting long term stock price movement using machine learning Nikola Milosevic School of Computer Science, University of Manchester, UK Nikola.milosevic@manchester.ac.uk Abstract Long

More information

Using Artificial Intelligence to Manage Big Data for Litigation

Using Artificial Intelligence to Manage Big Data for Litigation FEBRUARY 3 5, 2015 / THE HILTON NEW YORK Using Artificial Intelligence to Manage Big Data for Litigation Understanding Artificial Intelligence to Make better decisions Improve the process Allay the fear

More information

Bagged Ensemble Classifiers for Sentiment Classification of Movie Reviews

Bagged Ensemble Classifiers for Sentiment Classification of Movie Reviews www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 3 Issue 2 February, 2014 Page No. 3951-3961 Bagged Ensemble Classifiers for Sentiment Classification of Movie

More information

An Overview of Knowledge Discovery Database and Data mining Techniques

An Overview of Knowledge Discovery Database and Data mining Techniques An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,

More information

Using Text and Data Mining Techniques to extract Stock Market Sentiment from Live News Streams

Using Text and Data Mining Techniques to extract Stock Market Sentiment from Live News Streams 2012 International Conference on Computer Technology and Science (ICCTS 2012) IPCSIT vol. XX (2012) (2012) IACSIT Press, Singapore Using Text and Data Mining Techniques to extract Stock Market Sentiment

More information

Predicting the Stock Market with News Articles

Predicting the Stock Market with News Articles Predicting the Stock Market with News Articles Kari Lee and Ryan Timmons CS224N Final Project Introduction Stock market prediction is an area of extreme importance to an entire industry. Stock price is

More information

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant

More information

6.2.8 Neural networks for data mining

6.2.8 Neural networks for data mining 6.2.8 Neural networks for data mining Walter Kosters 1 In many application areas neural networks are known to be valuable tools. This also holds for data mining. In this chapter we discuss the use of neural

More information

The relation between news events and stock price jump: an analysis based on neural network

The relation between news events and stock price jump: an analysis based on neural network 20th International Congress on Modelling and Simulation, Adelaide, Australia, 1 6 December 2013 www.mssanz.org.au/modsim2013 The relation between news events and stock price jump: an analysis based on

More information

Improving Decision Making and Managing Knowledge

Improving Decision Making and Managing Knowledge Improving Decision Making and Managing Knowledge Decision Making and Information Systems Information Requirements of Key Decision-Making Groups in a Firm Senior managers, middle managers, operational managers,

More information

A Proposed Prediction Model for Forecasting the Financial Market Value According to Diversity in Factor

A Proposed Prediction Model for Forecasting the Financial Market Value According to Diversity in Factor A Proposed Prediction Model for Forecasting the Financial Market Value According to Diversity in Factor Ms. Hiral R. Patel, Mr. Amit B. Suthar, Dr. Satyen M. Parikh Assistant Professor, DCS, Ganpat University,

More information

Text Opinion Mining to Analyze News for Stock Market Prediction

Text Opinion Mining to Analyze News for Stock Market Prediction Int. J. Advance. Soft Comput. Appl., Vol. 6, No. 1, March 2014 ISSN 2074-8523; Copyright SCRG Publication, 2014 Text Opinion Mining to Analyze News for Stock Market Prediction Yoosin Kim 1, Seung Ryul

More information

DEVELOPMENT OF FUZZY LOGIC MODEL FOR LEADERSHIP COMPETENCIES ASSESSMENT CASE STUDY: KHOUZESTAN STEEL COMPANY

DEVELOPMENT OF FUZZY LOGIC MODEL FOR LEADERSHIP COMPETENCIES ASSESSMENT CASE STUDY: KHOUZESTAN STEEL COMPANY DEVELOPMENT OF FUZZY LOGIC MODEL FOR LEADERSHIP COMPETENCIES ASSESSMENT CASE STUDY: KHOUZESTAN STEEL COMPANY 1 MOHAMMAD-ALI AFSHARKAZEMI, 2 DARIUSH GHOLAMZADEH, 3 AZADEH TAHVILDAR KHAZANEH 1 Department

More information

Customer Classification And Prediction Based On Data Mining Technique

Customer Classification And Prediction Based On Data Mining Technique Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor

More information

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS Stacey Franklin Jones, D.Sc. ProTech Global Solutions Annapolis, MD Abstract The use of Social Media as a resource to characterize

More information

01219211 Software Development Training Camp 1 (0-3) Prerequisite : 01204214 Program development skill enhancement camp, at least 48 person-hours.

01219211 Software Development Training Camp 1 (0-3) Prerequisite : 01204214 Program development skill enhancement camp, at least 48 person-hours. (International Program) 01219141 Object-Oriented Modeling and Programming 3 (3-0) Object concepts, object-oriented design and analysis, object-oriented analysis relating to developing conceptual models

More information

Fuzzy Cognitive Map for Software Testing Using Artificial Intelligence Techniques

Fuzzy Cognitive Map for Software Testing Using Artificial Intelligence Techniques Fuzzy ognitive Map for Software Testing Using Artificial Intelligence Techniques Deane Larkman 1, Masoud Mohammadian 1, Bala Balachandran 1, Ric Jentzsch 2 1 Faculty of Information Science and Engineering,

More information

HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION

HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION HYBRID PROBABILITY BASED ENSEMBLES FOR BANKRUPTCY PREDICTION Chihli Hung 1, Jing Hong Chen 2, Stefan Wermter 3, 1,2 Department of Management Information Systems, Chung Yuan Christian University, Taiwan

More information

Towards applying Data Mining Techniques for Talent Mangement

Towards applying Data Mining Techniques for Talent Mangement 2009 International Conference on Computer Engineering and Applications IPCSIT vol.2 (2011) (2011) IACSIT Press, Singapore Towards applying Data Mining Techniques for Talent Mangement Hamidah Jantan 1,

More information

A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier

A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier A Study Of Bagging And Boosting Approaches To Develop Meta-Classifier G.T. Prasanna Kumari Associate Professor, Dept of Computer Science and Engineering, Gokula Krishna College of Engg, Sullurpet-524121,

More information

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM. DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,

More information

KEY KNOWLEDGE MANAGEMENT TECHNOLOGIES IN THE INTELLIGENCE ENTERPRISE

KEY KNOWLEDGE MANAGEMENT TECHNOLOGIES IN THE INTELLIGENCE ENTERPRISE KEY KNOWLEDGE MANAGEMENT TECHNOLOGIES IN THE INTELLIGENCE ENTERPRISE RAMONA-MIHAELA MATEI Ph.D. student, Academy of Economic Studies, Bucharest, Romania ramona.matei1982@gmail.com Abstract In this rapidly

More information

Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar

Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar Prepared by Louise Francis, FCAS, MAAA Francis Analytics and Actuarial Data Mining, Inc. www.data-mines.com Louise.francis@data-mines.cm

More information

Knowledge Based Descriptive Neural Networks

Knowledge Based Descriptive Neural Networks Knowledge Based Descriptive Neural Networks J. T. Yao Department of Computer Science, University or Regina Regina, Saskachewan, CANADA S4S 0A2 Email: jtyao@cs.uregina.ca Abstract This paper presents a

More information

Data Warehousing and Data Mining in Business Applications

Data Warehousing and Data Mining in Business Applications 133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business

More information

Data Mining Yelp Data - Predicting rating stars from review text

Data Mining Yelp Data - Predicting rating stars from review text Data Mining Yelp Data - Predicting rating stars from review text Rakesh Chada Stony Brook University rchada@cs.stonybrook.edu Chetan Naik Stony Brook University cnaik@cs.stonybrook.edu ABSTRACT The majority

More information

NEURAL NETWORKS IN DATA MINING

NEURAL NETWORKS IN DATA MINING NEURAL NETWORKS IN DATA MINING 1 DR. YASHPAL SINGH, 2 ALOK SINGH CHAUHAN 1 Reader, Bundelkhand Institute of Engineering & Technology, Jhansi, India 2 Lecturer, United Institute of Management, Allahabad,

More information

Data Isn't Everything

Data Isn't Everything June 17, 2015 Innovate Forward Data Isn't Everything The Challenges of Big Data, Advanced Analytics, and Advance Computation Devices for Transportation Agencies. Using Data to Support Mission, Administration,

More information

Using News Articles to Predict Stock Price Movements

Using News Articles to Predict Stock Price Movements Using News Articles to Predict Stock Price Movements Győző Gidófalvi Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 9237 gyozo@cs.ucsd.edu 21, June 15,

More information

Comparing Artificial Intelligence Systems for Stock Portfolio Selection

Comparing Artificial Intelligence Systems for Stock Portfolio Selection Abstract Comparing Artificial Intelligence Systems for Stock Portfolio Selection Extended Abstract Chiu-Che Tseng Texas A&M University Commerce P.O. BOX 3011 Commerce, Texas 75429 Tel: (903) 886-5497 Email:

More information

Students will become familiar with the Brandeis Datastream installation as the primary source of pricing, financial and economic data.

Students will become familiar with the Brandeis Datastream installation as the primary source of pricing, financial and economic data. BUS 211f (1) Information Management: Financial Data in a Quantitative Investment Framework Spring 2004 Fridays 9:10am noon Lemberg Academic Center, Room 54 Prof. Hugh Lagan Crowther C (781) 640-3354 hugh@crowther-investment.com

More information

Towards Multi-modal Cognitive Big Data Informatics: Real World Case Studies & Future Challenges Professor AMIR HUSSAIN, PhD, Fellow UK HEA, Senior Member IEEE Founding Chief-Editor: Cognitive Computation

More information

Credit Risk Assessment of POS-Loans in the Big Data Era

Credit Risk Assessment of POS-Loans in the Big Data Era Credit Risk Assessment of POS-Loans in the Big Data Era Yiyang Bian 1,2, Shaokun Fan 1, Ryan Liying Ye 1, J. Leon Zhao 1 1 Department of Information Systems, City University of Hong Kong 2 School of Management,

More information

Measuring Qualitative Information in Capital Markets Research

Measuring Qualitative Information in Capital Markets Research Measuring Qualitative Information in Capital Markets Research Elaine Henry ehenry@exchange.sba.miami.edu School of Business University of Miami Andrew J. Leone a.leone@miami.edu School of Business University

More information

Data Mining Solutions for the Business Environment

Data Mining Solutions for the Business Environment Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania ruxandra_stefania.petre@yahoo.com Over

More information

Qualitative Corporate Dashboards for Corporate Monitoring Peng Jia and Miklos A. Vasarhelyi 1

Qualitative Corporate Dashboards for Corporate Monitoring Peng Jia and Miklos A. Vasarhelyi 1 Qualitative Corporate Dashboards for Corporate Monitoring Peng Jia and Miklos A. Vasarhelyi 1 Introduction Electronic Commerce 2 is accelerating dramatically changes in the business process. Electronic

More information

Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg

Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg March 1, 2007 The catalogue is organized into sections of (1) obligatory modules ( Basismodule ) that

More information

Stock Portfolio Selection using Data Mining Approach

Stock Portfolio Selection using Data Mining Approach IOSR Journal of Engineering (IOSRJEN) e-issn: 2250-3021, p-issn: 2278-8719 Vol. 3, Issue 11 (November. 2013), V1 PP 42-48 Stock Portfolio Selection using Data Mining Approach Carol Anne Hargreaves, Prateek

More information

Real Stock Trading Using Soft Computing Models

Real Stock Trading Using Soft Computing Models Real Stock Trading Using Soft Computing Models Brent Doeksen 1, Ajith Abraham 2, Johnson Thomas 1 and Marcin Paprzycki 1 1 Computer Science Department, Oklahoma State University, OK 74106, USA, 2 School

More information

Introduction to Data Mining Techniques

Introduction to Data Mining Techniques Introduction to Data Mining Techniques Dr. Rajni Jain 1 Introduction The last decade has experienced a revolution in information availability and exchange via the internet. In the same spirit, more and

More information

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING EFFICIENT DATA PRE-PROCESSING FOR DATA MINING USING NEURAL NETWORKS JothiKumar.R 1, Sivabalan.R.V 2 1 Research scholar, Noorul Islam University, Nagercoil, India Assistant Professor, Adhiparasakthi College

More information

A Big Data Analytical Framework For Portfolio Optimization Abstract. Keywords. 1. Introduction

A Big Data Analytical Framework For Portfolio Optimization Abstract. Keywords. 1. Introduction A Big Data Analytical Framework For Portfolio Optimization Dhanya Jothimani, Ravi Shankar and Surendra S. Yadav Department of Management Studies, Indian Institute of Technology Delhi {dhanya.jothimani,

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015

International Journal of Computer Science Trends and Technology (IJCST) Volume 3 Issue 3, May-June 2015 RESEARCH ARTICLE OPEN ACCESS Data Mining Technology for Efficient Network Security Management Ankit Naik [1], S.W. Ahmad [2] Student [1], Assistant Professor [2] Department of Computer Science and Engineering

More information

ARTIFICIAL INTELLIGENCE METHODS IN STOCK INDEX PREDICTION WITH THE USE OF NEWSPAPER ARTICLES

ARTIFICIAL INTELLIGENCE METHODS IN STOCK INDEX PREDICTION WITH THE USE OF NEWSPAPER ARTICLES FOUNDATION OF CONTROL AND MANAGEMENT SCIENCES No Year Manuscripts Mateusz, KOBOS * Jacek, MAŃDZIUK ** ARTIFICIAL INTELLIGENCE METHODS IN STOCK INDEX PREDICTION WITH THE USE OF NEWSPAPER ARTICLES Analysis

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014 RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer

More information

Stabilization by Conceptual Duplication in Adaptive Resonance Theory

Stabilization by Conceptual Duplication in Adaptive Resonance Theory Stabilization by Conceptual Duplication in Adaptive Resonance Theory Louis Massey Royal Military College of Canada Department of Mathematics and Computer Science PO Box 17000 Station Forces Kingston, Ontario,

More information

dm106 TEXT MINING FOR CUSTOMER RELATIONSHIP MANAGEMENT: AN APPROACH BASED ON LATENT SEMANTIC ANALYSIS AND FUZZY CLUSTERING

dm106 TEXT MINING FOR CUSTOMER RELATIONSHIP MANAGEMENT: AN APPROACH BASED ON LATENT SEMANTIC ANALYSIS AND FUZZY CLUSTERING dm106 TEXT MINING FOR CUSTOMER RELATIONSHIP MANAGEMENT: AN APPROACH BASED ON LATENT SEMANTIC ANALYSIS AND FUZZY CLUSTERING ABSTRACT In most CRM (Customer Relationship Management) systems, information on

More information

Chapter Managing Knowledge in the Digital Firm

Chapter Managing Knowledge in the Digital Firm Chapter Managing Knowledge in the Digital Firm Essay Questions: 1. What is knowledge management? Briefly outline the knowledge management chain. 2. Identify the three major types of knowledge management

More information

IT services for analyses of various data samples

IT services for analyses of various data samples IT services for analyses of various data samples Ján Paralič, František Babič, Martin Sarnovský, Peter Butka, Cecília Havrilová, Miroslava Muchová, Michal Puheim, Martin Mikula, Gabriel Tutoky Technical

More information

Linguistic Preference Modeling: Foundation Models and New Trends. Extended Abstract

Linguistic Preference Modeling: Foundation Models and New Trends. Extended Abstract Linguistic Preference Modeling: Foundation Models and New Trends F. Herrera, E. Herrera-Viedma Dept. of Computer Science and Artificial Intelligence University of Granada, 18071 - Granada, Spain e-mail:

More information

International Journal of Computer Trends and Technology (IJCTT) volume 4 Issue 8 August 2013

International Journal of Computer Trends and Technology (IJCTT) volume 4 Issue 8 August 2013 A Short-Term Traffic Prediction On A Distributed Network Using Multiple Regression Equation Ms.Sharmi.S 1 Research Scholar, MS University,Thirunelvelli Dr.M.Punithavalli Director, SREC,Coimbatore. Abstract:

More information

Software project cost estimation using AI techniques

Software project cost estimation using AI techniques Software project cost estimation using AI techniques Rodríguez Montequín, V.; Villanueva Balsera, J.; Alba González, C.; Martínez Huerta, G. Project Management Area University of Oviedo C/Independencia

More information

Forecasting stock markets with Twitter

Forecasting stock markets with Twitter Forecasting stock markets with Twitter Argimiro Arratia argimiro@lsi.upc.edu Joint work with Marta Arias and Ramón Xuriguera To appear in: ACM Transactions on Intelligent Systems and Technology, 2013,

More information

Social Media Mining. Data Mining Essentials

Social Media Mining. Data Mining Essentials Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers

More information

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and Financial Institutions and STATISTICA Case Study: Credit Scoring STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table of Contents INTRODUCTION: WHAT

More information

Neural Networks in Data Mining

Neural Networks in Data Mining IOSR Journal of Engineering (IOSRJEN) ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 04, Issue 03 (March. 2014), V6 PP 01-06 www.iosrjen.org Neural Networks in Data Mining Ripundeep Singh Gill, Ashima Department

More information

Big Data with Rough Set Using Map- Reduce

Big Data with Rough Set Using Map- Reduce Big Data with Rough Set Using Map- Reduce Mr.G.Lenin 1, Mr. A. Raj Ganesh 2, Mr. S. Vanarasan 3 Assistant Professor, Department of CSE, Podhigai College of Engineering & Technology, Tirupattur, Tamilnadu,

More information

ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION

ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION ISSN 9 X INFORMATION TECHNOLOGY AND CONTROL, 00, Vol., No.A ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION Danuta Zakrzewska Institute of Computer Science, Technical

More information

Research on the Factor Analysis and Logistic Regression with the Applications on the Listed Company Financial Modeling.

Research on the Factor Analysis and Logistic Regression with the Applications on the Listed Company Financial Modeling. 2nd International Conference on Social Science and Technology Education (ICSSTE 2016) Research on the Factor Analysis and Logistic Regression with the Applications on the Listed Company Financial Modeling

More information

A Review of Data Mining Techniques

A Review of Data Mining Techniques Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,

More information

Defending Networks with Incomplete Information: A Machine Learning Approach. Alexandre Pinto alexcp@mlsecproject.org @alexcpsec @MLSecProject

Defending Networks with Incomplete Information: A Machine Learning Approach. Alexandre Pinto alexcp@mlsecproject.org @alexcpsec @MLSecProject Defending Networks with Incomplete Information: A Machine Learning Approach Alexandre Pinto alexcp@mlsecproject.org @alexcpsec @MLSecProject Agenda Security Monitoring: We are doing it wrong Machine Learning

More information

Sentiment Analysis. D. Skrepetos 1. University of Waterloo. NLP Presenation, 06/17/2015

Sentiment Analysis. D. Skrepetos 1. University of Waterloo. NLP Presenation, 06/17/2015 Sentiment Analysis D. Skrepetos 1 1 Department of Computer Science University of Waterloo NLP Presenation, 06/17/2015 D. Skrepetos (University of Waterloo) Sentiment Analysis NLP Presenation, 06/17/2015

More information

Department of Accounting and Finance

Department of Accounting and Finance Department of Accounting and Finance Modules, other than Introductory modules may have pre-requisites or co-requisites (please, see module descriptions below) and a student must have undertaken and passed

More information

Natural Language to Relational Query by Using Parsing Compiler

Natural Language to Relational Query by Using Parsing Compiler Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,

More information

Financial Trading System using Combination of Textual and Numerical Data

Financial Trading System using Combination of Textual and Numerical Data Financial Trading System using Combination of Textual and Numerical Data Shital N. Dange Computer Science Department, Walchand Institute of Rajesh V. Argiddi Assistant Prof. Computer Science Department,

More information

Stock Market Prediction Using Data Mining

Stock Market Prediction Using Data Mining Stock Market Prediction Using Data Mining 1 Ruchi Desai, 2 Prof.Snehal Gandhi 1 M.E., 2 M.Tech. 1 Computer Department 1 Sarvajanik College of Engineering and Technology, Surat, Gujarat, India Abstract

More information

Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies

Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com

More information

Neural Networks and Back Propagation Algorithm

Neural Networks and Back Propagation Algorithm Neural Networks and Back Propagation Algorithm Mirza Cilimkovic Institute of Technology Blanchardstown Blanchardstown Road North Dublin 15 Ireland mirzac@gmail.com Abstract Neural Networks (NN) are important

More information

Study Plan for the Master Degree In Industrial Engineering / Management. (Thesis Track)

Study Plan for the Master Degree In Industrial Engineering / Management. (Thesis Track) Study Plan for the Master Degree In Industrial Engineering / Management (Thesis Track) Plan no. 2005 T A. GENERAL RULES AND CONDITIONS: 1. This plan conforms to the valid regulations of programs of graduate

More information

Data Quality Mining: Employing Classifiers for Assuring consistent Datasets

Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Fabian Grüning Carl von Ossietzky Universität Oldenburg, Germany, fabian.gruening@informatik.uni-oldenburg.de Abstract: Independent

More information

Introduction. A. Bellaachia Page: 1

Introduction. A. Bellaachia Page: 1 Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.

More information

Master of Science in Health Information Technology Degree Curriculum

Master of Science in Health Information Technology Degree Curriculum Master of Science in Health Information Technology Degree Curriculum Core courses: 8 courses Total Credit from Core Courses = 24 Core Courses Course Name HRS Pre-Req Choose MIS 525 or CIS 564: 1 MIS 525

More information

BIG DATA IN HEALTHCARE THE NEXT FRONTIER

BIG DATA IN HEALTHCARE THE NEXT FRONTIER BIG DATA IN HEALTHCARE THE NEXT FRONTIER Divyaa Krishna Sonnad 1, Dr. Jharna Majumdar 2 2 Dean R&D, Prof. and Head, 1,2 Dept of CSE (PG), Nitte Meenakshi Institute of Technology Abstract: The world of

More information

Predictive time series analysis of stock prices using neural network classifier

Predictive time series analysis of stock prices using neural network classifier Predictive time series analysis of stock prices using neural network classifier Abhinav Pathak, National Institute of Technology, Karnataka, Surathkal, India abhi.pat93@gmail.com Abstract The work pertains

More information

Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis

Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis Yue Dai, Ernest Arendarenko, Tuomo Kakkonen, Ding Liao School of Computing University of Eastern Finland {yvedai,

More information

Analyzing survey text: a brief overview

Analyzing survey text: a brief overview IBM SPSS Text Analytics for Surveys Analyzing survey text: a brief overview Learn how gives you greater insight Contents 1 Introduction 2 The role of text in survey research 2 Approaches to text mining

More information

A Proposed Algorithm for Spam Filtering Emails by Hash Table Approach

A Proposed Algorithm for Spam Filtering Emails by Hash Table Approach International Research Journal of Applied and Basic Sciences 2013 Available online at www.irjabs.com ISSN 2251-838X / Vol, 4 (9): 2436-2441 Science Explorer Publications A Proposed Algorithm for Spam Filtering

More information

Data Mining Applications in Higher Education

Data Mining Applications in Higher Education Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2

More information

ANALYSIS OF WEB-BASED APPLICATIONS FOR EXPERT SYSTEM

ANALYSIS OF WEB-BASED APPLICATIONS FOR EXPERT SYSTEM Computer Modelling and New Technologies, 2011, Vol.15, No.4, 41 45 Transport and Telecommunication Institute, Lomonosov 1, LV-1019, Riga, Latvia ANALYSIS OF WEB-BASED APPLICATIONS FOR EXPERT SYSTEM N.

More information

U.P.B. Sci. Bull., Series C, Vol. 77, Iss. 1, 2015 ISSN 2286 3540

U.P.B. Sci. Bull., Series C, Vol. 77, Iss. 1, 2015 ISSN 2286 3540 U.P.B. Sci. Bull., Series C, Vol. 77, Iss. 1, 2015 ISSN 2286 3540 ENTERPRISE FINANCIAL DISTRESS PREDICTION BASED ON BACKWARD PROPAGATION NEURAL NETWORK: AN EMPIRICAL STUDY ON THE CHINESE LISTED EQUIPMENT

More information

Text Mining - Scope and Applications

Text Mining - Scope and Applications Journal of Computer Science and Applications. ISSN 2231-1270 Volume 5, Number 2 (2013), pp. 51-55 International Research Publication House http://www.irphouse.com Text Mining - Scope and Applications Miss

More information

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Machine Learning and Data Mining. Fundamentals, robotics, recognition Machine Learning and Data Mining Fundamentals, robotics, recognition Machine Learning, Data Mining, Knowledge Discovery in Data Bases Their mutual relations Data Mining, Knowledge Discovery in Databases,

More information