The Annual Inflation Rate Analysis Using Data Mining Techniques



Similar documents
The Analysis of Currency Exchange Rate Evolution using a Data Mining Technique

THE USING FACTOR ANALYSIS METHOD IN PREDICTION OF BUSINESS FAILURE

Using Principal Component Analysis in Loan Granting

USING LINEAR REGRESSION IN THE ANALYSIS OF FINANCIAL-ECONOMIC PERFORMANCES

A Decision Tree for Weather Prediction

MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM. R, analysis of variance, Student test, multivariate analysis

ANALYSIS OF THE ECONOMIC PERFORMANCE OF A ORGANIZATION USING MULTIPLE REGRESSION

Data Mining for Model Creation. Presentation by Paul Below, EDS 2500 NE Plunkett Lane Poulsbo, WA USA

Factor Analysis. Principal components factor analysis. Use of extracted factors in multivariate dependency models

Applying TwoStep Cluster Analysis for Identifying Bank Customers Profile

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

Forecasting methods applied to engineering management

T-test & factor analysis

A STUDY ON THE RETURN ON EQUITY FOR THE ROMANIAN INDUSTRIAL COMPANIES

Principal Component Analysis

The influence of teacher support on national standardized student assessment.

Simple linear regression

POST-HOC SEGMENTATION USING MARKETING RESEARCH

Overview of Factor Analysis

Data Mining. for Process Improvement DATA MINING. Paul Below, Quantitative Software Management, Inc. (QSM)

Chapter 13 Introduction to Linear Regression and Correlation Analysis

Effective and Efficient Tools in Human Resources Management Control

Universitatea de Medicină şi Farmacie Grigore T. Popa Iaşi Comisia pentru asigurarea calităţii DISCIPLINE RECORD/ COURSE / SEMINAR DESCRIPTION

Factor Analysis. Advanced Financial Accounting II Åbo Akademi School of Business

Money and the Key Aspects of Financial Management

How to Get More Value from Your Survey Data

CLASSIFICATION OF EUROPEAN UNION COUNTRIES FROM DATA MINING POINT OF VIEW, USING SAS ENTERPRISE GUIDE

Comparison of sales forecasting models for an innovative agro-industrial product: Bass model versus logistic function

Univariate Regression

Factor Analysis. Chapter 420. Introduction

IDENTITY AND ACCESS MANAGEMENT- A RISK-BASED APPROACH. Ion-Petru POPESCU 1 Cătălin Alexandru BARBU 2 Mădălina Ecaterina POPESCU 3

Website for Human Resources Management in a Public Institution Using Caché Object-Oriented Model

Review Jeopardy. Blue vs. Orange. Review Jeopardy

Multiple Regression Used in Macro-economic Analysis

Adaptive Demand-Forecasting Approach based on Principal Components Time-series an application of data-mining technique to detection of market movement

Rachel J. Goldberg, Guideline Research/Atlanta, Inc., Duluth, GA

Behavioral Entropy of a Cellular Phone User

Dimensionality Reduction: Principal Components Analysis

STATISTICS AND DATA ANALYSIS IN GEOLOGY, 3rd ed. Clarificationof zonationprocedure described onpp

FACTOR ANALYSIS. Factor Analysis is similar to PCA in that it is a technique for studying the interrelationships among variables.

To do a factor analysis, we need to select an extraction method and a rotation method. Hit the Extraction button to specify your extraction method.

2. Linearity (in relationships among the variables--factors are linear constructions of the set of variables) F 2 X 4 U 4

THE EFFECTS OF SOCIO-DEMOGRAPHIC AND PSYCHOLOGICAL CHARACTERISTICS ON EMPLOYEE INCOME

Aspects in Development of Statistic Data Analysis in Romanian Sanitary System

MULTIPLE REGRESSIONS ON SOME SELECTED MACROECONOMIC VARIABLES ON STOCK MARKET RETURNS FROM

Calculating the Probability of Returning a Loan with Binary Probability Models

A Brief Introduction to Factor Analysis

Chapter 23. Inferences for Regression

Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

DATA VERIFICATION IN ETL PROCESSES

International Journal of Computer Trends and Technology (IJCTT) volume 4 Issue 8 August 2013

University of Petroşani, România

Competition as an Effective Tool in Developing Social Marketing Programs: Driving Behavior Change through Online Activities

4. There are no dependent variables specified... Instead, the model is: VAR 1. Or, in terms of basic measurement theory, we could model it as:

Statistics in Psychosocial Research Lecture 8 Factor Analysis I. Lecturer: Elizabeth Garrett-Mayer

Factor Analysis. Sample StatFolio: factor analysis.sgp

DATA MINING AND CUSTOMER RELATIONSHIP MANAGEMENT FOR CLIENTS SEGMENTATION

Organizational Factors Affecting E-commerce Adoption in Small and Medium-sized Enterprises

ScienceDirect. Using Internet as a Commercial Tool: a Case Study of E-Commerce in Resita

DISCRIMINANT FUNCTION ANALYSIS (DA)

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

How To Identify Noisy Variables In A Cluster

User Behaviour on Google Search Engine

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

EFFECT OF ENVIRONMENTAL CONCERN & SOCIAL NORMS ON ENVIRONMENTAL FRIENDLY BEHAVIORAL INTENTIONS

Data analysis process

Easily Identify Your Best Customers

Introduction to Principal Components and FactorAnalysis

5. Multiple regression

Principle Component Analysis and Partial Least Squares: Two Dimension Reduction Techniques for Regression

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP

Multiple Regression. Page 24

Principal Component Analysis

Anale. Seria Informatică. Vol. VII fasc Annals. Computer Science Series. 7 th Tome 1 st Fasc. 2009

Using Data Mining for Mobile Communication Clustering and Characterization

PRINCIPAL COMPONENT ANALYSIS

Exploratory Factor Analysis of Demographic Characteristics of Antenatal Clinic Attendees and their Association with HIV Risk

Component Ordering in Independent Component Analysis Based on Data Power

5.2 Customers Types for Grocery Shopping Scenario

IMPLICATIONS OF LOGISTIC SERVICE QUALITY ON THE SATISFACTION LEVEL AND RETENTION RATE OF AN E-COMMERCE RETAILER S CUSTOMERS

Teaching Multivariate Analysis to Business-Major Students

Exploratory Factor Analysis

A Comparison of Variable Selection Techniques for Credit Scoring

RISK ANALYSIS ON THE LEASING MARKET

Curriculum Map Statistics and Probability Honors (348) Saugus High School Saugus Public Schools

SPSS Guide: Regression Analysis

Multivariate Analysis of Ecological Data

Big Data, Socio- Psychological Theory, Algorithmic Text Analysis, and Predicting the Michigan Consumer Sentiment Index

1/27/2013. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

Part 2: Analysis of Relationship Between Two Variables

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

New Ensemble Combination Scheme

Knowledge Discovery in Stock Market Data

INSURANCE CONSULTANTS ATTITUDE TOWARDS RELATIONSHIP MARKETING ELEMENTS

THE USE OF REGRESSION ANALYSIS IN MARKETING RESEARCH

Chapter 7 Factor Analysis SPSS

APPRAISAL OF FINANCIAL AND ADMINISTRATIVE FUNCTIONING OF PUNJAB TECHNICAL UNIVERSITY

Common factor analysis

Data Mining and Visualization

Transcription:

Economic Insights Trends and Challenges Vol. I (LXIV) No. 4/2012 121-128 The Annual Inflation Rate Analysis Using Data Mining Techniques Mădălina Cărbureanu Petroleum-Gas University of Ploiesti, Bd. Bucuresti 39, Ploiesti, Romania e-mail: mcarbureanu@upg-ploiesti.ro Abstract Inflation, indicator of the unbalance that affects the national economies, proved to be through the years a permanent presence in economy. The inflation consequences that the population faces, such as the decrease in the buying capacity and not only, make necessary the annual inflation rate IPC analysis through the usage of various statistical techniques and methods, such as data mining techniques. In the present paper, for annual inflation rate IPC analysis and prediction it is used a set of data mining techniques, respectively the principal components analysis, decision trees and linear regression. The data mining techniques, respectively the statistical techniques are useful tools for economists, statisticians, financial analysts etc., in the analysis and prediction of various economic indicators. Key words: inflation annual rate, principal components, regression, decision trees JEL Classification: C38, C81 Introduction The political, economic and social events from a certain period of time have a positive and negative effect on the annual inflation rate IPC evolution, with direct consequences on the population standard of living. Moreover, there is a set of factors that directly influences the annual inflation rate IPC evolution, such as: administrated prices, the fuels prices, the volatile prices of food stuffs (LFO), the adjusted CORE2 indicator (measure of the main inflation) and tobacco and alcohol beverages prices 1. The mentioned factors will be used in developing the annual inflation rate IPC analysis and prediction using a set of data mining techniques, such as: principal components analysis (PCA), decision trees and linear regression. The paper is structured as follows: o The data mining techniques as research instruments for analysis and prediction, in which case the main applications and advantages of using such techniques in the economic field are presented; 1 Romanian National Bank, Report on inflation may 2012, at http: // www.bnro.ro / Publication ocuments.aspx?icid=3922.

122 Mădălina Cărbureanu o o The IPC annual inflation rate analysis and prediction through the application of some data mining techniques (principal components analysis, decision trees and linear regression); Concluding thoughts regarding the applicability and utility of some data mining techniques such as PCA, decision tress and linear regression in the analysis and prediction of different economic indicators. Data Mining Tehniques as Instruments for Analysis and Prediction The various government institutions, especially the banks, for instance Romanian National Bank (BNR), are working with large data bases. If they do not employ adequate instruments the useful information extraction and analysis from the enormous data bases cannot be achieved. The data mining techniques represent a useful tool in solving the various problems from the economic domain, through correlation identification, rules, patterns etc., that can be traced throughout the analyzed data, thus helping the analyst to take the best decisions, and also to analyse and predict various economic indicators. In the economic domain, the usage of some data mining techniques, as classification, regression, clustering etc., brings a set of advantages, such as the optimization of the institutional activity through the best decisions taking 2. Moreover, in the literature it is presented a set of data mining applications employed in the analysis, prediction and modeling of inflation behaviour 3, in inflation analysis measured through consuming prices 4 etc. Among the data mining techniques, those that are most used in the economic domain, in order to predict the various indicators, are the ones presented in Figure 1. Fig. 1. Data mining techniques The ability to predict the evolution of an economic indicator with accuracy, as in the case of the annual rate inflation IPC, is essential for taking the most adequate decisions, measures to counter-attack the negative effects generated by a negative evolution of the analised economic 2 Andronie, M., Criş an, D., Commercially Available Data Mining Tools used in the Economic Environment, Database System Journal, Vol. I, No. 2/2010, pp. 45-54. 3 Ericsson, N., Constructive data mining:modeling australian inflation, at http: // www. economics.smu.edu.sg/events/paper/neilericssonaustralian.pdf 4 Cao, L., et al, New Frontiers in applind data mining, Springer-Verlag, Berlin Heidelberg, 2012, pp.179-181.

The Annual Inflation Rate Analysis Using Data Mining Techniques 123 indicator. To that aim, the data mining tehniques can be a useful tool in taking the best decisions. The Data Mining Techniques Application Data mining techniques have wide applicability in many fields ranging from finances, economy, and marketing to medicine, astronomy, and meteorology 4. From the multitude of the data mining techniques, there have been chosen the principal components analysis (PCA), linear regression and decision trees, to be applied in IPC annual inflation rate analysis and prediction. Principal Components Analysis The economic-social reality is influenced by a large number of variables (indicators). The principal components analysis (PCA) aims at the reduction of the number of used variables, for obtaining a number of representative variables, named factors 5. Table 1 presents the data base developed using the data supplied by Romanian National Bank (BNR) for the analyzed variables: IPC annual inflation rate, administrated prices, fuels prices, the volatile prices of food products (LFO), the adjusted CORE2 indicator (measure unit for base inflation) and tobacco and alcohol beverages prices 6. Year 2008 2009 2010 Month IPC inflation annual rate (%) Table 1. Data base structure Administrated prices (%) Tobacco and alcohol beverages prices (%) LFO (%) Adjusted CORE2 (%) October 7.39 1.6 0.6 1.1 4.1 November 6.74 1.5 0.6 0.8 3.8 December 6.30 1.6 0.6 0.3 3.7 January 6.71 1.5 0.8 0.4 4.0 February 6.89 1.5 0.8 0.6 4.0 March 6.71 1.5 0.9 0.6 3.7 April 6.45 1.5 1.2 0.4 3.3 May 5.95 1.4 1.2 0.2 3.1 June 5.86 1.5 1.2 0.3 2.9 July 5.06 0.7 1.1 0.5 2.8 August 4.96 0.9 1.2 0.2 2.7 September 4.94 0.9 1.4 0.1 2.5 October 4.30 0.8 1.3 0.1 2.1 November 4.65 0.7 1.6 0.4 2.0 December 4.74 0.6 1.8 0.7 1.6 January 5.20 0.6 2.4 0.8 1.4 February 4.49 0.4 2.3 0.6 1.2 March 4.20 0.4 2.2 0.6 1.0 April 4.28 0.5 1.8 0.7 1.2 5 Gorunescu, F., Data mining. Concepte, modele şi tehnici, Editura Albastră, Cluj-Napoca, 2006, pp. 23-24. 6 Romanian National Bank, Monthly bulletins, 2010-2012, at http://www.bnro.ro/raportul-asuprainflatiei-3342.aspx.

124 Mădălina Cărbureanu 2010 2011 2012 Table 1 (cont.) May 4.42 0.6 1.8 0.7 1.3 June 4.38 0.6 1.8 0.6 1.4 July 7.14 1.3 1.9 1.1 2.8 August 7.58 1.3 1.9 1.4 2.9 September 7.77 1.3 1.6 1.8 3.0 October 7.88 1.4 1.8 1.9 2.9 November 7.73 1.4 1.5 1.9 2.9 December 7.96 1.4 1.4 2.2 3.0 January 6.99 1.2 0.9 2.3 2.6 February 7.60 1.2 0.9 2.7 2.8 March 8.01 1.2 0.9 3.0 2.9 April 8.34 1.3 1.0 3.1 2.9 May 8.41 1.3 1.1 3.1 2.9 June 7.93 1.3 1.1 2.6 2.9 July 4.85 0.8 0.6 1.6 1.9 August 4.25 0.9 0.5 1.0 1.9 September 3.45 0.9 0.5 0.3 1.8 October 3.55 1.0 0.5 0.4 1.7 November 3.44 1.0 0.5 0.3 1.6 December 3.14 1.1 0.5 0.0 1.5 January 2.72 1.08 0.42-0.34 1.56 February 2.59 1.06 0.42-0.27 1.38 March 2.40 1.03 0.43-0.41 1.35 For the PCA analysis, the procedure Statistics-Data reduction-factor from SPSS environment was applied. The statistical results are presented in Tables 2, 3 and 4. Table 2. Correlation matrix IPC PRICES TABACCO_ALCOHOL LFO CORE2 IPC 1.000.680.079.768.745 PRICES.680 1.000 -.464.273.880 TABACCO_ALCOHOL.079 -.464 1.000.043 -.345 LFO.768.273.043 1.000.253 CORE2.745.880 -.345.253 1.000 The correlation matrix indicates the correlations between the analyzed variables. As it can be observed, we have positive and also negative correlations between the analyzed variables. Meaningful positive correlations are between IPC and LFO (0.76), between PRICES and CORE2 (0.88) and between CORE 2 and IPC (0.74). Correlation Component Total Initial Eigenvalues % of Variance Cumulative % Table 3. Total variance explained Total Extraction Sums of Squared Loadings % of Variance Cumulative % Total Rotation Sums of Squared Loadings % Cumulative of % Variance 1 2.914 58.284 58.284 2.914 58.284 58.284 2.196 43.920 43.920 2 1.357 27.133 85.416 1.357 27.133 85.416 2.075 41.496 85.416 3.619 12.378 97.794 4.110 2.198 99.992 5.000.008 100.000

The Annual Inflation Rate Analysis Using Data Mining Techniques 125 The table Total Variance Explained brings the first specific information of factorial analysis. Five principal components (factors) were generated, but only two of them reached the selection criterion (Eigenvalue>=1). On the Extraction Sums of Squared Loadings columns, we have the Eigenvalues, the explained variance and cumulative variance for these two factors, in the context of initial solution (without rotation) 7. The variance explained by each factor was distributed as follows: factor I-58.284%, and factor II-27.133%. Together those two factors explain 85.416% from the analyzed values variation. On the Rotation Sums of Squared Loadings columns, we have the same values for these two factors, but after the application of rotation procedure. It can be noticed a redistribution of the variance explained by each factor (factor I-43.920%; factor II-41.496%) in the context of the same total variation (85.416%). As it can be observed, through the rotation method the first factor lowers its saturation degree, in favor of the second factor. The Scree Plot (Figure 2) presents the Eigenvalues for all the principal components obtained through analysis, which are numerically represented in Table 3. Fig. 2. Scree Plot The factorial solution (after rotation) is presented in Table 4. Table 4. Rotated Component Matrix(a) Component 1 2 IPC.951.297 LFO.860 PRICES.447.845 TABACCO_ALCOHOL.301 -.813 CORE2.510.781 The data from Table 4 allows final conclusions regarding the analyzed variables factorial structure, as follows: o Factor I is composed by IPC (0.95) and LFO (0.86) variables; o Factor II is composed by PRICES (0.84) and CORE2(0.78) variables. The character and intensity of the correlation between IPC and LFO and between PRICES and CORE2 is highlighted with the help of a graphical procedure, named scatterplot, as it can be observed in Figure 3. 7 Popa., M., Analiza factorială exploratorie, at http://www mpopa.ro/statistica_master/14_ analiza_fact.pdf.

126 Mădălina Cărbureanu Fig. 3. Scatterplot Using PCA, we have determined those factors that influence the IPC annual rate of inflation, factors between which there is a strong linear relation, as it can be observed in Figure 3. The Simple Linear Regression The simple linear regression is a SPSS procedure used to determine the model that establishes (through a regression equation) the association between a dependent variable (for instance IPC) and an independent one (for instance CORE2) 8. The model obtained can be used to predict the IPC annual rate of inflation (IPC). The simple linear regression model form is 9 : (1) where: β 0 is the origin ordinate (shows the variable medium value), β 1 is the line gradient, and ε is a random variable that shows the deviation between the observed values and the model estimated values 9. The correlation (association) between IPC and CORE2 is presented in Figure 4. Fig. 4. The correlation between IPC and CORE2 The most important results obtained through the application of SPSS regression procedure are presented in Table 5 and Table 6. 8 Gorunescu, F., Data mining. Concepte, modele şi tehnici, Editura Albastră, Cluj-Napoca, 2006, pp. 23-24. 9 Jaba, E., Econometrie aplicată, Editura Universităţii Alexandru Ioan Cuza, Iaşi, 2008, pp. 6-15.

The Annual Inflation Rate Analysis Using Data Mining Techniques 127 Table 5. Model Summary Model R R Square Adjusted R Square Std. Error of the Estimate 1.745(a).554.543 1.13256 Model Table 6. Coefficients Unstandardized coefficients Standardized coefficients B Std. error Beta t Sig. 1 (Constant) 2.269.548 4.137.000 CORE2.745 1.422.207 6.876.000 The R Square values indicate that 55% from IPC variation is explained by the variation of CORE2. The Coefficients table contains the B coefficient (nonstandardized) and Beta coefficient (standardized), which can be used in the prediction equation. So the linear regression equation is: (2) Using equation (2) and knowing the CORE2 value for a certain month, it can be predicted the IPC annual inflation rate (IPC). According BNR, the CORE2 value for April is 1.8%, so the IPC estimated value for the same month using equation (2) is 4.8286 %, value that corresponds to the National Institute of Statistics (INSSE) predictions 10. Decision Trees One of the main data mining techniques is represented by the decision trees, used in the affiliation prediction of some instances to various categories 11. For obtaining the decision tree (decision rules), it was used the C5.0 algorithm implemented in See5 environment 12. Applying the C5.0 algorithm on the data from Table 1, there were obtained the following rules that compose the searched decision tree: o if CORE2>2.1 then IPC=big; o if CORE2<=2.1 AND PRICES>0.8 then IPC=medium; o if CORE2<=2.1 AND PRICES<=0.8 AND LFO<=0.7 then IPC=medium; o if CORE2<=2.1 AND PRICES<=0.8 AND LFO>0.7 then IPC=big. Using these decision rules, with the help of See5 we can make predictions of the IPC annual inflation rate, as it can be observed in Figure 5. Fig. 5. Prediction of IPC using See5 In Figure 5 it can be observed that the predicted value for IPC annual rate of inflation for April 2012 is a medium one, namely it belongs to (2.5 4.5) interval, with a 88% probability. 10 INSSE, at http://www.insse.ro/cms/files/statistici/comunicate/ipc/a12/ipc04r12.pdf 11 Gorunescu, F., Data mining. Concepte, modele şi tehnici, Editura Albastră, Cluj-Napoca, 2006, pp. 23-24. 12 Khosrow-Pour, M., Emerging Trends and Challenges in Information Technology Management, Editura Idea Group, 2006, pp. 282-283.

128 Mădălina Cărbureanu Conclusions This paper has highlighted the use of some statistical data mining techniques, as principal components analysis (PCA), simple linear regression and decision trees, in the evolution analysis and prediction of some economic indicators, such as IPC annual rate of inflation. So, using PCA we have been able to establish those factors that consistently influence the IPC annual inflation rate evolution, while through the usage of linear simple regression we have achieved the IPC annual inflation rate prediction for April 2012. Those are only a few of the advantages of employing statistical data mining techniques in the analysis of various economic indicators, being useful instruments for statisticians, economists etc. References 1. Andronie, M., Criş an, D., Commercially Available Data Mining Tools used in the Economic Environment, Database System Journal, Vol. I, No. 2/2010, pp. 45-54. 2. Cao, L. et al, New Frontiers in applind data mining, Springer-Verlag, Berlin Heidelberg, 2012, pp.179-181. 3. Ericsson, N., Constructive data mining: modeling australian inflation, at http: // www. economics.smu.edu.sg/events/paper/neilericssonaustralian.pdf 4. Gorunescu, F., Data mining. Concepte, modele ş i tehnici, Editura Albastră, Cluj-Napoca, 2006, pp. 23-24. 5. INSSE, at http://www.insse.ro/cms/files/statistici/comunicate/ipc/a12/ipc04r12.pdf. 6. Jaba, E., Econometrie aplicată, Editura Universităţii Alexandru Ioan Cuza, Iaşi, 2008, pp. 6-15. 7. Khosrow-Pour, M., Emerging Trends and Challenges in Information Technology Management, Editura Idea Group, 2006, pp. 282-283. 8. Popa, M., Analiza factorială exploratorie, at http://www mpopa.ro/statistica_master/14_ analiza_fact.pdf. 9. Romanian National Bank, Report on inflation May 2012, at http: // www.bnro.ro /Publication Documents.aspx? icid = 3922. 10. Romanian National Bank, Monthly bulletins, 2010-2012, at http://www.bnro.ro/raportul-asuprainflatiei-3342.aspx. Analiza ratei anuale a inflaţiei folosind tehnici de data mining Rezumat Inflaţia, indicator al dezechilibrului ce afectează economiile naţionale, s-a dovedit de-a lungul anilor a fi o prezenţă permanentă în cadrul economiei. Consecinţele inflaţiei pe care le suportă populaţia, precum scăderea puterii de cumpărare şi nu numai, fac necesară analiza permanentă a ratei anuale a inflaţiei IPC prin utilizarea unor variate metode şi tehnici statistice, ca de exemplu tehnicile de data mining. În prezenta lucrare, pentru analiza şi predicţia evoluţiei ratei anuale a inflaţiei IPC este utilizat un set de tehnici de data mining, respectiv analiza componentelor principale, arbori de decizie şi regresia liniară. Tehnicile de data mining, respectiv tehnicile statistice sunt instrumente utile economiştilor, statisticienilor, analiştilor financiari etc., în analiza şi predicţia diferiţilor indicatori economici.