MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM. R, analysis of variance, Student test, multivariate analysis



Similar documents
SPSS Guide: Regression Analysis

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

Multiple Regression in SPSS This example shows you how to perform multiple regression. The basic command is regression : linear.

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Example: Boats and Manatees

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Part 2: Analysis of Relationship Between Two Variables

ANALYSIS OF THE ECONOMIC PERFORMANCE OF A ORGANIZATION USING MULTIPLE REGRESSION

This chapter will demonstrate how to perform multiple linear regression with IBM SPSS

Module 5: Multiple Regression Analysis

Chapter 13 Introduction to Linear Regression and Correlation Analysis

1.5 Oneway Analysis of Variance

Statistical analysis and projection of wood veneer industry in Turkey:

THE RELATIONSHIP BETWEEN WORKING CAPITAL MANAGEMENT AND DIVIDEND PAYOUT RATIO OF FIRMS LISTED IN NAIROBI SECURITIES EXCHANGE

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

Regression Analysis: A Complete Example

Additional sources Compilation of sources:

2013 MBA Jump Start Program. Statistics Module Part 3

One-Way Analysis of Variance

Influence of Aggressivenessand Conservativenessin Investing and Financing Policies on Performance of Industrial Firms in Kenya

Week TSX Index

Data Mining Techniques Chapter 5: The Lure of Statistics: Data Mining Using Familiar Tools

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

5. Linear Regression

Simple linear regression

CRITICAL FACTORS AFFECTING THE UTILIZATION OF CLOUD COMPUTING Alberto Daniel Salinas Montemayor 1, Jesús Fabián Lopez 2, Jesús Cruz Álvarez 3

Multiple Linear Regression

MULTIPLE REGRESSION WITH CATEGORICAL DATA

THE INFLUENCE OF MARKETING INTELLIGENCE ON PERFORMANCES OF ROMANIAN RETAILERS. Adrian MICU 1 Angela-Eliza MICU 2 Nicoleta CRISTACHE 3 Edit LUKACS 4

2. What is the general linear model to be used to model linear trend? (Write out the model) = or

" Y. Notation and Equations for Regression Lecture 11/4. Notation:

Moderator and Mediator Analysis

DISCRIMINANT FUNCTION ANALYSIS (DA)

Multiple Regression Used in Macro-economic Analysis

Final Exam Practice Problem Answers

Running Head: HUMAN RESOURCE PRACTICES AND ENTERPRISE PERFORMANCE. Pakistan. Muzaffar Asad. Syed Hussain Haider. Muhammad Bilal Akhtar

DEPARTMENT OF PSYCHOLOGY UNIVERSITY OF LANCASTER MSC IN PSYCHOLOGICAL RESEARCH METHODS ANALYSING AND INTERPRETING DATA 2 PART 1 WEEK 9

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Chapter Seven. Multiple regression An introduction to multiple regression Performing a multiple regression on SPSS

Class 19: Two Way Tables, Conditional Distributions, Chi-Square (Text: Sections 2.5; 9.1)

Premaster Statistics Tutorial 4 Full solutions

ANALYSIS OF USER ACCEPTANCE OF A NETWORK MONITORING SYSTEM WITH A FOCUS ON ICT TEACHERS

CHAPTER 4: Trends of Online Marketing As Adopted By Indian Railways

Regression step-by-step using Microsoft Excel

Outline. Topic 4 - Analysis of Variance Approach to Regression. Partitioning Sums of Squares. Total Sum of Squares. Partitioning sums of squares

MULTIPLE REGRESSIONS ON SOME SELECTED MACROECONOMIC VARIABLES ON STOCK MARKET RETURNS FROM

One-Way Analysis of Variance (ANOVA) Example Problem

Multiple Regression. Page 24

Testing for Granger causality between stock prices and economic growth

APPLICATION OF LINEAR REGRESSION MODEL FOR POISSON DISTRIBUTION IN FORECASTING

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

2. Simple Linear Regression

Recall this chart that showed how most of our course would be organized:

Chapter Four. Data Analyses and Presentation of the Findings

Assessing Farmers' Sustainable Agricultural Practice Needs: Implication for a Sustainable Farming System

Sample Size and Power in Clinical Trials

Calculating the Probability of Returning a Loan with Binary Probability Models

Study on the Working Capital Management Efficiency in Indian Leather Industry- An Empirical Analysis

Multiple Regression: What Is It?

IMPACT AND SIGNIFICANCE OF TRANSPORTATION AND SOCIO ECONOMIC FACTORS ON STUDENTS CLASS ATTENDANCE IN NIGERIA POLYTECHNICS: A

Elementary Statistics Sample Exam #3

A Primer on Forecasting Business Performance

POST-HOC SEGMENTATION USING MARKETING RESEARCH

STAT 350 Practice Final Exam Solution (Spring 2015)

Descriptive Statistics

International Statistical Institute, 56th Session, 2007: Phil Everson

Impact of Firm Specific Factors on the Stock Prices: A Case Study on Listed Manufacturing Companies in Colombo Stock Exchange.

Determinants of the Total Quality Management Implementation in SMEs in Iran (Case of Metal Industry)

Does organizational culture cheer organizational profitability? A case study on a Bangalore based Software Company

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

Chapter 23. Inferences for Regression

Predictability Study of ISIP Reading and STAAR Reading: Prediction Bands. March 2014

Introduction. Research Problem. Larojan Chandrasegaran (1), Janaki Samuel Thevaruban (2)

Course Text. Required Computing Software. Course Description. Course Objectives. StraighterLine. Business Statistics

Notes on Applied Linear Regression

Effect of the learning support and the use of project management tools on project success: The case of Pakistan

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

A SUCCESFULL WAY TO SELL HANDICRAFTS IN ALBANIA

DETERMINANTS OF CUSTOMER RETENTION IN HOTEL INDUSTRY

Causal Forecasting Models

An analysis appropriate for a quantitative outcome and a single quantitative explanatory. 9.1 The model behind linear regression

KSTAT MINI-MANUAL. Decision Sciences 434 Kellogg Graduate School of Management

General Regression Formulae ) (N-2) (1 - r 2 YX

CHAPTER 5: CONSUMERS ATTITUDE TOWARDS ONLINE MARKETING OF INDIAN RAILWAYS

Doing Multiple Regression with SPSS. In this case, we are interested in the Analyze options so we choose that menu. If gives us a number of choices:

Regression Analysis (Spring, 2000)

COMPARISONS OF CUSTOMER LOYALTY: PUBLIC & PRIVATE INSURANCE COMPANIES.

Chapter 1 Introduction. 1.1 Introduction

Frequency of using WhatsApp Messenger among College Students in Salem District, TamilNadu

Statistiek II. John Nerbonne. October 1, Dept of Information Science

Multicollinearity Richard Williams, University of Notre Dame, Last revised January 13, 2015

Univariate Regression

Introduction to General and Generalized Linear Models

Association Between Variables

SWOT Analysis Solutions for Developing the Banking Business Based on Relationships between the Banks Indicators

16 : Demand Forecasting

An Analysis of the Telecommunications Business in China by Linear Regression

Chapter 5 Analysis of variance SPSS Analysis of variance

IAPRI Quantitative Analysis Capacity Building Series. Multiple regression analysis & interpreting results

Transcription:

Journal of tourism [No. 8] MULTIPLE REGRESSION ANALYSIS OF MAIN ECONOMIC INDICATORS IN TOURISM Assistant Ph.D. Erika KULCSÁR Babeş Bolyai University of Cluj Napoca, Romania Abstract This paper analysis the measure between GDP dependent variable in the sector of hotels and restaurants and the following independent variables: overnight stays in the establishments of, arrivals in the establishments of and investments in hotels and restaurants sector in the period of analysis 1995-007. With the multiple regression analysis I found that investments and tourist arrivals are significant predictors for the GDP dependent variable. Based on these results, I identified those components of the marketing mix, which in my opinion require investment, which could contribute to the positive development of tourist arrivals in the establishments of. Key words: coefficient of determinations - of dependencies JEL classification: D1, L83 R, analysis of variance, Student test, multivariate analysis 1. INTRODUCTION Regression analysis generally belongs to the multivariate methods, while being an explanatory method of analysis. The regression analysis describes through a model the relationship between a dependent variable and one or more independent variables. The GDP achieved in the hotels and restaurants sector may depends on the investments in hotels and restaurants, the arrivals of tourist arrivals in the establishments of, the overnight stays in the establishments of. The main purpose of this analysis is to determine to what extent GDP is affected by the three independent variables and which are the measures that should be taken based on the results - obtained by using the SPSS program - in tourism marketing field. Objectives of the study: 1. Analysis of variance (ANOVA) regarding multiple regression. To achieve a statistical test (Student test) for each independent variable, in order to know which regression coefficient can be 0 and which can not be. 3. To determine the intensity of the interdependence connection between independent variables. MULTIPLE REGRESSION The multiple regression analysis studies the simultaneous emotions that two or more independent variables may have over one dependent variable (Lefter, 004, p.364). 59

Years Revista de Turism [Nr. 8] Table 1 - Evolution of the main economic indicators in tourism in the period 1995-007 GDP obtained in the sector of hotels, restaurants (million RON updated prices)* 1995 8461.5 1389.0 7070 4111 1996 1194. 1040.7 6595 1838 1997 9375.7 67.3 577 19611 1998 7758.8 581.4 555 19183 1999 607.9 56.3 5109 17670 000 5491.7 316.7 490 17647 001 537. 584.8 4875 181 00 4586.9 557.0 4847 1777 003 4556.1 704.4 5057 17845 004 4850.4 956.9 5639 18501 005 587. 1080.6 5805 18373 006 7060.5 133.0 616 1899 007 840.4 1600.9 697 0593 Source: Romanian Statistical Year Book, 008, pp. 477, 517, 86-87 * upgraded values (007 reference period) Using the SPSS programs kit in the case of multiple regressions I have come to the following results: Table - Regression Coefficients: Investments in hotels and restaurants (million RON in updated prices)* Tourists arrivals in the establishments of (total, thousands of tourists) Overnight stays the establishments of (total, thousands) Unstandardized Standardized Model Indicators Coefficients Coefficients t Sig. B Std. Error Beta 1 (Constant) -1606.351 36.650-4.474.00 Investments in hotels and restaurants (million RON in -8.173.167-1.463-3.771.004 current prices) Tourists arrivals in the establishments of touristic 6.584 1.749.340 3.765. 004 reception (total, thousand tourists) Overnight stays the establishments of (total, -.389. 414 -.344 -.939.37 thousands) Dependent Variable: The Gross Domestic Product obtained in the hotels and restaurants sector (millions RON, current prices) Based on nonstandard coefficient we get the regression equation: ŷ =-1606.351-8. 173 x 1 +6. 584 x -0.389 x 3 (1) where x 1 - represents the investments in hotels and restaurants (millions RON, current prices) x - represents tourists arrivals in the establishments of (total, thousands of tourists) x 3 - represents overnight stays the establishments of (total, thousands) Table 3 - Estimation of Standard Deviation Model Summary Std. Error of the Model R R Square Adjusted R Square Estimate 1.916(a).840.786 1019.0549 60

Journal of tourism [No. 8] Predictors: (Constant), Overnights stays the establishments of (total, thousands), Investments in hotels and restaurants (millions RON, current prices), Tourists arrivals in the establishments of (total, thousands of tourists) The coefficient of determination R indicating the percent of how much of the total variance is explained by the independent variable is 84% (Table 3). The analysis of variance for multiple regressions will be made starting from the following results: Table 4 - Variation analysis ANOVA (b) Model Sum of Squares df Mean Square F Sig. Regression 49017946.8 3 16339315.47 15.734.001(a) Residual 934656.147 9 103847.905 Total 583640.49 1 a. Predictors: (Constant), Overnights stays the establishments of (total, thousands), Investments in hotels and restaurants (millions RON, current prices), Tourists arrivals in the establishments of (total, thousands of tourists) b. Dependent variable: The Gross Domestic Product obtained in the hotels and restaurants sector (millions RON, current prices) The result is that most part of the total variance is generated by the regression equation. If we would make the report between the variance owed to regression (SSR) and the total variance (SST) we would get the coefficient of determination R, highlighted in the previous table (Table 3). SSR SST 49017946.8 583640.49 R = = = 0. 840 () This means that 84% of the total variance is explained by the regression model while 16% by unknown factors, that variance representing the error (SSE). Starting from this error the multiple standard error of estimation is gained based on the following relation: SSE 934656.147 S = = 11019.0549 3 n ( k + 1) 13 (3 + 1) = y (3) In order to test the validity of multiple regression model a global test must be used that researches whether all the independent variables have regression coefficients equal with 0, or in other words if the explained variance is not due to a random. The regression coefficients of the sample have as correspondent at the population level the regression coefficient marked as β 1, β, β 3,... The alternative and null hypotheses are formulated as follows: H 0 : β1 = β = β 3 = 0 H 1 = not all β coefficients are equal to zero In order to test the null hypothesis we turn to F test that requires an analysis of the variance identified in the ANOVA table above. From the data in the previous table (Table 4) it can be ascertained that the value of the calculated F is 15.734 for the variance generated by the regression. The critical value of F, at the significance level of 0.05 with 3 degrees of freedom at numerator and 9 at denominator, is 3.86. By comparing the values of F it results that it is compulsory to accept the alternative hypothesis, meaning the fact that not all the regression coefficients are equal to zero. This means that a significant influence of multiple regression model occurs over dependent variables. The issue that arises now is to know which regression coefficients may be zero and which not. It is imposed therefore to achieve an individual evaluation of regression coefficients. It is therefore compulsory to make an assessment the realization of a statistical test for each under the conditions where the null hypothesis implies that each coefficient β is equal to zero and the alternative hypothesis states that it is different from zero. The test used is the Student test, respectively t with n-(k+1) degree of freedom. For each of the three variables, from the SPSS results, we get the calculated t values (Table ). These are: -3.771 for investments in hotels and restaurants, 3.765 for tourists arrivals in the establishments of, -0.939 for overnight stays the establishments of touristic reception. In order to define the decision rule concerning the null hypothesis, the calculated t values will be compared with the critical value of t at a significance level of 0.05 in the case of a two-tailed test, with 13- (3+1), meaning with 9 degrees of freedom. This value is ±.6. The result is: In the cases when investments calculated t (- 3.771) is lower than critical t (-.6). The level of significance indicated by the test 0,004 is lower than the chosen level of significance of 0.05. Therefore the null hypothesis is rejected and it is accepted that β 1 is 61

Revista de Turism [Nr. 8] different from zero. In the case of the variable tourists arrivals in the establishments of the calculated t (3. 765) is higher than critical t (.6). The level of significance indicated by the test of 0.004 is lower than the chosen level of significance 0.05. Therefore the null hypothesis is rejected and it is β is different from zero. In the case of overnight stays the accepted that establishments of, the calculated t (-0.939) is higher than critical t (-.6). Likewise level of significance indicated by the test of 0.37 is higher than the chosen level of significance 0.05. In this case the null hypothesis is accepted and it is considered that β 3 is equal to zero. Therefore, it is considered that the variable overnight stays in the establishments of is not a significant predicator for the dependent variable GDP obtained in the hotel and restaurant sectors. In this case the regression model will no longer contain this variable. Table 5 - Estimation of standard error deviation Model Summary Model R R Square Adjusted R Square Std. Error of the Estimate 1.908(a).84.789 101.9791 If we will make the determinations for the new regression model, the result will be: a. Predictors: (Constant), Tourists arrivals in the establishments of (total, thousands of tourists), Investments in hotels and restaurants (millions RON current prices) In this case the coefficient of determination R indicating the percentage of how much of the total variance is explained by independent variables, is 8.4% (Table 5) as compared to 84% in the case of the model with 3 variables (Table 3). This means that the abandoned variable caused a rise of R by 1.6% an extremely small growth for an independent variable. The reach of a standard error of the estimation even lower as compared to the initial model is determined: 101.97 (table no. 5) as opposed to 11019.05 million RON (table no. 3). The other processing results are: Table 6 - Regression coefficients Coefficients (a) Model 1 Indicators Unstandardized Coefficients B Std. Error Beta (Constant) -16570.909 3580.97 Investments in hotels and restaurants (million RON in current prices) tourists arrivals in the establishments of touristic reception (total, thousand tourists) -6.887 1.669-1.33 Standardized Coefficients t Sig. - 4.68-4.17.001.00 5.147.841 1.89 6.13.000 a. Dependent Variable dependent: The Gross Domestic Product obtained in the hotel and restaurants sector (millions RON, current prices) A new regression equation results from the above: ŷ = -16570.909-6.887 x 1 +5.147 x (4) Table 7 - Analysis of variance ANOVA (b) Model Sum of Squares df Mean Square F Sig. 1 Regression 4810936.356 4051468.178 3.439.000(a) Residual 106166.073 10 10616.607 Total 583640.49 1 6

Journal of tourism [No. 8] Predictors: (Constant), Investments in hotels and restaurants (millions RON, current prices), Tourists arrivals in the establishments of touristic reception (total, thousands of tourists) b. Dependent variable: The Gross Domestic Product obtained in the hotels and restaurants sector (millions RON, current prices) On the basis of the data above a higher value of calculated F can be also ascertained of 3.439 (Table 7) as compared to 15.734 (Table 4) that enforces the statistical significance of the explanation by regression of variable dependents..1. The issue of multicollinearity The most wide spread issue that comes up in the case of multiple regression is related to the results of the powerful interdependent connections between independent variables (Lefter, 004, p.370). In order to construe correctly the relations between variables one must know the correlation coefficients, which are presented in the following matrix: Table 8 - Methods of multivariate analysis of dependencies s Gross Domestic Product obtained in the hotel and restaurants sector (millions RON, current prices) Investments in hotels and restaurants (millions RON, current prices) Tourists arrivals in the establishments of (total, thousands of tourists ) Overnight stays the establishments of (total, thousands) Gross Domestic Product obtained in the hotel and restaurants sector (millions RON, current prices) Investments in hotels and restaurants (millions RON, current prices) Tourists arrivals in the establishments of touristic reception (total, thousands of tourists ) Overnight stays the establishments of touristic reception (total, thousands) 1.406.74(**).757(**) Sig. (-.168.005.003.406 1.896(**).660(*) Sig. (-.168.000.014.74(**).896(**) 1.883(**) Sig. (-.005.000.000.757(**).660(*).883(**) 1 Sig. (-.003.014.000 The data above shows the existence of certain correlations between independent variables of the model, namely the existence of multicollinearity. This phenomenon distortions the standard error of the estimation and can lead to incorrect conclusions regarding to which of the variable is statistically significant or not. The most powerful correlation exists between the investments in hotels and restaurants and tourists arrivals in the establishments of since it has the value of 0.896 then between tourists arrivals in the establishments of and the, overnight stays the establishments of that is 0.883 and finally the correlation between the overnight stays the establishments of and investments in hotels and restaurants which is 0.660 (Table 8). It is considered that there are no special problems if the correlation coefficients between the independent variables are within -0.70 and +0.70. Usually, when between the two independent variables the correlation coefficients are high, one of the variables is eliminated from the model (Lefter, 004, p. 379). The correlation coefficient between the investments in hotels and restaurants and tourists arrivals in the establishments of is 0.896. Also, the correlation coefficient between the tourists arrivals in the establishments of and overnight stays the establishments of is 0.883 and it is significant at a level of significance of 0.05 (Table 8). In order to avoid any distortion phenomenon the variable tourists overnight stays in the establishments of can be given up, for example. 63

Revista de Turism [Nr. 8] According to the results obtained by multiple regression analysis both investments and arrivals of tourists are an important predictor of GDP. It is a paradox that Romania failed to make tourism a key sector despite the fact that it has a complex and varied tourist potential, having the raw material to develop particular tourist products such as: mountain and cultural circuits, spa and rural tourism etc. In order to know the more demanding requirements of tourists quantitative and qualitative marketing research should be made periodically. Qualitative research among managers of tourist facilities and quantitative research among tourists for a profound knowledge of the consumers of travel services, all of which require investments, but I am firmly convinced that such investments would have a positive contribution to the evolution of GDP. The companies in Romanian tourism should emphasize more on marketing policies that can be also found at the level of marketing strategies. In my opinion it should be understood that the differentiation strategy through staff involved in the provision and marketing of tourism services plays a crucial role in attracting and maintaining customers. I also believe that considering the conditions of an increasingly powerful competition, the entrepreneurs in Romanian tourism should use continuous renewal of their offer, because this way they can adapt products and services to the increasingly demanding requirements of tourists. 3. CONCLUSION As a consequence, it can be ascertained that the variables: Investments in hotels and restaurants; Tourists arrivals in the establishments of are significant predictors for the GDP dependent variable obtained in the hotel and restaurant sector, in the analyzed period 1995-007. The most powerful correlation is between: The GDP obtained in the hotels and restaurants sector and, overnight stays the establishments of of 0.757, The GDP obtained in hotels and restaurants sector and tourists arrivals in the establishments of 0.74 and finally, The GDP obtained in the hotels and restaurants sectors and investments in hotels and restaurants, of 0.406. Among independent variables the most powerful correlation occurs between investments in hotels and restaurants and tourists arrivals in the establishments of, this being 0.896. The number of employees in tourism has increased but unfortunately without their training. The managers of tourist facilities have not been motivated to change their staff or to invest in training sessions. In order to resist competition in the next years, I believe that the focus should be on investments in hotels and restaurants, this variable having a strong correlation with the variable of tourists arrivals in the establishments of. Inevitably the following question arises: what are those components of tourism marketing mix that would help to increase the number of tourists arrivals in the establishments of? I do not deny that promotion is not a particularly important element in attracting customers and certainly the number of tourists might increase due to the promotion of Romania in the cold season in underground stations in Rome and Milan, action that has cost nearly 100.000 Euros and this is not the only international campaign to promote tourism to our country - in Paris, the Ministry of Tourism has launched and Romania land of choice campaign - but my personal belief is that more importance should be given to investments in expanded marketing mix components, more precisely to investments in people, processes and in physical evidence that would help retain tourists. Even if the posters posted in Milan and Rome will have a positive effect and they will contribute to increasing tourist arrivals during the winter if we do not pay increased attention and we do not invest in the three elements of the specific marketing mix and not only, there is a risk to lose these tourists won by promotion. Strategies for diversification, differentiation, renewal of products, of flexibility should have an increased application in future. BIBLIOGRAPHY 1. Lefter, C. (004) Marketing researches, Published by Infomarket, Braşov.. Popescu, C. (Grant Manager), Neguţ, S., Bran, F., Alexandru, F., Ioan, I., Rădulescu, C.V., Dogaru, D, Dragne, D.M., Necşuliu, (Borto) G., Simion, G., (005) The role of touristic activity in regional development of Romania, Journal of Science Politics and Scientometry - Special Edition, p. 8. 3. *** (008) Romanian Statistic Year Book, pp. 477, 517, 86-87. 4. *** (008) The system of statistic indicators of tourist activity, http://www.bibliotecadigitala.ase.ro/biblioteca/pagina.asp?id=cap3. 5. *** http://www.insse.ro/cms/rw/pages/index.ro.do, 009. 64