Application of Hedonic Price Model in the Prague Property Market

Similar documents
The Impact of Broadband Deployment on Recreational and Seasonal Property Values- A Hedonic Model

Chapter 10. Key Ideas Correlation, Correlation Coefficient (r),

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

Market Prices of Houses in Atlanta. Eric Slone, Haitian Sun, Po-Hsiang Wang. ECON 3161 Prof. Dhongde

Simple linear regression

A Model for Price Assessment of Residential Property in Bulgaria and its Implementation Options as SaaS

A Statistical Analysis of the Prices of. Personalised Number Plates in Britain

Microeconomics Sept. 16, 2010 NOTES ON CALCULUS AND UTILITY FUNCTIONS

The Effect of Proximity to Water on Human Behavior

The impact of open spaces on property values in Portland, Oregon

Flood Hazards, Insurance, and House Prices - A Hedonic Property Price Analysis 1

The Cobb-Douglas Production Function

Journal Of Financial And Strategic Decisions Volume 11 Number 1 Spring 1998

Hedonic prices for crude oil

Example: Boats and Manatees

The Impact of Interest Rate Shocks on the Performance of the Banking Sector

Solution: The optimal position for an investor with a coefficient of risk aversion A = 5 in the risky asset is y*:

Agency theory and foreclosure sales of properties

19 : Theory of Production

Section 14 Simple Linear Regression: Introduction to Least Squares Regression

LOGIT AND PROBIT ANALYSIS

Chapter 5 Estimating Demand Functions

BINOMIAL OPTIONS PRICING MODEL. Mark Ioffe. Abstract

Chapter 5. Conditional CAPM. 5.1 Conditional CAPM: Theory Risk According to the CAPM. The CAPM is not a perfect model of expected returns.

Causes of Inflation in the Iranian Economy

Comparison of Logging Residue from Lump Sum and Log Scale Timber Sales James O. Howard and Donald J. DeMars

Acquisition Lesson Plan for the Concept, Topic or Skill---Not for the Day

An introduction to Value-at-Risk Learning Curve September 2003

Introduction to Quantitative Methods

Measuring BDC s impact on its clients

Module 5: Multiple Regression Analysis

I. Basic concepts: Buoyancy and Elasticity II. Estimating Tax Elasticity III. From Mechanical Projection to Forecast

UNIVERSITY OF WAIKATO. Hamilton New Zealand

The Loss in Efficiency from Using Grouped Data to Estimate Coefficients of Group Level Variables. Kathleen M. Lang* Boston College.

Regression III: Advanced Methods

Premaster Statistics Tutorial 4 Full solutions

X X X a) perfect linear correlation b) no correlation c) positive correlation (r = 1) (r = 0) (0 < r < 1)

NATIONAL ASSOCIATION OF REALTORS. National Center for Real Estate Research

Augmenting Housing Sales Data to Improve Hedonic Estimates of Golf Course Frontage

Univariate Regression

Part 2: Analysis of Relationship Between Two Variables

Chapter 5 Financial Forwards and Futures

1) Write the following as an algebraic expression using x as the variable: Triple a number subtracted from the number

As we explained in the textbook discussion of statistical estimation of demand

Urban Economics. Land Rent and Land Use Patterns PART II Fachgebiet Internationale Wirtschaft Prof. Dr. Volker Nitsch Nicolai Wendland

RESEARCHING THE CONNECTION BETWEEN THE URBAN OPEN SPACES AND PROPERTY VALUES IN THE CASE OF BUDAPEST

Chapter 3 Quantitative Demand Analysis

One Period Binomial Model

Real Estate Agent Remarks: Help or Hype?

Technical Efficiency Accounting for Environmental Influence in the Japanese Gas Market

HYPOTHESIS TESTING: CONFIDENCE INTERVALS, T-TESTS, ANOVAS, AND REGRESSION

Logs Transformation in a Regression Equation

The siting of cellular phone transmitting antennas, their base stations, and

A Primer on Forecasting Business Performance

Journal Of Financial And Strategic Decisions Volume 9 Number 2 Summer 1996

Unit 31 A Hypothesis Test about Correlation and Slope in a Simple Linear Regression

MATH BOOK OF PROBLEMS SERIES. New from Pearson Custom Publishing!

Non Parametric Inference

Physics Lab Report Guidelines

Chapter 3 Local Marketing in Practice

Simple Linear Regression Inference

Chapter 9: Perfect Competition

Public Choices and Property Values

INFLATION, INTEREST RATE, AND EXCHANGE RATE: WHAT IS THE RELATIONSHIP?

Markups and Firm-Level Export Status: Appendix

Time Series Analysis

Chapter 7 Monopoly, Oligopoly and Strategy

3.3 Applications of Linear Functions

Moderator and Mediator Analysis

New Work Item for ISO Predictive Analytics (Initial Notes and Thoughts) Introduction

Module 3: Correlation and Covariance

Simple Predictive Analytics Curtis Seare

Transport demands in suburbanized locations

9 Hedging the Risk of an Energy Futures Portfolio UNCORRECTED PROOFS. Carol Alexander 9.1 MAPPING PORTFOLIOS TO CONSTANT MATURITY FUTURES 12 T 1)

DOES GOVERNMENT R & D POLICY MAINLY BENEFIT SCIENTISTS AND ENGINEERS?

Marketing Mix Modelling and Big Data P. M Cain

Session 9 Case 3: Utilizing Available Software Statistical Analysis

The Effect of Housing on Portfolio Choice. July 2009

Econometrics Problem Set #3

Risk Decomposition of Investment Portfolios. Dan dibartolomeo Northfield Webinar January 2014

Inequality, Mobility and Income Distribution Comparisons

T-test & factor analysis

II- Review of the Literature

The Effects of Start Prices on the Performance of the Certainty Equivalent Pricing Policy

Descriptive Statistics

Conditional guidance as a response to supply uncertainty

PITFALLS IN TIME SERIES ANALYSIS. Cliff Hurvich Stern School, NYU

Information Management and Business Review (ISSN ) Vol. 8, No. 1, pp , February 2016

The Impact of Commissions on Home Sales in Greater Boston

INCORPORATION OF LIQUIDITY RISKS INTO EQUITY PORTFOLIO RISK ESTIMATES. Dan dibartolomeo September 2010

4. Simple regression. QBUS6840 Predictive Analytics.

Measurement with Ratios

Chapter 9. The Valuation of Common Stock. 1.The Expected Return (Copied from Unit02, slide 36)

An Analysis of Price Determination and Markups in the Air-Conditioning and Heating Equipment Industry

National Money as a Barrier to International Trade: The Real Case for Currency Union Andrew K. Rose and Eric van Wincoop*

Lecture Notes on Elasticity of Substitution

Lecture 2. Marginal Functions, Average Functions, Elasticity, the Marginal Principle, and Constrained Optimization

1. What is the critical value for this 95% confidence interval? CV = z.025 = invnorm(0.025) = 1.96

Transcription:

Application of Hedonic Price Model in the Prague Property Market Jan MELICHAR a*, Ondřej VOJÁČEK b, Pavel RIEGER c, Karel JEDLIČKA d a Charles University Environment Center, jan.melichar@czp.cuni.cz b Department of Environmental Economics, University of Economics, ovojacek1@centrum.cz c Department of Law, University of Economics, pavel.rieger@vse.cz d Department of Mathematics, University of West Bohemia, smrcek@kma.fav.zcu.cz Abstract Hedonic price model was developed to quantify the influence of structural, accessibility and environmental attributes on the price of housing in the city of Prague. We have considered the size of flat as structural variable, the distance from the dwelling to the city center and to the nearest tube station as accessibility variables, and the proximity to the nearest urban forest as environmental variable. We have developed several regression models and tested their robustness in terms of the significance of parameters and the amount of variability explained. The sample is made up of 1,708 observations gathered from the city of Prague. Results show the size of flat is the most relevant variable explaining the price of housing. We have found also a significant inverse relationship between the selling price and its distance from a city center and urban forests. Keywords: hedonic price, regression analysis, geographical information system, property market Introduction The paper presents hedonic price model (HPM) used to estimate the differential premium on property value derived from structural attributes and from proximity to some neighborhood and environmental attributes. This methodology is applied to estimate the implicit value of housing proximity to city center, tube station and urban forest in the city of Prague. We have tested various types of regression models to infer the significant determinants of housing market in Prague. Relevant information used in the regression analysis comes from a sample of 1,708 residential dwellings. Hedonic price model is derived from Lancaster s consumer theory [Lancaster, 1966] and it empirical models have been applied since the 1960s. HPM is based on the idea that properties are not homogenous and can differ in respect to a variety of characteristics. The method relies on the fact that house prices are affected by many factors such as number of rooms, size of flat or garden, or access to city center. We suppose that the price of housing is determined by the particular combination of characteristics it displays. We can describe any particular property by the quantities and the qualities or characteristics of its structure, location and environs. To reveal the empirical relationship between the price of housing and its characteristics, the regression analysis is used as the primary statistical tool. Property prices are regressed against sets of explanatory variables, these are typically calculated using structural attributes of the * Address of correspondence: Jan Melichar, U Kříže 8, 158 00 Prague, Czech Republic

housing (such as number of bedrooms), neighborhood variables (quality of local schools), accessibility variables (proximity to railway station) and environmental variables (ambient noise level). The results of regression analysis are used to derive a hedonic price function that indicates how much the price of housing will change for a small change in each characteristic, holding all other characteristics constant. The hedonic price function can be used to determine how much more must be paid for a property with an each extra unit of a particular housing characteristic. This is known as the implicit price of a property characteristic. The application of the hedonic price method to examine the impacts of environmental characteristics on the house prices has been widespread, ranging from air quality to noise exposure, landscape and urban amenities. In this study, our primer interest is to examine the impact of urban forest on the house prices. More than 30 studies have shown that people are willing to pay more for a housing located close to an urban open space than for a house that does not offer this amenity. With few exceptions [Luttik, 2000; Schroeder, 1982] studies find that homes adjacent to naturalistic parks and open spaces are typically valued at about 8 to 20 percent more than comparable properties [Crompton, 2001]. Values show a linear decline with distance from the edge of an open space, with a positive price effect declining to near zero at about a half mile away [Hammer et al., 1974; Tyrvainen et Miettinen, 2000; More et al., 1988]. Also, the study done by Morancho [2008] indicates that the price of the housing relates inversely with the distance that separates it from an urban green space. This result is in accordance with Bolitzer and Netusil [2000] who concluded that proximity to an open-space can have a statistically significant effect on home selling price. As well as with Tyrvainen and Miettinen [2000] whose results demonstrated that a 1 km increase in the distance from the nearest forested area leads to an average 5.9 % decrease in the market price of the dwelling. In the following sections of the paper, the theoretical foundations of the method, the description of data set, the empirical models used and the results obtained are presented. Theoretical rinciples of hedonic price method Griliches [1971] and Rosen [1974] provided the theoretical support for the development of the hedonic price models. HPM relates the market price of the housing commodity (house, flat or rental housing) with the structural and accessibility characteristics that define it. It enables us to derive the monetary value of each characteristic that is calculated by observing the differences in the housing prices sharing the same attributes. The basic hypothesis is that the housing is formed by a heterogeneous set of characteristics. Therefore, when buying a house, we consider the price paid as the sum of the prices paid for each attribute (called implicit price). This could be expressed as follows: ( x, x ) P = f,..., (1) 1 2 xn where P is the market price of the house and x 1, x 2,, x n are the housing characteristics it embodies. The partial derivatives of the price with respect to the previous variables, P/ x i, provide information on the marginal implicit price for an additional unit of each characteristic. The set of structural and accessibility variables such as size of flat could not be the only determinants that explain the price of housing commodity. There are also environmental aspects such as proximity to a green area, noise level that could also explain the differences in

its market price. Then, the price function specified in equation (1) can be formulated as follows: ( x, x,..., x z) P = f (2) 1 2 n, where P is again the house market price; x 1, x 2,, x n are structural and accessibility characteristics and z is the environmental variable without a market price (the hedonic variable). The essence of the method consists of finding what portion of the price is determined by the hedonic variable. This information is obtained again by calculating the partial derivative of the price with respect to the variable z, P/ z, which gives us the marginal implicit value for an additional unit of the environmental asset, and thus allows us to obtain an estimate of its monetary value. The theoretical model specified in equation (2) has been used to obtain the implicit price of the distance to urban forests as environmental variable considered in this study. Study area and data In order to apply the hedonic price model we had to obtain relevant information about sales of dwellings realized on the Prague s real estate market. Individual housing characteristics including selling price were obtained from the real estate catalogue operating by the Czech company reality.cz (see http://www.reality.cz). The data set contains information on 1,708 apartments and flats sold to personal ownership from 2005 to spring 2008. See the geographical position of data set in Figure 1 as was detected by ArcExplorer. Figure 1: Delineation of geographical position of the data set using ArcExplorer In this study, the distance from the dwelling to the city center, the nearest tube station (accessibility variables) and the nearest urban forest (environmental variable) were measured by Geographical Information System (GIS), software ArcGIS respectively. GIS also served as

the important tool for detecting wrong observations (duplicity in observations, wrong positions of dwellings etc.). Altogether, 279 records have been detected as wrong, hereafter not used in the further analysis. All variables used in the regression analysis, including their definitions and expected influence on the dependent variable are summarized in Table 1. Table 1: Dependent, structural, accessibility and environmental variables Variable Description Expected sign* Price Price of the property (in CZK) DV Flat_area The area of the flat (in the square meters) + Center Distance to the city center (in meters) - Tube Distance to the nearest underground station (in meters) - Forest Distance to the edge of the nearest urban forest (in meters) - Note: DV marks dependent variable * + increasing / - reducing effect on the selling price Variable Price is the selling price of flat or apartment expressed in CZK and represents the dependent variable. In the proposed regression models the variable price is explained by the following explanatory variables Flat_area: the size of flat (structural variable), Center: distance to the city center represented by the Saint Wenceslas Statue (accessibility variable), Tube: the proximity to the nearest tube station (accessibility variable), Forest: distance to the nearest urban forest in meters (environmental variable). Altogether, we have considered 21 of the main urban forests and forest parks in Prague. Table 2 gives maximum, minimum and average values for the dependent variable and the explanatory variables. Table 2: Descriptive statistics (n = 1,708) Variable Mean Standard deviation Minimum Maximum Price 4,008,013 2,987,803 400,000 35,000,000 Flat_area 74 36 15 455 Center 5,130 3,355 86 20,435 Tube 1,432 1,564 17 9,774 Forest 1,560 1,333 1 7,074 The average price of flat is 4 Mil. CZK and ranges from 400 thousands CZK to 35 Mil. CZK. The size of flat is in average 74 m 2. The average distance from dwelling to the city center is slightly above 5 km, on the other hand the distance to the nearest tube station is in average around 1.5 km, and the proximity to the nearest urban forest is slightly above 1.5 km. These results indicate that, in general, people prefer to live in the suburb parts of Prague, quite far from the city center, but with an easy accessibility to the tube station and the green area. Empirical models

The relationship between the selling price and the characteristics of housing can take several functional forms. In this study, we have specified four functional forms: linear, log-linear, linear-log, and log-log. If the linking price relationship with the characteristics of housing is assumed to be linear (both the dependent and explanatory variables enter the regression in their linear form), the equation (2) becomes: P i = b x + b x +... + b 1 1i 2 2i n i = 1, 2,..., T x ni + b z z i + ε i (3) where x 1i, x 2i,, x ni, z i are variables describing the attributes of housing i, parameters b 1, b 2,, b n, b z represent the marginal implicit price of each attribute and ε i is the error term. The marginal implicit price for an additional unit of the environmental good z is b z. Under the linear specification, the willingness to pay for an additional unit remains constant, i.e. it does not depend on the starting level of z. This assumption is a strong restriction since, as Rosen [1974] points out, there are many reasons to suppose the relationship between the price and the environmental variable to be non-linear. Thus, logarithmic specifications are frequently formulated, as follows. In the case of log-linear specification, the log of the dependent variable is regressed against linear explanatory variables: ( Pi ) = b1 x1i + b2 x2i +... + bn xni + bz zi + εi ln (4) In equation (4), both its slope and elasticity change at each point are the same sign as b n. On the other hand, linear-log specification represents the relationship where a linear dependent variable is regressed against the log of the explanatory variables: Pi ( x1i ) + b2 ln( x2i ) +... + bn ln( xni ) + bz ( zi ) + εi = b1 ln ln (5) The slope of the function is b n /x and elasticity of the variable is represented by b n /y. Log-log is the specification form where both the dependent and explanatory variables enter the regression in their log form, as follows: ( Pi ) = b1 ln( x1i ) + 2 ln( bx2i ) +... + bn ln( xni ) + bz ln( zi ) + εi ln (6) A logarithmic model allows us to measure the impact that changes in explanatory variables cause in the dependent variable in relative terms. Then, the parameter b n represents the value of elasticity of corresponding variable. Results Table 3 presents the results of the ordinary least squares (OLS) regressions for the four specified models. The explanatory variables all are statistically significant at the 0.01 significance level, except Tube variable for linear, linear-log and log-log model that is not significant even at the 0.05 level. All significant variables have the expected signs. Log-linear

model however fail to confirm the researchers expectation, that the distance to the tube station would have negative impact on the price, i.e. the higher the distance to the tube, the lower the price. The distance to the tube station is almost in all regression models insignificant, also it seems to be very unstable in the terms of the estimated sign. The reason is the high correlation between the distance to the city center and the distance to the nearest tube station. The Pearson correlation coefficient of these two variables is 0.54. That is why, Tube variable is excluded in the final model. Table 3: Estimated coefficients for the specified models (t-statistic in parenthesis) Linear Linear-log 1 Log-linear 2 Log-log 3 Flat_area Center Tube Forest Constant 54,000 3,939,424 0.01 0.88 (39.03) (35.04) (44.61) (51.50) -264.76-1,308,635-0.000054-0.25 (-13.54) (-17.01) (-15.91) (-21.17) 76.80 ns -66,782 ns 0.000023 0.003803 ns (1.80) (-1.04) (3.14) (0.39) -157.14-187,747-0.000028-0.03 (-4.17) (-4.75) (-4.21) (-5.00) 1,481,644-11,281 ns 15 14 (9.09) (-0.01) (514.69) (102.26) Adjusted R 2 0.55 0.55 0.62 0.71 F-statistic 530 530 689 1,022 Observations 1,708 1,708 1,708 1,708 All coefficients are significant at the 0.01 level. ns Not significant at the 0.05 level. 1 Explanatory variables are in their log form. 2 Dependent variable is in logarithmic scale. 3 All variables are in their log form. The variability of the housing prices measured by R 2 adjusted is accounted differently in the estimated models. Both models, linear and linear-log, explain 55 % of the variation in the housing prices, log-linear accounts 62 %. However, log-log model accounts for 75 % of the price variance, thus provides the best fit from all these models. To see how the predictability of log-log model is changed when the variable of the distance to the nearest tube station is excluded we have run OLS regression for log-log model once more. The results are presented in the following Table 4. Table 4: Log-log model estimated by OLS, Tube variable excluded Unstandardized Standard Standardized B B coefficient error coefficient t-ratio P-value Constant 13.56 0.13 103.40 0.00 Ln_Flat_area 0.88 0.02 0.69 51.53 0.00 Ln_Center -0.25 0.01-0.35-25.62 0.00

Ln_Forest -0.03 0.01-0.07-5.24 0.00 Adjusted R2 0.71 F-statistic 1,363 Observations 1,708 Dependent variable as Ln of price The estimated coefficients have similar values as those derived in log-log model with Tube variable. Regarding the partial significance of each variable, the Student s t-statistic reveals the housing living area as the variable with the greatest explanatory power. It is followed by the variable referring to the distance to the city center and to the nearest urban forest. The explanatory variables all are statistically significant at the 0.01 significance level. The set of explanatory variables accounts for 71 % of the price variance and the F-ratio test indicates that the model fits better (F-ratio = 1,363) than the log-log where the variable of the distance to the tube station is included (F-ratio = 1,022). Using the estimated coefficients presented in Table 3, the relationship between the distance to the nearest urban forest and the price of housing with a given set of characteristics can be approximated by a hedonic price function. The hedonic price function derived for each model is presented in Figure 2. The hedonic price functions show the decrease in the housing price as the distance from urban forest increases while holding the other variables constant. Figure 2: Hedonic price function for each specified model 5 500 000 Price (CZK) 4 500 000 3 500 000 linear-log log-log linear log-linear 2 500 000 0 500 1 000 1 500 2 000 2 500 3 000 3 500 4 000 Distance to the nearest urban forest (in meters) However, the price reduction is quite moderate. This indicates low elasticity, i.e. the percentage change in the price of flat associated with a 1 % change in the proximity to urban forest. The elasticity ranges in average from -0.06 % for linear model to -0.03 % for log-log model. Nevertheless, considering the hedonic price function derived from log-log model, it is evident that the price of flat falls down significantly up to 400 meters from dwelling, then decreases slightly. According to the obtained coefficients in the final log-log model and having the variables at their mean values it can be said that:

Each additional square meter in the flat size increases the price by 2,263 CZK, An increase in the distance from the city center about 1 meter decreases the price by 630 CZK, and The same increase in the distance from urban forest decline the price by 79 CZK. This means that a 1 km increase in the distance from the nearest forested area leads to an average 1.96 % decrease in the market price of the dwelling in Prague. Conclusions As preceding hedonic pricing studies have shown, the effects of urban environmental amenities such as urban forest on the price of housing are measurable and significant. Using a hedonic price model and ordinary least squares regression, we have estimated a hedonic price function of housing (flats and apartments) in which the sale price is related with the proximity to urban forests in the city of Prague. Together with a set of conventional explanatory variables such as the size of flat, the distance to the center city and the proximity to the nearest tube station, the distance to the nearest urban forest or forested park as environmental variable was included in the right-hand side of the regression. We have tested four types of regression models. Compared to linear, linear-log and log-linear, log-log model accounts for 75 % of the price variance, thus provides the best fit from all these models. Regression analysis revealed the size of flat as the variable with the greatest explanatory power. Other statistical variables are the distance to the city center and to the nearest urban forest. As expected, the size of housing affects the price of flat positively, on the other hand, the distance to the city center and to the nearest urban forest decrease the price of housing. According to the estimates derived by log-log model, every 1 m further away from urban forest means a drop of 79 CZK in the housing price. It means that a 1 km increase in the distance from urban forest leads to an average 1.96 % decrease in the housing price. Compared to other studies that had assessed this effect at about 6 to 20 %, the distance effect in this study can be considered as modest. Although, the influence is moderate, the proximity to urban forest is significant determinant influencing people s preferences where to live. The policy implication for urban planning could be to establish and maintain the green areas, even in the suburb parts of Prague where, at present, people prefer to live. Acknowledgement The research on this study was supported by the grant of the Internal Grant Agency, the University of Economics: IG 50 90 37 Application of the hedonic price theory at the labour and real estate market in the Czech Republic. This support is gratefully acknowledged. References Bolitzer, B., Netusil, N. R. (2000): The impact of open spaces on property values in Portland, Oregon, Journal of Environmental Management, 59 (3), 185 193. Crompton, J. L. (2001): Parks and Economic Development, Washington D. C.: American Planning Association.

Griliches, Z. (1971): Price Indexes and Quality Change, Harvard University Press, Cambridge, MA. Hammer, T. R., Coughlin, R. E., Horn, E. T. (1974): The Effect of a Large Park on Real Estate Value, Journal of the American Institute of Planners, 40, 274-277. Lancanster, K. J. (1966): A new approach to consumer theory, Journal of Political Economy, 74, 132-57. Luttik, J. (2000): The Value of Trees, Water and Open Space as Reflected by House Prices in the Netherlands, Landscape and Urban Planning, 48, 161-167. Morancho, A. B. (2003): A hedonic valuation of urban green areas, Landscape and Urban Planning, 66, 35 41. More, T. A., Stevens, T. H., Allen, P. G. (1988): Valuation of Urban Parks, Landscape and Urban Planning, 15, 139-152. Rosen, S. (1974): Hedonic prices and explicit markets: production differentiation in pure competition, Journal of Political Economics, 82, 34 55. Schroeder, T. D. (1982): The Relationship of Local Park and Recreation Services to Residential Property Values, Journal of Leisure Research, 14 (3), 223-234. Tyrvainen, L., Miettinen, A. (2000): Property prices and urban forest amenities, Journal of Environmental Economics and Management, 39, 205 223.