Real Estate Customer Relationship Management using Data Mining Techniques Tianya Hou and Andy K.D. WONG (852) 27667805 tianya.hou@conncet.polyu.hk and bskdwong@polyu.edu.hk Department of Building and Real Estate, The Hong Kong Polytechnic University Hung Hom, Kowloon, Hong Kong Abstract The development of real estate market is undergoing three stages: from product-oriented to market-oriented, and finally to customer-oriented. Incorporation of customer relationship management (CRM) system into real estate enterprises would help explain and predict the behaviors of real estate decision-makers, which would benefit enterprises in fighting for market shares and winning customers in the fierce competition. However the large amount of the complicated data produced from real estate CRM system has already outpaced the digestion of human beings, and the traditional statistics methods cannot fulfill the requirements of the. To tackle this problem, data mining techniques are proposed to integrate into real estate CRM system. Data mining techniques would help real estate enterprises identify the taste and the preference of real estate purchasers. According to that information, strategies about the land location, the apartment type, and the marketing tools will be set up. This paper firstly presents a review of some CRM concepts relevant to real estate and the corresponding data mining applications in this area, and suggests how these can expand real estate study. Secondly, it provides an example: the customer housing demand using data mining technique. Factors which most affect customer housing demand have been identified. Some implications about developing strategies and market investigation for the real estate firms are inferred, in order to improve the sales turnover rates and reduce the market cost. Key words: Real Estate Market, Customer Relationship Management (CRM), Data Mining 1. Introduction The development of real estate market is undergoing three stages: from product-oriented to market-oriented, and finally to customer-oriented ( 叶 开 ). With the size of the housing, real estate, and mortgage-related industry expanding and the high market competition, it is critical that traders use the best possible information available to improve backend operations and sharpen their customer offerings in real estate business. So the incorporation of customer relationship management (CRM) system into real estate enterprises must be enforced. Customer relationship management helps administrators in recording back and forth 1
communications, tracking sales, and developing marketing strategies, where generates a huge amount of data to outpace the digestion of human beings. Data alone is meaningless to most people. But when it s used to create interesting products for consumers and agents, it can add value to a business. Data mining can enable home buyers and sellers to access more home information that previously was either unavailable or locked up in real estate business. Effective and comprehensive market information can enable the stakeholders stand out from the competition and give the consumers what they are looking for. The rest of this paper is organized as follows. In Section 2, this paper firstly presents a review of some CRM concepts relevant to real estate and the corresponding data mining applications in this area, and suggests how these can expand real estate study. In Section 3, it provides an example: the customer housing demand using data mining technique. Factors which most affect customer housing demand have been identified. Some implications about developing strategies and market investigation for the real estate firms are inferred, in order to improve the sales turnover rates. Finally, the prospect about application of data mining techniques in real estate CRM are made in Section 4. 2. Customer Relationship Management Some real estate organizations are still asking why CRM?. Unlike other industries (pharmaceuticals and financial services, for example) where sales support and relationship management are ubiquitous, the value proposition for CRM systems in real estate is not well understood. This is especially odd since real estate is traditionally relationship and sales driven, built on commission incentives, having complex referral models and long sales cycles. The negotiations are complex and resulting transactions are often unique. It operates in a high competitive environment where retention of tenants and successful sales is directly correlated to asset value and rate of return. With an abundance of projects to invest in, buyers have vast choices to make their real estate investment decisions. For the developers, the main challenge is to identify the market segment through the marketing campaigns with sustainable follow-up efforts to build strong pipeline and convert the footfalls into customers. Customer servicing and satisfaction can be a key differentiator for the real estate developers, where strategic technologies like customer relationship management (CRM) can play an important role in this. CRM is a business approach that integrates people, processes and technology to maximize the relations of an organization with all types of customers (Chen and Popovich). It entails acquiring and developing knowledge about one s customers and using this information across the various touch points to balance revenue and profits with maximum customer satisfaction. Leveraging data from CRM and external list data, target campaigns based on numerous customer attributes can be conducted. CRM helps in improving segmentation scalability for targeting millions of prospects. This also helps in enhancing marketing productivity, increasing conversion rates and 2
incidentally reduces campaign execution costs. Because of the rapid process of information technology, the amount of information stored in real estate CRM is rapidly increasing. These huge databases are often represented by high-dimensional feature vectors. Finding the valuable information hidden in those databases and identifying appropriate models is a difficult task. All the problems can be solved by data mining. 3. Data Mining and its application in real estate industry. Data mining is the process of exploration and, by automatic or semi-automatic means, of large quantities of data in order to discover meaningful patterns and rules. It combines techniques from machine learning, pattern recognition, statistics, database theory, and visualization to extract concepts, concept interrelations, and interesting patterns automatically from large corporate databases. Two primary functions of data mining are: prediction, which involves finding unknown values/relationships/patterns from known values; and description, which provides interpretation of a large database (Guo.). The application of data mining technology has become more widely in real estate industry. The typical uses of data mining technology in real estate industry are shown in Table 1. Table1 The application of data mining technology in real estate industry Application Application purpose Application Authors type methods Property assessment Association based on customer segmentation Customer loyalty Residential mortgage default Evaluate the house for many factors such as the residential location, transportation, surroundings, public institutions, and so on. Analyze customers characteristics for different customer segmentation, based on those results predict the customers purchase intention. Judge customers loyalty base on their attributes such as residential zone, age, income, gender, education and so on Provide some advices to the risk management department on how to prevent the mortgage default based on different 3 Regression Hedonic Artificial neural network Decision tree Association Decision tree Artificial neural network Clustering Leontion. et al. Carlos. 吉 同 路 等 北 京 大 视 野 社 会 经 济 调 查 有 限 公 司 李 莉 岑 希 Cunha. Agard. and
Housing trading and matching system Building and apartment layout risk class Based on housing trading and matching system, customers can obtain the desired type of housing information as soon as possible Find out the desired building from a large amount buildings in different condition 4 Nearest neighbour algorithm Clustering Decision tree Bayesian Artificial neural network Chang Chu 王 洋 卫 易 辰 4.Case study: customer demand of high-rise housing. 4.1 Data In order to understand the consumer s decision when choosing high-rise houses, and which of the factors individual customers prefer in combination with one another. A questionnaire survey was conducted, and each consumer was asked which factors they valued most when buying a high-rise house, and each of them can only choose 9 factors. As a result, a data set that list the factors preferred by 1079 customers was produced. The possible factors affecting the sale of high-rise houses and their abbreviations are represented as shown in Table 2. Table 2 The factors affecting the sale of high-rise houses Factors Abbreviations Regional plate RP Convenient transportation CT Supporting facilities SF Appreciation potential AP Education facilities EF Large district LD Low price LP Free area FA Apartment layout AL Brand developers BD Landscape vision LV Property management PM Hospital H School S Kindergarten K Bank B Farm product market FPM and
Supermarket Convenient store Restaurant Public garden Chess and card room Senior citizen activity centre Children recreation facilities Sports ground Open air theater Swimming pool Club S CS R PG CCR SCAC CRF SG OAT SP C There are 2 columns in the data sets, shown in Table 3. Table 3 The final data style Name Model Role Measurement level Description RS ID Nominal Respondents serials AF Target Nominal Affecting factors The data set has over 3000 rows. Each row of the data set represents a customer-factors combination. Therefore, a single customer can have multiple rows in the data set, each row representing one of the factors he or she think very important for value the houses. 4.2 Association Analysis Association is conducted to do customer s high-rise housing demand. It will produce thousands of rules. Because of the limitation of space, not all the rules are included in our studies. Some interesting association rules are shown in Table 4. Table 4 Some interesting association rules Association Rules Support (%) Confidence (%) Lift CS&FPM LP 4.36 21.56 1.73 SCAC&CCR H&CT 5.01 58.70 1.64 CS&S LP 4.17 20.27 1.63 SP&SG AP 4.73 31.48 1.60 SCAC&CT FPM&H 6.68 19.83 1.56 The table contains information for each rule. Consider the rule A B: Support of A B is the probability that a customer prefers both A and B. Confidence of A B is the probability that a customer prefer B given that the customer prefer A. lift of A B is a measure of strength of the association. If the Lift =2 for the rule A B, then a customer preferring A is twice as likely to prefer B than a customer chosen at random. Take the rule CS&FPM LP for example, 4.36% of customers prefer the house which is nearby convenient stores and farm production markets with the low price. 21.56% of customers like the house near to convenient stores and farm production markets care more about the price. The lift of the relationship CS&FPM LP is 1.73. 5
Therefore, if taking low price as a selling point, the customers like convenient stores and farm production markets are more likely to pay money to buy the house than the customers chosen at random. Other association rules can be explained in a similar way. Large amount of data already outpace the digestion of the human being, In this case study, over 3000 rows with 28 variables are included. It is impossible to extract some useful information from that data just by experts experiences and statistics methods. Association can assist us to solve that problem, broaden the analysts thinking mode, and produce some strategies for reference. 5.Conclusion With the rapidly growing data size in real estate industry when conducting customer relationship management, real estate enterprises strongly need a powerful data tools to transfer those huge data into valuable knowledge. The traditional statistics methods can not fulfill this requirement yet. Data mining techniques can compensate the limitations of statistics. Data mining can assistant real estate enterprises to identify the taste and the preference of real estate purchasers, to broaden the analysts thinking mode, and to provide some strategies for reference at the same time. References 叶 开 房 地 产 企 业 的 客 户 细 分 战 略 Chen I.J. and Popovich K.. Understanding customer relationship management (CRM): People, process and technology, Business Process Management Journal, Vol. 9 Iss: 5, pp.672 688, 2003. Guo L.J.. Applying Data Mining Techniques in Property/Casualty Insurance. Chapter 8 of Industry Applications of Data Mining. Leontion. et al.. Need a Home? Start the Data Mining!- A Data Mining Application in Weka on Real Estate Market. Carlos.. A comparison of data mining methods for mass real estate appraisal. 吉 同 路 等 住 宅 与 房 地 产 电 子 政 务 中 数 据 挖 掘 的 应 用 研 究 北 京 大 视 野 社 会 经 济 调 查 有 限 公 司 房 地 产 行 业 研 究 咨 询 部, 房 地 产 行 业 客 户 信 息 的 数 据 挖 掘 李 莉 基 于 决 策 树 数 据 挖 掘 技 术 在 地 产 营 销 中 的 应 用 岑 希 数 据 挖 掘 技 术 在 房 地 产 行 业 的 应 用 综 述 Cunha. and Agard.. Business process reengineering with data mining in real estate credict attribution: a case study. Chang. and Chu.. Applying Data Mining and XML Technology to Build a Web-Based House Trading and Matching System. 王 洋 基 于 数 据 挖 掘 聚 类 分 析 的 房 地 产 市 场 信 息 处 理 技 术 的 应 用 卫 易 辰 数 据 挖 掘 技 术 在 房 地 产 户 型 选 择 分 析 中 的 应 用 6