HANDLING IMPRECISION IN QUALITATIVE DATA WAREHOUSE: URBAN BUILDING SITES ANNOYANCE ANALYSIS USE CASE
|
|
|
- Gerald Lawson
- 10 years ago
- Views:
Transcription
1 International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XL-2/W1, 213 8th International Symposium on Spatial Data Quality, 3 May - 1 June 213, Hong Kong HANDLING IMPRECISION IN QUALITATIVE DATA WAREHOUSE: URBAN BUILDING SITES ANNOYANCE ANALYSIS USE CASE F. Amanzougarene, a, b, * M. Chachoua a, K. Zeitouni b a EIVP, École des ingénieurs de la ville de Paris, 8, rue Rébeval, 7519 Paris, France- (fatiha.amanzougarene, chachoua)@eivpparis.fr b PRISM, Université de Versailles-SQ, 45 Avenue des Etats-Unis 7835 Versailles, France- [email protected] KEYWORDS: Decision-making, imprecision management, data warehouse, multidimensional model, qualitative data processing, urban sites annoyance. ABSTRACT: Data warehouse means a decision support database allowing integration, organization, historisation, and management of data from heterogeneous sources, with the aim of exploiting them for decision-making. Data warehouses are essentially based on multidimensional model. This model organizes data into facts (subjects of analysis) and dimensions (axes of analysis). In classical data warehouses, facts are composed of numerical measures and dimensions which characterize it. Dimensions are organized into hierarchical levels of detail. Based on the navigation and aggregation mechanisms offered by OLAP (On-Line Analytical Processing) tools, facts can be analyzed according to the desired level of detail. In real world applications, facts are not always numerical, and can be of qualitative nature. In addition, sometimes a human expert or learned model such as a decision tree provides a qualitative evaluation of phenomenon based on its different parameters i.e. dimensions. Conventional data warehouses are thus not adapted to qualitative reasoning and have not the ability to deal with qualitative data. In previous work, we have proposed an original approach of qualitative data warehouse modeling, which permits integrating qualitative measures. Based on computing with words methodology, we have extended classical multidimensional data model to allow the aggregation and analysis of qualitative data in OLAP environment. We have implemented this model in a Spatial Decision Support System to help managers of public spaces to reduce annoyances and improve the quality of life of the citizens. In this paper, we will focus our study on the representation and management of imprecision in annoyance analysis process. The main objective of this process consists in determining the least harmful scenario of urban building sites, particularly in dense urban environments. 1 INTODUCTION Data warehouses and on-line analytical processing (OLAP) constitute the main elements of decision support systems. A data warehouse means a decision support database allowing integration, organization, historization, and management of data from heterogeneous sources, with the aim of exploiting them for decision-making (Kimball, 22; Inmon, 25). OLAP refers to the technology that allows users to efficiently retrieve the information stored in a data warehouse. To conceptualize data in a data warehouse, the multidimensional model is used. This model organizes data into facts (subjects of analysis) and dimensions (perspectives of analysis). A fact is composed of numerical measures and dimensions which characterize it. A dimension is organized into hierarchical levels of detail. Based on the navigation and aggregation mechanisms offered by OLAP tools, facts can be analysed according to the desired level of detail. In some real world applications, the subject of analysis may be subjective and consequently its measures are provided in qualitative fashion. In addition, sometimes a human expert or a prediction model such as a decision tree can be used to provide a qualitative evaluation of some phenomenon based on its different parameters. This arises in many applications such as customer satisfaction, process control, consumer products, and annoyance evaluation. Conventional data warehouses are thus not adapted to human reasoning and have not the ability to deal with qualitative data. In previous work, we have presented an original work that aims at making it possible to handle raw qualitative measures and providing a more flexible method for the multidimensional analysis over that type of data. Based on computing with words methodology, we have introduced qualitative measures and aggregates as an extension of multidimensional data model of a data warehouse. Using these measures and aggregates, OLAP queries allow the decision maker to manipulate data in a qualitative fashion using linguistic terms. In this paper, we will extend this model to deal with both qualitative and quantitative measures, which leads to handle imprecise data in a data warehouse. Compared to the state of the art, there exists several research works addressing aggregation over imprecise and uncertain data, among which those proposed in (Laurent, 21; Molina, 26; Burdick, 27; Delgado, 27). Our study will focus on the fuzzy fusion of qualitative and quantitative measures in the context of data warehouses. To illustrate the problematic and our proposal, we will consider throughout this paper the case of urban building sites annoyance. This paper is structured as follows. In the second section we present our work motivation and the use case related to urban building sites annoyance evaluation and analysis. In the section 3, propose the data model allowing the combination of qualitative and quantitative measures in the context of imprecise multidimensional databases. In the section 4 we present the experimentation framework that consists of a Spatial Decision Support System (SDSS) designed to the annoyance analysis. Finally, in the last section we conclude and present some perspectives. 2 MOTIVATION AND USE CASE: URBAN BUILDING SITES ANNOYANCE Although indispensable for the development and renovation of cities, urban building sites are often a source of various kinds of nuisance. These nuisances have not negligible impacts on quality of life of urban citizens. This issue is crucial and 133
2 International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XL-2/W1, 213 8th International Symposium on Spatial Data Quality, 3 May - 1 June 213, Hong Kong becomes more complex in cities with high population density. The main objective of our work is to develop a spatial decision support system (SDSS) dedicated to reducing the annoyance generated by urban building sites. We make the observation that, in human reasoning, the annoyance is evaluated subjectively and qualitatively by using an ordinal scale of linguistic degrees. Therefore, for a perfect match with the human expert reasoning, we propose in this paper a qualitative model of annoyance evaluation. In our previous studies (Amanzougarene, 212), we have presented a quantitative model that allows evaluating urban people annoyance due to the noise. By comparison, in the present work, we generalize our previous model by privileging a qualitative data handling of annoyance. We also extend our previous model of annoyance evaluation to other types of nuisance than noise, which strengthens the interest of multidimensional analysis. Indeed, an urban building site is generally likely to cause many nuisances Dimensions of Annoyance: In practice, the choice of factors to be considered for the annoyance evaluation depends on the human experts appreciation. In our case study, the experts have retained some factors related to individual, nuisance and environment. The latter is actually a combination of space and time dimensions. This leads to a multidimensional representation described by Figure 1 below and including the dimensions: (1) category of population grouping the factors related to the individual, (2) nuisance grouping the factors related to the nuisance, (3) space, and (4) time. Notice that the choice of the dimensions is application-dependant, and could add or ignore some factors such as the building type or gender. Our model adapts to other schemas as well. 2.1 Qualitative Representation of Annoyance Notion of Annoyance: As several studies show it, annoyance is an unpleasant sensation experienced by an individual facing deterioration in the quality of her/his environment (Guski, 1999; Nordin, 26; Moser, 26; Robin, 27). The annoyance may be caused by various nuisances (noise, odour, vibration, traffic congestion, air pollution ). According to various factors (intensity, moment, type ), a nuisance is likely to cause a more or less important annoyance to individual. Note that the level of annoyance caused by one or more nuisances can be different from one individual to another, depending on various factors (sensibility, age, acceptability ). This means that, a phenomenon which is not at all annoying for an individual can be extremely annoying for another individual. That reflects the subjective character of the annoyance notion. Thus, for the rest of our study, we propose the following definition for the annoyance notion. Definition1. In a spatiotemporal environment, annoyance is subjective relationship between an individual and a harmful phenomenon. In other words, an individual can be only annoyed, in the presence of one or more harmful phenomena for this individual. Thus, a human expert can evaluate subjectively the degree of annoyance, according the various factors (Amanzougarene et al. 211). The most relevant factors can be classified in three categories: (1) factors related to individual, (2) factors related to nuisance, (3) factors related to environment. The table below shows these three categories, with the main factors. Factors related to individual Age Health condition Gender Socio-professional category Acceptability Awareness Factors related to nuisance Nuisance type Nuisance intensity Exposure duration Frequency Table 1. Main factors of annoyance Factors related to environment Space Time These three categories of factors will be used to evaluate the annoyance and to determine the scenario of building site which produce the minimum of annoyance. Figure 1. Multidimensional representation of annoyance Categories of Population: In our case study, the categories of population exposed to nuisances are represented by a typical individual. These are determined by a combination of factors already presented in Table 1. For instance, a category of population could be healthy senior manager which means implicitly an adult individual of male gender, whose category socio-professional is manager, in good health condition. Another category of population could be housewife mother, meaning an unemployed adult of female gender. A third one could be child breathing problems, meaning a young individual who is not in good health condition. Nuisances: Nuisances to which are exposed the different categories of population concerned by the carrying out of an urban building site can be classified into three categories: (1) Nuisances related to the living environment: characterizing unwanted changes in habits of the population impacted (2) Nuisances related to the landscape insertion of urban building sites: describing changes in the visual environment, and (3) Sensorial nuisances: such as noise, dust, odour and vibration. Time: People are not annoyed in the same way according to the moments of the day and the periods of the year. For example, given residential area, a height noise can be accepted during the day but not at all at night. In our case study, we define a hierarchy of time. This hierarchy consists to divide year on two periods: (1) rainy period and (2) non-rainy period. The weekdays are divided into three moments: morning, evening, and night. Space: The annoyance of an individual may vary depending on his distance to the source of nuisance. Indeed, nuisances generated by urban building sites are not present in a uniform manner inside the influence area. It is thus important to decompose this 134
3 International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XL-2/W1, 213 8th International Symposium on Spatial Data Quality, 3 May - 1 June 213, Hong Kong area into several sub-areas. For example: the immediate vicinity, the influence area, and the boundary of the influence area. For this dimension we will use an a priori geographical zoning. 2.2 Annoyance Evaluation In human reasoning, the subjective evaluation of annoyance is done qualitatively by using a finite scale of linguistic degrees, such as, low, high. Generally, the human subject uses ordered scales with 5 or 7 linguistic degrees (Yager, 27). In our case study, the evaluation process is as follow (For the sake of space, we will briefly describe the evaluation process): 1. Define four combinations of dimensions: (Category of population-time), (Time-Nuisance), (Category of population-nuisance) and (Intensity-Nuisance). 2. For each combination, a scale of 1 to 4 is used. 3. The value of a given evaluation is the product of the values corresponding to the precedent combinations. 4. Thus, the interval of evaluation is [1-256]. This interval is divided to five subintervals: [-1], ]1-3], ]3-6], ]6-1] and ]1-256]. To these subintervals, it is associated respectively the following linguistic terms : Note: thereafter, in the interests of simplifying notations, we will represent these linguistic degrees respectively by 3 TOWARDS A QUALITATIVE MULTIDIMENTIONAL MODEL FOR HANDLING IMPRECISE DATA In this section, first, we will describe the multidimensional model of annoyance that will be used as a running example for the rest of the paper. Then, we will describe our proposed model to represent and mange imprecision in context of data warehouse. 3.1 Multidimensional Data Model of Annoyance In our case study concerning urban building sites, the subject of analysis corresponds to the annoyance. This subject is analyzed according to the dimensions we have presented in section 2.2. Namely: nuisances, categories of population, time, and space. To model data of urban building sites, we have used a star schema represented by Figure 2. It is actually a schema of a spatiotemporal data warehouse, since space and time dimensions with hierarchies (for sink of simplicity, we omit the detail of dimension attributes in the figure). To represent it, we use the graphical formalisms proposed by (Malinovski, 28). We have defined a fact table Annoyance. Thus, data from Annoyance table are analysed according to the dimensions: Nuisances, Categories of population, Time, and Space. Measures associated with the Annoyance fact table are: degree of annoyance, and population density. Example of Annoyance Evaluation: Let us consider a given location L1 where one has three nuisances, noise, odour, and dust. An extract of the annoyance evaluation carried out by the human experts is shown in Table 2 below. We note that, this evaluation takes into account only the following factors: (1) socio-professional category (SPC), (2) age, (3) type, (4) intensity, (5), time of day, and (6) period of year. In this evaluation, 5 levels of nuisance intensity are considered. Level 1 corresponds to the absence of nuisance, which means that the degree of annoyance is i.e. not at all annoyed. This table is an extract of the decision matrix carried out by the experts based on different dimensions of annoyance. This matrix will serve as knowledge base to populate the data warehouse designed to contain data related to annoyances. This warehouse constitutes the core of our SDSS. Space = L 1 Nuisance Type Noise Intensity 1 2 Category of population SPC Inactive resident Old Age Young Adult people Time Morning Evening Night Period Non-rainy period Time Table 2. Application: example of annoyance evaluation Figure 2. Multidimensional data model of annoyance In the current model, we are faced to two problems: 1. The model expert used to evaluate degree of annoyance provides a qualitative crisp values. Thus, this model does not capture the imprecision inherent to this measure. 2. The fusion of measures: Indeed, the managers of public spaces are interesting by the analysis of the impact of annoyance. This measure is derived from degree of annoyance, which is qualitative measure, and population density, which is numerical one. We recall that degree of annoyance is the annoyance level of an individual type representing a given category of population. Thus we define the impact of annoyance as the overall level of annoyance of a given category of population, taking into account the density of this category. For the first problem we propose to use fuzzy set to represent the imprecision which is inherent to data. 135
4 International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XL-2/W1, 213 8th International Symposium on Spatial Data Quality, 3 May - 1 June 213, Hong Kong For the second problem, we propose to use the fuzzy fusion approach which is based on the fuzzy inference. 3.2 Fuzzy Data Model In this section, we present the proposed data model focusing of the imprecise qualitative measures. This model is based on the fuzzy theory introduced by Zadeh (Zadeh, 1975). This choice is justified by: 1. Fuzzy theory offer techniques that allow to model imprecise or vague knowledge that very few other methodologies allow; 2. It is the only way to treat in the same part of the knowledge provided digitally (by instruments) and knowledge expressed symbolically (by a human observer). The model that we propose to represent imprecise measures is based on the concept of linguistic variable introduced in (Zadeh, 1975). Definition 2. A linguistic variable is a 3-tuple (N, X, T L ), where N = a symbolic variable defined of X X = universe of discourse T L = {A, A 1 A n } a set of labels (terms linguistics). Each label corresponds to a fuzzy set represented by a trapezoidal membership function. Let be an imprecise value of type. We write and we use the following general form to represent it: ( ) ( ) Where ( ) is the membership degree of to the fuzzy set For sink of simplicity, we can write ( ) ( ) Application Representation of Degree of Annoyance: To represent degree of annoyance measure, we propose to use the following linguistic variable: ( is the name of the variable. the universe of discourse. ) where a set of labels represented by the trapezoidal fuzzy sets shown by figure 3 below. In the interests of simplifying notations, we will represent these terms respectively by is the name of the variable. the universe of discourse. a set of labels represented by the trapezoidal fuzzy sets as it is shown by figure 4 below. In the interests of simplifying notations, we will represent these terms respectively by Figure 4. Fuzzy representation of population density measure Let be an imprecise value of type. We use the following general form to represent it: ( ) ( ) Representation of Impact of Annoyance: To represent the impact of annoyance, which is the output of the combination process of impact of annoyance and population density, we use the following linguistic variable: ( ) Where : is the name of the variable. : The universe of discourse. For sink of simplify, we will represent these terms respectively by. Let be an imprecise value of type. We use the following form to represent it: ( ) ( ( ) ) 3.3 Fuzzy fusion The model of fuzzy fusion that we propose allows the combination of qualitative and quantitative measures. This model is based on the fuzzy inference. Based on the works of (Siler, 25; Zadah, 1975; Feng, 21) we will examine its different components Fuzzy rule of inference: Fuzzy inference can describe the relationship between the causative and resultant variables in terms of a collection of IF-THEN fuzzy rules. Each fuzzy IF- THEN rule can take the following general form: Figure 3. Fuzzy representation of degree of annoyance measure Let be an imprecise value of type. We use the following form to represent it: ( ) ( ( ) ( ) ) Representation of Population Density: To represent population density measure, we use the following linguistic variable: ( ) Where This rule corresponds to the general statement and can be presented in the form of predicates (using the notation presented in section 3.2) as follow: Where, and are three linguistic variables, for which it is associated respectively the fuzzy sets, and. Now, an instance of the previous rule is given by: 136
5 International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XL-2/W1, 213 8th International Symposium on Spatial Data Quality, 3 May - 1 June 213, Hong Kong And the objective is to compute the conclusion ( ) given the general statement and ( ) ( ). Solution: ( ) ( ) (( ) )) Computing fuzzy conclusion: To find the conclusion ( ) we use the extended modus ponens rule, as defined by Zadeh (Zadeh 1975). If ( ) ( ) And ( ) Then ( ) ( ( ) ( )) ( ) Where is a fuzzy relation. Note that there are different approaches to calculate the relation (Siler 25). In our case, this relation has been established by the experts in the form of table (see tables 3 below for example). : is a max min or product-sum composition. : is a min or product operator. Now membership degree function of ( ) is given by: ( ) ( ( ) ( )) (( ) )) ( ) ( ) (( ) )) Where ( ) and (( ) )) are raw vector and matrices of compatibles dimensions Application: The derived measure impact of annoyance is evaluated by combining population density and degree of annoyance. We can write: The relation R corresponding to this rule is given by Tables 3 below. ( ) is given by the table below Table 4. Table used to calculate ( ) ( ) population density (DoP) degree of annoyance (DoA) Table 3. Table used to represent the fuzzy relation R A simply representation of the relation R can be a binary relation given by the matrix bellow. [ ] Now given ( ) ( ) ( ) ( ) How to calculate the combination of these two variables, let be ( ) Thus using max-min composition we obtain: ( ) [ ] So in fuzzy set form ( ) ( ) 3.4 Implementation of fuzzy fusion in data warehouse We have implemented fuzzy fusion as stocked procedure. This function is used to calculate the derived measure impact of annoyance. Procedure Fuzzy-Fusion (x, y, Op 1, Op 2 ) Input: x, y are two values of imprecise nature // for instance: population density and typical individual annoyance. Op 1 : the premises combination. Op 2 : the composition operator. Begin Construct the implication vector Read the relation matrix Do the composition Return the result // annoyance impact End This procedure has been integrated within the data server (see figure 5 below). 137
6 International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XL-2/W1, 213 8th International Symposium on Spatial Data Quality, 3 May - 1 June 213, Hong Kong 4 SPATIAL DECISION SUPPORT SYSTEM OF ANNOYANCE ANALYSIS Our objective is to assess the feasibility, the effectiveness, and the expressivity of our model within an off-the-shelf DBMS. Most modern DBMSs allow the extensibility of their typing system and/or functions. Our choice was Microsoft SQL/Server. We show that using user defined aggregates allows us developing our model. It can be used in any SQL query. The only limitation is that those functions cannot be used in the SSAS OLAP tool for interactive analysis of the multidimensional model. The global architecture of this system consists of three main components: 1. The database management system (DBMS) for managing the fact and dimensions tables of annoyance 2. OLAP server for the multidimensional analysis of annoyance, that consists of multidimensional data cubes constructing and exploitation; 3. User interface allows visualizing the annoyance aggregation data using tabular or cartographic representation. approach to the prediction of annoyance, so that it helps predict the best place and time for new building site. REFERENCES Amanzougarene, F., Chachoua, M., Zeitouni, K., 212. Qualitative representation of building sites annoyance. In Proceedings of the 212 ACM workshop on City data management workshop, pp Burdick, D., Deshpande, P. M., Jayram, T. S., Ramakrishnan, R., Vaithyanathan, S., 27. OLAP over uncertain and imprecise data. The VLDB Journal, 16(1), pp Delgado, M., Molina, C., Ariza, L. R., Sánchez, D., Miranda M. A. V., 27. F-Cube Factory: a fuzzy OLAP system for supporting imprecision. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems (IJUFKS), pp Feng, F., Li, Y., Li, C., Han, B., 21. Soft Set Based Approximate Reasoning: A Quantitative Logic Approach. Springer Berlin Heidelberg, pp Guski, R. & Felscher-suhr U The concept of noise annoyance: how international experts see it. Journal of Sound and Vibration 223(4): Inmon, W. H., 25. Building the Data Warehouse, 4th Edition. John Wiley & Sons, Inc., New York, NY, USA. Kimball, R., Ross, M., 22. The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling, 2nd. John Wiley & Sons, Inc., New York, NY, USA. Figure 5. System architecture 5 CONCLUSION AND FUTURE WORK The main objective of this work is the extension of conventional data warehouses to allow the integration and processing of both qualitative and quantitative measures. Based on computing with words methodology, we had, in a previous work (Amanzougarene, 212), introduced qualitative measures and aggregates as an extension of multidimensional data model of a data warehouse. Using these measures and aggregates, OLAP queries allow the decision maker to manipulate data in a qualitative fashion using linguistic terms. This paper has proposed an extension to deal with fuzzy data model which is in intermediary solution between conventional numeric measures and crisp qualitative. To illustrate our proposal, we have considered the case of urban building sites annoyance. We have proposed an original approach which allows managing the annoyance and its inherent imprecision, as in commons sense reasoning, by using linguistic expressions. In our future work, we will extend our approach of evaluation in order to include spatial and temporal extent of annoyance as measures in the multidimensional data model of data warehouse. We will also, define aggregation operations allowing data processing of these extents (e.g. fusion of annoyance influence areas, and concatenation of exposure time interval). Indeed, that will improve the decisions of managers of public spaces concerning urban building sites planning. So far, we have focused on the measurement and aggregation in a qualitative fashion for the annoyance from previous inputs. We also intend to extend our Laurent, A., 21. Generating Fuzzy Summaries from Fuzzy Multidimensional Databases. In Proceedings of the 4th International Conference on Advances in Intelligent Data Analysis, pp Malinowski, E., Zimányi, E., 28. Advanced Data Warehouse Design: From Conventional to Spatial and Temporal Applications. Springer-Verlag. Molina, C., Rodríguez-Ariza, L., Sánchez, D., Vila, M. A., 26. A new fuzzy multidimensional model. IEEE Transactions On Fuzzy Systems, 14(6), pp Moser, G., Robin, M., 26. Environmental annoyances: an urban-specific threat to quality of life? European review of applied psychology, 56(1), pp Nordin, S., Lidén, E., 26. Environmental odor annoyance from air pollution from steel industry and bio-fuel processing. Journal of Environmental Psychology, 26(2), pp Robin, M., Matheau-Police, A., Couty, C., 27. Development of a scale of perceived environmental annoyances in urban settings. Journal of Environmental Psychology, 27(1), pp Siler, W., Buckeley, J., 25. Fuzzy Expert Systems and Fuzzy Reasoning. John Wiley & Sons, Inc., Hoboken, NJ, USA. Yager, R. R., 27. Aggregation of ordinal information. Fuzzy Optimization and Decision Making, 6(3), pp Zadeh, L. A., The concept of a linguistic variable and its application to approximate reasoning. Information Sciences, 8(3), pp
Fuzzy Spatial Data Warehouse: A Multidimensional Model
4 Fuzzy Spatial Data Warehouse: A Multidimensional Model Pérez David, Somodevilla María J. and Pineda Ivo H. Facultad de Ciencias de la Computación, BUAP, Mexico 1. Introduction A data warehouse is defined
PartJoin: An Efficient Storage and Query Execution for Data Warehouses
PartJoin: An Efficient Storage and Query Execution for Data Warehouses Ladjel Bellatreche 1, Michel Schneider 2, Mukesh Mohania 3, and Bharat Bhargava 4 1 IMERIR, Perpignan, FRANCE [email protected] 2
A Methodology for the Design of a Fuzzy Data Warehouse
A Methodology for the Design of a Fuzzy Data Warehouse Lior Sapir, Armin Shmilovici, Member IEEE, Lior Rokach Abstract A data warehouse is a special database used for storing business oriented information
A Design and implementation of a data warehouse for research administration universities
A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon
BUILDING OLAP TOOLS OVER LARGE DATABASES
BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,
CONTINUOUS DATA WAREHOUSE: CONCEPTS, CHALLENGES AND POTENTIALS
Geoinformatics 2004 Proc. 12th Int. Conf. on Geoinformatics Geospatial Information Research: Bridging the Pacific and Atlantic University of Gävle, Sweden, 7-9 June 2004 CONTINUOUS DATA WAREHOUSE: CONCEPTS,
A FUZZY LOGIC APPROACH FOR SALES FORECASTING
A FUZZY LOGIC APPROACH FOR SALES FORECASTING ABSTRACT Sales forecasting proved to be very important in marketing where managers need to learn from historical data. Many methods have become available for
Tracking System for GPS Devices and Mining of Spatial Data
Tracking System for GPS Devices and Mining of Spatial Data AIDA ALISPAHIC, DZENANA DONKO Department for Computer Science and Informatics Faculty of Electrical Engineering, University of Sarajevo Zmaja
Linguistic Preference Modeling: Foundation Models and New Trends. Extended Abstract
Linguistic Preference Modeling: Foundation Models and New Trends F. Herrera, E. Herrera-Viedma Dept. of Computer Science and Artificial Intelligence University of Granada, 18071 - Granada, Spain e-mail:
Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies
Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com
Dimensional Modeling for Data Warehouse
Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or
LEARNING SOLUTIONS website milner.com/learning email [email protected] phone 800 875 5042
Course 20467A: Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Length: 5 Days Published: December 21, 2012 Language(s): English Audience(s): IT Professionals Overview Level: 300
Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer
Alejandro Vaisman Esteban Zimanyi Data Warehouse Systems Design and Implementation ^ Springer Contents Part I Fundamental Concepts 1 Introduction 3 1.1 A Historical Overview of Data Warehousing 4 1.2 Spatial
University of Gaziantep, Department of Business Administration
University of Gaziantep, Department of Business Administration The extensive use of information technology enables organizations to collect huge amounts of data about almost every aspect of their businesses.
Business Intelligence in E-Learning
Business Intelligence in E-Learning (Case Study of Iran University of Science and Technology) Mohammad Hassan Falakmasir 1, Jafar Habibi 2, Shahrouz Moaven 1, Hassan Abolhassani 2 Department of Computer
MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012
MS 20467: Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Description: This five-day instructor-led course teaches students how to design and implement a BI infrastructure. The
DATA WAREHOUSING AND OLAP TECHNOLOGY
DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are
On Development of Fuzzy Relational Database Applications
On Development of Fuzzy Relational Database Applications Srdjan Skrbic Faculty of Science Trg Dositeja Obradovica 3 21000 Novi Sad Serbia [email protected] Aleksandar Takači Faculty of Technology Bulevar
BUILDING A WEB-ENABLED DATA WAREHOUSE FOR DECISION SUPPORT IN CONSTRUCTION EQUIPMENT MANAGEMENT
BUILDING A WEB-ENABLED DATA WAREHOUSE FOR DECISION SUPPORT IN CONSTRUCTION EQUIPMENT MANAGEMENT Hongqin Fan ([email protected]) Graduate Research Assistant, University of Alberta, AB, T6G 2E1, Canada Hyoungkwan
Data Warehousing Systems: Foundations and Architectures
Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository
A Brief Tutorial on Database Queries, Data Mining, and OLAP
A Brief Tutorial on Database Queries, Data Mining, and OLAP Lutz Hamel Department of Computer Science and Statistics University of Rhode Island Tyler Hall Kingston, RI 02881 Tel: (401) 480-9499 Fax: (401)
Dimensional Modeling and E-R Modeling In. Joseph M. Firestone, Ph.D. White Paper No. Eight. June 22, 1998
1 of 9 5/24/02 3:47 PM Dimensional Modeling and E-R Modeling In The Data Warehouse By Joseph M. Firestone, Ph.D. White Paper No. Eight June 22, 1998 Introduction Dimensional Modeling (DM) is a favorite
Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Course 20467A; 5 Days
Lincoln Land Community College Capital City Training Center 130 West Mason Springfield, IL 62702 217-782-7436 www.llcc.edu/cctc Designing Business Intelligence Solutions with Microsoft SQL Server 2012
DIMENSION HIERARCHIES UPDATES IN DATA WAREHOUSES A User-driven Approach
DIMENSION HIERARCHIES UPDATES IN DATA WAREHOUSES A User-driven Approach Cécile Favre, Fadila Bentayeb, Omar Boussaid ERIC Laboratory, University of Lyon, 5 av. Pierre Mendès-France, 69676 Bron Cedex, France
RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE
RESEARCH ON THE FRAMEWORK OF SPATIO-TEMPORAL DATA WAREHOUSE WANG Jizhou, LI Chengming Institute of GIS, Chinese Academy of Surveying and Mapping No.16, Road Beitaiping, District Haidian, Beijing, P.R.China,
CUBE INDEXING IMPLEMENTATION USING INTEGRATION OF SIDERA AND BERKELEY DB
CUBE INDEXING IMPLEMENTATION USING INTEGRATION OF SIDERA AND BERKELEY DB Badal K. Kothari 1, Prof. Ashok R. Patel 2 1 Research Scholar, Mewar University, Chittorgadh, Rajasthan, India 2 Department of Computer
Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1
Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics
An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of
An Introduction to Data Warehousing An organization manages information in two dominant forms: operational systems of record and data warehouses. Operational systems are designed to support online transaction
Fluency With Information Technology CSE100/IMT100
Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999
Efficient Integration of Data Mining Techniques in Database Management Systems
Efficient Integration of Data Mining Techniques in Database Management Systems Fadila Bentayeb Jérôme Darmont Cédric Udréa ERIC, University of Lyon 2 5 avenue Pierre Mendès-France 69676 Bron Cedex France
OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP
Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key
Course 6234A: Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services
Course 6234A: Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services Length: Delivery Method: 3 Days Instructor-led (classroom) About this Course Elements of this syllabus are subject
Concepts of Database Management Seventh Edition. Chapter 9 Database Management Approaches
Concepts of Database Management Seventh Edition Chapter 9 Database Management Approaches Objectives Describe distributed database management systems (DDBMSs) Discuss client/server systems Examine the ways
CHAPTER-24 Mining Spatial Databases
CHAPTER-24 Mining Spatial Databases 24.1 Introduction 24.2 Spatial Data Cube Construction and Spatial OLAP 24.3 Spatial Association Analysis 24.4 Spatial Clustering Methods 24.5 Spatial Classification
Implementing Data Models and Reports with Microsoft SQL Server 20466C; 5 Days
Lincoln Land Community College Capital City Training Center 130 West Mason Springfield, IL 62702 217-782-7436 www.llcc.edu/cctc Implementing Data Models and Reports with Microsoft SQL Server 20466C; 5
Available online at www.sciencedirect.com Available online at www.sciencedirect.com. Advanced in Control Engineering and Information Science
Available online at www.sciencedirect.com Available online at www.sciencedirect.com Procedia Procedia Engineering Engineering 00 (2011) 15 (2011) 000 000 1822 1826 Procedia Engineering www.elsevier.com/locate/procedia
Tutorials for Project on Building a Business Analytic Model Using Data Mining Tool and Data Warehouse and OLAP Cubes IST 734
Cleveland State University Tutorials for Project on Building a Business Analytic Model Using Data Mining Tool and Data Warehouse and OLAP Cubes IST 734 SS Chung 14 Build a Data Mining Model using Data
Data Integration and ETL Process
Data Integration and ETL Process Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, second
Data warehouses. Data Mining. Abraham Otero. Data Mining. Agenda
Data warehouses 1/36 Agenda Why do I need a data warehouse? ETL systems Real-Time Data Warehousing Open problems 2/36 1 Why do I need a data warehouse? Why do I need a data warehouse? Maybe you do not
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Data Mining and Database Systems: Where is the Intersection?
Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: [email protected] 1 Introduction The promise of decision support systems is to exploit enterprise
SQL Server 2012 Business Intelligence Boot Camp
SQL Server 2012 Business Intelligence Boot Camp Length: 5 Days Technology: Microsoft SQL Server 2012 Delivery Method: Instructor-led (classroom) About this Course Data warehousing is a solution organizations
Indexing Techniques for Data Warehouses Queries. Abstract
Indexing Techniques for Data Warehouses Queries Sirirut Vanichayobon Le Gruenwald The University of Oklahoma School of Computer Science Norman, OK, 739 [email protected] [email protected] Abstract Recently,
IMPROVING THE QUALITY OF THE DECISION MAKING BY USING BUSINESS INTELLIGENCE SOLUTIONS
IMPROVING THE QUALITY OF THE DECISION MAKING BY USING BUSINESS INTELLIGENCE SOLUTIONS Maria Dan Ştefan Academy of Economic Studies, Faculty of Accounting and Management Information Systems, Uverturii Street,
The Integration of Agent Technology and Data Warehouse into Executive Banking Information System (EBIS) Architecture
The Integration of Agent Technology and Data Warehouse into Executive Banking System (EBIS) Architecture Ismail Faculty of Technology and Communication (FTMK) Technical University of Malaysia Melaka (UTeM),75450
Introduction. Introduction. Spatial Data Mining: Definition WHAT S THE DIFFERENCE?
Introduction Spatial Data Mining: Progress and Challenges Survey Paper Krzysztof Koperski, Junas Adhikary, and Jiawei Han (1996) Review by Brad Danielson CMPUT 695 01/11/2007 Authors objectives: Describe
CHAPTER 5: BUSINESS ANALYTICS
Chapter 5: Business Analytics CHAPTER 5: BUSINESS ANALYTICS Objectives The objectives are: Describe Business Analytics. Explain the terminology associated with Business Analytics. Describe the data warehouse
MOBILITY DATA MODELING AND REPRESENTATION
PART I MOBILITY DATA MODELING AND REPRESENTATION 1 Trajectories and Their Representations Stefano Spaccapietra, Christine Parent, and Laura Spinsanti 1.1 Introduction For a long time, applications have
Design of a Multi Dimensional Database for the Archimed DataWarehouse
169 Design of a Multi Dimensional Database for the Archimed DataWarehouse Claudine Bréant, Gérald Thurler, François Borst, Antoine Geissbuhler Service of Medical Informatics University Hospital of Geneva,
a Geographic Data Warehouse for Water Resources Management
Towards a Geographic Data Warehouse for Water Resources Management Nazih Selmoune, Nadia Abdat, Zaia Alimazighi LSI - USTHB 2 Introduction A large part of the data in all decisional systems is geo-spatial
The Design and the Implementation of an HEALTH CARE STATISTICS DATA WAREHOUSE Dr. Sreèko Natek, assistant professor, Nova Vizija, srecko@vizija.
The Design and the Implementation of an HEALTH CARE STATISTICS DATA WAREHOUSE Dr. Sreèko Natek, assistant professor, Nova Vizija, [email protected] ABSTRACT Health Care Statistics on a state level is a
CHAPTER 4: BUSINESS ANALYTICS
Chapter 4: Business Analytics CHAPTER 4: BUSINESS ANALYTICS Objectives Introduction The objectives are: Describe Business Analytics Explain the terminology associated with Business Analytics Describe the
Microsoft 20466 - Implementing Data Models and Reports with Microsoft SQL Server
1800 ULEARN (853 276) www.ddls.com.au Microsoft 20466 - Implementing Data Models and Reports with Microsoft SQL Server Length 5 days Price $4070.00 (inc GST) Version C Overview The focus of this five-day
Data Warehousing and Data Mining in Business Applications
133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business
Methodology Framework for Analysis and Design of Business Intelligence Systems
Applied Mathematical Sciences, Vol. 7, 2013, no. 31, 1523-1528 HIKARI Ltd, www.m-hikari.com Methodology Framework for Analysis and Design of Business Intelligence Systems Martin Závodný Department of Information
Metadata Management for Data Warehouse Projects
Metadata Management for Data Warehouse Projects Stefano Cazzella Datamat S.p.A. [email protected] Abstract Metadata management has been identified as one of the major critical success factor
Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring
www.ijcsi.org 78 Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring Mohammed Mohammed 1 Mohammed Anad 2 Anwar Mzher 3 Ahmed Hasson 4 2 faculty
Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach
2006 ISMA Conference 1 Sizing Logical Data in a Data Warehouse A Consistent and Auditable Approach Priya Lobo CFPS Satyam Computer Services Ltd. 69, Railway Parallel Road, Kumarapark West, Bangalore 560020,
Implementing Data Models and Reports with Microsoft SQL Server 2012 MOC 10778
Implementing Data Models and Reports with Microsoft SQL Server 2012 MOC 10778 Course Outline Module 1: Introduction to Business Intelligence and Data Modeling This module provides an introduction to Business
An Intelligent Approach for Integrity of Heterogeneous and Distributed Databases Systems based on Mobile Agents
An Intelligent Approach for Integrity of Heterogeneous and Distributed Databases Systems based on Mobile Agents M. Anber and O. Badawy Department of Computer Engineering, Arab Academy for Science and Technology
What is Visualization? Information Visualization An Overview. Information Visualization. Definitions
What is Visualization? Information Visualization An Overview Jonathan I. Maletic, Ph.D. Computer Science Kent State University Visualize/Visualization: To form a mental image or vision of [some
Sales and Operations Planning in Company Supply Chain Based on Heuristics and Data Warehousing Technology
Sales and Operations Planning in Company Supply Chain Based on Heuristics and Data Warehousing Technology Jun-Zhong Wang 1 and Ping-Yu Hsu 2 1 Department of Business Administration, National Central University,
Implementing Data Models and Reports with Microsoft SQL Server
Course 20466C: Implementing Data Models and Reports with Microsoft SQL Server Course Details Course Outline Module 1: Introduction to Business Intelligence and Data Modeling As a SQL Server database professional,
OLAP Theory-English version
OLAP Theory-English version On-Line Analytical processing (Business Intelligence) [Ing.J.Skorkovský,CSc.] Department of corporate economy Agenda The Market Why OLAP (On-Line-Analytic-Processing Introduction
Oracle8i Spatial: Experiences with Extensible Databases
Oracle8i Spatial: Experiences with Extensible Databases Siva Ravada and Jayant Sharma Spatial Products Division Oracle Corporation One Oracle Drive Nashua NH-03062 {sravada,jsharma}@us.oracle.com 1 Introduction
1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing
1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing 2. What is a Data warehouse a. A database application
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria
Data Mining - Introduction
Data Mining - Introduction Peter Brezany Institut für Scientific Computing Universität Wien Tel. 4277 39425 Sprechstunde: Di, 13.00-14.00 Outline Business Intelligence and its components Knowledge discovery
SPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
Interactive Information Visualization of Trend Information
Interactive Information Visualization of Trend Information Yasufumi Takama Takashi Yamada Tokyo Metropolitan University 6-6 Asahigaoka, Hino, Tokyo 191-0065, Japan [email protected] Abstract This paper
Designing Business Intelligence Solutions with Microsoft SQL Server 2012
CÔNG TY CỔ PHẦN TRƯỜNG CNTT TÂN ĐỨC TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC LEARN MORE WITH LESS! Course 20467B: Designing Business Intelligence Solutions with Microsoft SQL Server 2012 Length: 5 Days
The Benefits of Data Modeling in Business Intelligence
WHITE PAPER: THE BENEFITS OF DATA MODELING IN BUSINESS INTELLIGENCE The Benefits of Data Modeling in Business Intelligence DECEMBER 2008 Table of Contents Executive Summary 1 SECTION 1 2 Introduction 2
Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data
INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are
Introduction to Data Mining
Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar Tan,Steinbach, Kumar Introduction to Data Mining 8/05/2005 1 What is data exploration? A preliminary
Data Mining: Concepts and Techniques. Jiawei Han. Micheline Kamber. Simon Fräser University К MORGAN KAUFMANN PUBLISHERS. AN IMPRINT OF Elsevier
Data Mining: Concepts and Techniques Jiawei Han Micheline Kamber Simon Fräser University К MORGAN KAUFMANN PUBLISHERS AN IMPRINT OF Elsevier Contents Foreword Preface xix vii Chapter I Introduction I I.
OLAP, Knowledge Discovery from Database, Social Security Fund, Oracle Warehouse Builder, Oracle Discoverer.
ABSTRACT Mohamed Salah GOUIDER 1, Amine FARHAT 2 BESTMOD Laboratory Institut Supérieur de Gestion 41, rue de la liberté, cite Bouchoucha Bardo, 2000, Tunis, TUNISIA [email protected] 1, [email protected]
Lection 3-4 WAREHOUSING
Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing
DATABASE MANAGEMENT SYSTEM
REVIEW ARTICLE DATABASE MANAGEMENT SYSTEM Sweta Singh Assistant Professor, Faculty of Management Studies, BHU, Varanasi, India E-mail: [email protected] ABSTRACT Today, more than at any previous
Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers
60 Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative Analysis of the Main Providers Business Intelligence. A Presentation of the Current Lead Solutions and a Comparative
2 Associating Facts with Time
TEMPORAL DATABASES Richard Thomas Snodgrass A temporal database (see Temporal Database) contains time-varying data. Time is an important aspect of all real-world phenomena. Events occur at specific points
BUSINESS INTELLIGENCE AS SUPPORT TO KNOWLEDGE MANAGEMENT
ISSN 1804-0519 (Print), ISSN 1804-0527 (Online) www.academicpublishingplatforms.com BUSINESS INTELLIGENCE AS SUPPORT TO KNOWLEDGE MANAGEMENT JELICA TRNINIĆ, JOVICA ĐURKOVIĆ, LAZAR RAKOVIĆ Faculty of Economics
