INFORMATION FILTERS SUPPLYING DATA WAREHOUSES WITH BENCHMARKING INFORMATION 1 Witold Abramowicz, 1. 2. 3. 4. 5. 6. 7. 8.



Similar documents
Data Mining: Concepts and Techniques. Jiawei Han. Micheline Kamber. Simon Fräser University К MORGAN KAUFMANN PUBLISHERS. AN IMPRINT OF Elsevier

Practical Applications of DATA MINING. Sang C Suh Texas A&M University Commerce JONES & BARTLETT LEARNING

Foundations of Business Intelligence: Databases and Information Management

Federico Rajola. Customer Relationship. Management in the. Financial Industry. Organizational Processes and. Technology Innovation.

Fluency With Information Technology CSE100/IMT100

POLAR IT SERVICES. Business Intelligence Project Methodology

The University of Jordan

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Schneps, Leila; Colmez, Coralie. Math on Trial : How Numbers Get Used and Abused in the Courtroom. New York, NY, USA: Basic Books, p i.

SQL Server 2012 Business Intelligence Boot Camp

An Overview of Knowledge Discovery Database and Data mining Techniques

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Foundations of Business Intelligence: Databases and Information Management

Data Mining. Concepts, Models, Methods, and Algorithms. 2nd Edition

Contents RELATIONAL DATABASES

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup

Database Marketing, Business Intelligence and Knowledge Discovery

Course MIS. Foundations of Business Intelligence

Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives

Building a Data Warehouse

Oracle Database 11g: Data Warehousing Fundamentals

Mining. Practical. Data. Monte F. Hancock, Jr. Chief Scientist, Celestech, Inc. CRC Press. Taylor & Francis Group

Introduction. A. Bellaachia Page: 1

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days

Data Mining. Vera Goebel. Department of Informatics, University of Oslo

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

Data Warehouse Design

Foundations of Business Intelligence: Databases and Information Management

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

Data Mart/Warehouse: Progress and Vision

An Overview of Database management System, Data warehousing and Data Mining

SOCIAL NETWORK DATA ANALYTICS

Chapter 20: Data Analysis

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

IT and CRM A basic CRM model Data source & gathering system Database system Data warehouse Information delivery system Information users

Introduction to Data Mining

A Review of Data Mining Techniques

Advanced Database Marketing Innovative Methodologies and Applications for Managing Customer Relationships

Chapter ML:XI. XI. Cluster Analysis

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

Contents. Dedication List of Figures List of Tables. Acknowledgments

Web Data Mining: A Case Study. Abstract. Introduction

Data Mining for Successful Healthcare Organizations

Oracle Database 11g: Administer a Data Warehouse

Cloud Computing. and Scheduling. Data-Intensive Computing. Frederic Magoules, Jie Pan, and Fei Teng SILKQH. CRC Press. Taylor & Francis Group

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD

Static Data Mining Algorithm with Progressive Approach for Mining Knowledge

Introduction to Data Mining

Principles of Data Mining by Hand&Mannila&Smyth

Dynamic Data in terms of Data Mining Streams

Data Mining as Part of Knowledge Discovery in Databases (KDD)

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

DATA MINING IN FINANCE

Software Development Training Camp 1 (0-3) Prerequisite : Program development skill enhancement camp, at least 48 person-hours.

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

How To Use Neural Networks In Data Mining

life science data mining

Data Mining Applications in Fund Raising

Master of Science in Health Information Technology Degree Curriculum

Development and Management

FIFTH EDITION. Oracle Essentials. Rick Greenwald, Robert Stackowiak, and. Jonathan Stern O'REILLY" Tokyo. Koln Sebastopol. Cambridge Farnham.

Data Mining and Neural Networks in Stata

Table of Contents. Testimonials from the MDM Alliance Group... Introduction to MDM...

Data Mining System, Functionalities and Applications: A Radical Review

Intrusion Detection. Jeffrey J.P. Tsai. Imperial College Press. A Machine Learning Approach. Zhenwei Yu. University of Illinois, Chicago, USA

Customer and Business Analytic

SPATIAL DATA CLASSIFICATION AND DATA MINING

Graphical Web based Tool for Generating Query from Star Schema

BIG DATA COURSE 1 DATA QUALITY STRATEGIES - CUSTOMIZED TRAINING OUTLINE. Prepared by:

Inner Classification of Clusters for Online News

Data Mining. 1 Introduction 2 Data Mining methods. Alfred Holl Data Mining 1

Experiments in Web Page Classification for Semantic Web

Course 6234A: Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services

Business Architecture

Course Syllabus For Operations Management. Management Information Systems

Welcome. Data Mining: Updates in Technologies. Xindong Wu. Colorado School of Mines Golden, Colorado 80401, USA

OLAP Theory-English version

Data Warehousing in the Age of Big Data

The basic data mining algorithms introduced may be enhanced in a number of ways.

Business Information Systems and Technology

A Partially Supervised Metric Multidimensional Scaling Algorithm for Textual Data Visualization

Data Warehousing and Data Mining in Business Applications

Computer-Aided Multivariate Analysis

DSS based on Data Warehouse

META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING

3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools

Data Vault and The Truth about the Enterprise Data Warehouse

Nagarjuna College Of

PartJoin: An Efficient Storage and Query Execution for Data Warehouses

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Transcription:

Contents PREFACE FOREWORD xi xiii LIST OF CONTRIBUTORS xv Chapter 1 INFORMATION FILTERS SUPPLYING DATA WAREHOUSES WITH BENCHMARKING INFORMATION 1 Witold Abramowicz, 1 Data Warehouses 2 The HyperSDI System 4 User Profiles in the HyperSDI System 11 Building Data Warehouse Profiles 11 Techniques for Improving Profiles 18 Implementation Notes 22 25 Chapter 2 PARALLEL MINING OF ASSOCIATION RULES David Cheung, Sau Dan Lee Parallel Mining of Association Rules Pruning Techniques and The FPM Algorithm Metrics for Data Skewness and Workload Balance Partitioning of the Database Experimental Evaluation of the Partitioning Algorithms Discussions 29 29 32 33 39 48 56 62 64 65

vi Chapter 3 UNSUPERVISED FEATURE RANKING AND SELECTION Manoranjan Dash, Huan Liu, Jun Yao Basic Concepts and Possible Approaches An Entropy Measure for Continuous and Nominal Data Types Algorithm to Find Important Variables Experimental Studies Clustering Using SUD Discussion and Conclusion CONTENTS 67 67 69 72 75 76 80 82 84 Chapter 4 APPROACHES TO CONCEPT BASED EXPLORATION OF INFORMATION RESOURCES Hele-Mai Haav, Jørgen Fischer Nilsson Conceptual Taxonomies Ontology Driven Concept Retrieval Search based on formal concept analysis Conclusion Acknowledgements 89 89 91 99 104 109 109 109 Chapter 5 HYBRID METHODOLOGY OF KNOWLEDGE DISCOVERY FOR BUSINESS INFORMATION Hippe Present Status of Data Mining Experiments with Mining Regularities from Data Discussion Acknowledgements 111 111 113 118 125 126 126 Chapter 6 FUZZY LINGUISTIC SUMMARIES OF DATABASES FOR AN EFFICIENT BUSINESS DATA ANALYSIS AND DECISION SUPPORT Janusz Kacprzyk, Ronald R. Yager and Idea of Linguistic Summaries Using Fuzzy Logic with Linguistic Quantifiers On Other Validity Criteria Derivation of Linguistic Summaries via a Fuzzy Logic Based Database Querying Interface Implementation for a Sales Database at a Computer Retailer 129 129 131 134 140 147

CONTENTS vii Concluding Remarks 150 150 Chapter 7 INTEGRATING DATA SOURCES USING A STANDARDIZED GLOBAL DICTIONARY Ramon Lawrence and Ken Barker 9. Data Semantics and the Integration Problem Previous work The Integration Architecture The Global Dictionary The Relational Integration Model Special Cases of Integration Applications to the WWW Future Work and 153 154 154 156 157 160 164 169 171 171 172 Chapter 8 MAINTENANCE OF DISCOVERED ASSOCIATION RULES Sau Dan Lee, David Cheung Problem Description The FUP Algorithm for the Insertion Only Case The FUP Algorithm for the Deletions Only Case The FUP 2 Algorithm for the General Case Performance Studies Discussions Notes 173 173 176 179 183 189 194 204 208 209 209 Chapter 9 MULTIDIMENSIONAL BUSINESS PROCESS ANALYSIS WITH THE PROCESS WAREHOUSE Beate List, Josef Schiefer, A Min Tjoa, Gerald Quirchmayr Related Work Goals of the Data Warehouse Approach Data Source Basic Process Warehouse Components Representing Business Process Analysis Requirements Data Model and Analysis Capabilities Conclusion and Further Research 211 211 213 215 216 216 219 225 225

viii CONTENTS Chapter 10 AMALGAMATION OF STATISTICS AND DATA MINING TECHNIQUES: EXPLORATIONS IN CUSTOMER LIFETIME VALUE MODELING D. R. Mani, James Drew, Andrew Betz and Piew Datta Statistics and Data Mining Techniques: A Characterization Lifetime Value (LTV) Modeling Customer Data for LTV Tenure Prediction Classical Statistical Approaches to Survival Analysis Neural Networks for Survival Analysis From Data Models to Business Insight Conclusion: The Amalgamation of Statistical and Data Mining Techniques 229 229 231 232 234 235 239 244 247 249 Chapter 11 Chapter 12 ROBUST BUSINESS INTELLIGENCE SOLUTIONS Jan Mrazek Business Intelligence Architecture Data Transformation Data Modelling Integration Of Data Mining Conclusion THE ROLE OF GRANULAR INFORMATION IN KNOWLEDGE DISCOVERY IN DATABASES Witold Pedrycz Granulation of information The development of data-justifiable information granules Building associations in databases From associations to rules in databases The construction of rules in data mining Properties of rules induced by associations Detailed computations of the consistency of rules and its analysis 9. Acknowledgment 251 251 252 258 260 268 272 273 275 276 277 284 287 292 293 298 300 303 304 304

CONTENTS Chapter 13 DEALING WITH DIMENSIONS IN DATA WAREHOUSING Jaroslav Pokorny DW Modelling with Tables Dimensions Constellations Dimension Hierarchies with ISA-hierarchies 307 308 310 311 314 315 323 324 ix Chapter 14 ENHANCING THE KDD PROCESS IN THE RELATIONAL DATABASE MINING FRAMEWORK BY QUANTITATIVE EVALUATION OF ASSOCIATION RULES Giuseppe Psaila The Relational Database Mining Framework The Evaluate Rule Operator Enhancing the Knowledge Discovery Process and Future Work Notes 325 325 327 331 344 348 349 349 Chapter 15 SPEEDING UP HYPOTHESIS DEVELOPMENT Jörg A. Schlösser, Peter C. Lockemann, Matthias Gimbel Information Model The Execution Architecture of CITRUS Searching the Information Directory Documentation of the Process History Linking the Information Model with the Relational Model Generation of SQL Queries Automatic Materialization of Intermediate Results 9. Experimental Results 10. Utilizing Past Experience 1 Related Work 1 Concluding Remarks 351 352 354 356 359 361 362 364 368 369 371 372 374 374

x Chapter 16 CONTENTS SEQUENCE MINING IN DYNAMIC AND INTERACTIVE ENVIRONMENTS 377 Srinivasan Parthasarathy, Mohammed J. Zaki, Mitsunori Ogihara, Sandhya Dwarkadas Problem Formulation The SPADE Algorithm Incremental Mining Algorithm Interactive Sequence Mining Experimental Evaluation Related Work Acknowledgements 378 379 382 384 388 390 394 395 395 395 Chapter 17 INVESTIGATION OF ARTIFICIAL NEURAL NETWORKS FOR CLASSIFYING LEVELS OF FINANCIAL DISTRESS OF FIRMS: THE CASE OF AN UNBALANCED TRAINING SAMPLE Jozef Zurada, Benjamin P. Foster, Terry J. Ward Motivation and Literature Review Logit Regression, Neural Network, and Principal Component Analysis Fundamentals Research Methodology Discussion of the Results and Future Research Directions Appendix - Neural Network Toolbox 397 398 399 402 409 415 420 421 423 INDEX 425