Data Warehousing and Data Mining Introduction
|
|
|
- Randell Russell Griffin
- 10 years ago
- Views:
Transcription
1 Data Warehousing and Data Mining Introduction General introduction to DWDM Business intelligence OLTP vs. OLAP Data integration Methodological framework DW definition Acknowledgements: I am indebted to Michael Böhlen and Stefano Rizzi for providing me their slides, upon which these lecture notes are based. J. Gamper, Free University of Bolzano, DWDM 2012/13
2 The Big Picture of DWDM/1 What's important for researchers: Algorithms and Theory Systems J. Gamper, Free University of Bolzano, DWDM 2012/13 2
3 The Big Picture of DWDM/2 What's important for real world applications: Systems (and System Integration) Database Customer Algorithms J. Gamper, Free University of Bolzano, DWDM 2012/13 3
4 The Big Picture of DWDM/3 What's important for businesses: Algorithms $$$ Database Customer Systems J. Gamper, Free University of Bolzano, DWDM 2012/13 4
5 Computer Science and Decision Making An exponential increase in operational data has made computers the only tools suitable for providing data for decision-making performed by business managers. The massive use of techniques for analyzing enterprise data made information systems a key factor to achieve business goals. J. Gamper, Free University of Bolzano, DWDM 2012/13 5
6 Remarks about the DW part We learn how to design, build, and use a data warehouse. Relevance to the real world is an important guideline. Not only/mainly crisp algorithms, theorems, etc. We will look at a number of concrete and important case studies. A good way to prepare and learn the subject is to participate to lectures. J. Gamper, Free University of Bolzano, DWDM 2012/13 6
7 Content of the DW Part 1) Data warehousing: business intelligence, data integration, data warehouse, facts, dimensions, DW design 2) SQL OLAP extensions: analytical functions, crosstab, group by extensions, hierarchical cube, moving windows 3) Generalized multi-dimensional join: GMDJ, evaluation, subqueries, optimization rules, distributed evaluation 4) DW performance: pre-aggregation, lattice framework, view selection, view maintenance, bitmap indexing 5) ETL and advanced modeling: ETL process, handling changes in dimensions J. Gamper, Free University of Bolzano, DWDM 2012/13 7
8 What is Business Intelligence?/1 BI is a set of processes, tools, and technologies to transform business data into timely and accurate information to support decisional processes Data Warehousing (DW) On-Line Analytical Processing (OLAP) Data Mining (DM) and Data Visualization (VIS) Decision Analysis (what-if) Customer Relationship Management (CRM) BI systems are used by decision makers to get a comprehensive knowledge of the business and to define and support their business strategies. The goal is to enable data-based decisions aimed at gaining competitive advantage, improving operative performance, responding more quickly to changes, increasing profitability, and, in general, creating added value for the company. J. Gamper, Free University of Bolzano, DWDM 2012/13 8
9 What is Business Intelligence?/2 BI is the opposite of Artificial Intelligence (AI) AI systems make decisions for the users BI systems help users make the right decisions, based on the available data Many BI techniques have roots in AI, though. J. Gamper, Free University of Bolzano, DWDM 2012/13 9
10 The BI Pyramid decisions WHAT-IF ANALYSIS simulation models DATA MINING learning models knowledge INFORMATION EXPLORATION statistical techniques OLAP ANALYSIS data warehouse information OPERATIONAL APPLICATIONS data sources data J. Gamper, Free University of Bolzano, DWDM 2012/13 10
11 Example BI Queries Q1: On October 11, 2000, find the 5 top-selling products for each product subcategory that contributes more than 20% of the sales within its product category. Q2: As of March 15, 1995, determine shipping priority and potential gross revenue of the orders that have the 10 largest gross revenues among the orders that had not yet been shipped. Consider orders from the book market segment only. Regular database models and systems are not suitable for this type of queries. J. Gamper, Free University of Bolzano, DWDM 2012/13 11
12 BI is Crucial and Growing/1 Meta Group: DW alone = $15 Bio. in 2000 Palo Alto Management Group: BI = $113 Bio. in 2002 The Web made BI more necessary: Thus: Customers do not appear physically in the store Customers can change to other stores more easily You have to know your customers using data and BI. Web logs make is possible to analyze customer behavior in more detailed than before (what was not bought?) Combine web data with traditional customer data Wireless Internet adds further to this: Customers are always online Customer s position is known Combine position and knowledge about customer => very valuable J. Gamper, Free University of Bolzano, DWDM 2012/13 12
13 BI is Crucial and Growing/2 Gartner, 2009: Organizations will expect IT leaders in charge of BI and performance management initiatives to help transform and significantly improve their business Because of lack of information, processes, and tools, through 2012, more than 35% of the top 5,000 global companies will regularly fail to make insightful decisions about significant changes in their business and markets. By 2010, 20% of organizations will have an industryspecific analytic application delivered via software as a standard service of their business intelligence portfolio. In 2009, collaborative decision making will emerge as a new product category that combines social software with business intelligence platform capabilities. S. Chaudhuri, U. Dayal, V. Narasayya, CACM 2011: Today, it is difficult to find a successful enterprise that has not leveraged BI technology for their business. J. Gamper, Free University of Bolzano, DWDM 2012/13 13
14 BI: Key Problems 1) Complex and unusable models Many models are difficult to understand models do not focus on a single clear business purpose 2) Same data found in many different systems Example: customer data in many different systems The same concept is defined differently 3) Data is suited for operational systems Accounting, billing, etc. Do not support analysis across business functions 4) Data quality is bad Missing data, imprecise data, different use of systems 5) Data are volatile Data deleted in operational systems (6 months) Data change over time no historical information J. Gamper, Free University of Bolzano, DWDM 2012/13 14
15 BI: Solution A new analysis environment with a data warehouse at the core, where data is Integrated (logically and physically) Subject oriented (versus function oriented) Supporting management decisions (different organization) Stable (data is not deleted, several versions) Time variant (data can always be related to time) J. Gamper, Free University of Bolzano, DWDM 2012/13 15
16 Definition of a Data Warehouse/1 Barry Devlin, IBM Consultant A data warehouse is simply a single, complete, and consistent store of data obtained from a variety of sources and made available to end users in a way they can understand and use it in a business context. J. Gamper, Free University of Bolzano, DWDM 2012/13 16
17 Definition of a Data Warehouse/2 W. H. Inmon, Building the Data Warehouse A data warehouse is a - subject-oriented, - integrated, - time-varying, - non-volatile collection of data that is used primarily in - organizational decision making. J. Gamper, Free University of Bolzano, DWDM 2012/13 17
18 DW Architecture/1 Basic elements of a Data Warehouse environment J. Gamper, Free University of Bolzano, DWDM 2012/13 18
19 DW Architecture/2 Source layer EXTRACTION, TRANSFORMATION, AND LOADING: Data marts Operational data External data Loading Reporting tools Operational Data Store ETL tools Data Warehouse OLAP tools Data mining tools What-if analysis tools Data staging Reconciled layer Data warehouse layer Analysis ETL processes extract data from sources, transform and clean them, and finally load them in the ODS and in the data warehouse OPERATIONAL DATA STORE: Operational data obtained after integrating and cleansing source data. As a result, those data are integrated, consistent, appropriate, current, and detailed DATA MART: A subset or an aggregation of the data stored to a primary data warehouse. It includes a set of information pieces relevant to a specific business area, corporate department, or category of users J. Gamper, Free University of Bolzano, DWDM 2012/13 19
20 Data Integration Problem: Different interfaces Different data representations Duplicated information Inconsistent information Integrated System Goal: Collect and combine information Provide an integrated view Provide a uniform user interface Support sharing of data J. Gamper, Free University of Bolzano, DWDM 2012/13 20
21 Query-Driven Data Integration Data is integrated on demand (lazy) PROS CONS Access to most up-to-date data (all source data directly available) No duplication of data Delay in query processing Slow (or currently unavailable) information sources Complex filtering and integration Inefficient and expensive for frequent queries Competes with local processing at sources Data loss at the sources (e.g., historical data) cannot be recovered Has not caught on in industry J. Gamper, Free University of Bolzano, DWDM 2012/13 21
22 Warehouse-Driven Data Integration Data is integrated in advance (eager) Data is stored in DW for querying and analysis PROS CONS High query performance Does not interfere with local processing at sources Assumes that data warehouse update is possible during downtime of local processing Complex queries are run at the data warehouse OLTP queries are run at the source systems Duplication of data The most current source data is not available Has caught on in industry J. Gamper, Free University of Bolzano, DWDM 2012/13 22
23 OLTP versus OLAP/1 On-Line Transaction Processing (OLTP) Many small queries on a small number of tuples from many tables that need to be joined Frequent updates The system is always available for both updates and reads Smaller data volume (few historical data) Complex data model (normalized) On-Line Analytical Processing (OLAP) Fewer, but bigger queries that typically need to scan a huge amount of records and doing some aggregation Frequent reads, in-frequent updates (daily, weekly) 2-phase operation: either reading or updating Larger data volumes (collection of historical data) Simple data model (multidimensional/de-normalized) J. Gamper, Free University of Bolzano, DWDM 2012/13 23
24 OLTP versus OLAP/2 A mix of analytical queries (OLAP) with transactional routine queries (OLTP) inevitably slows down the system, and this does not meet the needs of users of both types of queries. Separate OLAP from OLTP by creating a new repository that integrates data from various sources and then makes data available for analysis and evaluation aimed at decision-making processes J. Gamper, Free University of Bolzano, DWDM 2012/13 24
25 OLTP versus OLAP/3 Existing databases and systems (OLTP) New databases and systems (OLAP) DM OLAP DM Data mining Trans. DW Global Data Warehouse Data Marts DM Visualization J. Gamper, Free University of Bolzano, DWDM 2012/13 25
26 OLTP versus OLAP/4 Function-oriented Systems (OLTP) Subject-oriented Systems (OLAP) DM DM Trans. All subjects, integrated DW Selected subjects DM J. Gamper, Free University of Bolzano, DWDM 2012/13 26
27 OLTP Example: CS Dept/1 1, 2, 3, 4 hostgrpnms hostgrpnmid hostgrpnmn hostgrpnm_reason hstgrn_crtdato hstgrn_expdato uids uidid ugid_id_uid idcat_id_uid hostgrp_id_uid uidcrt_data uidexp_dato idcats idcatid idcat ugids ugidid ugid hostgrps hostgrpid fqdn_id_hgr hstgrnm_id_hgr users userid name_id_usr home disklimit userstat_id_usr uid_id_usr hostgrp_id_usr user_crtdato user_expdato userstats userstatid userstat names nameid osname name_crtdato name_expdato fqdns fqdnid fqdn fqdn_crtdato fqdn_expdato pgecos pgecoid person_id_pge user_id_pge personsgroups personsgroupid person_id_prs sgroup_id_prs semester_id_prs pstatus pstatusid prsnstatus persons personid name firstname homeaddress home homephone person_crtdato pstatus_id_ps personwrkgroups personwrkgroupid person_id_prw wrkgroup_id_prw wrkgroups wrkgroupid wrkgroup persinfs persinfid persinf person_id_pinf persinf_crtdato persinf_expdato employees employeeid person_id_emp position_id_emp, initials empl_stdato empl_expdato uguests uguestid person_id_ugu ughost uguest_crtdato uguest_exp J. Gamper, Free University of Bolzano, DWDM 2012/13 27
28 OLTP Example: CS Dept/2 semesters semesterid semester sgroupsems sgroupsemid sgroup_id_sgs semester_id_sgs sgroups sgroupid sgroup sgroup_crtdato sgroup_expdato supervisor sgrplocs sgrplocid room_id_sgl sgr_id_sgl positioncats positioncatid positioncat emplocations emplocationid employee_id_elo room_id_elo rooms roomid roomname roomalias roomcat_id_rom roomcats roomcatid roomcat positions positionid position eng_position phonelocs phonelocid room_id_phl phonenr_id_phl employees employeeid person_id_emp position_id_emp, initials empl_stdato empl_expdato phones phoneid employee_id_pho phonenr_id_pho phonenrs phonenrid phonenr phonecat_id_phn owner_id_phn statusempls statusemplid statusempl persons personid name firstname homeaddress home homephone person_crtdato pstatus_id_ps phonecats phonecatid phonecat owners ownerid owner J. Gamper, Free University of Bolzano, DWDM 2012/13 28
29 OLTP Example: OncoNet OncoNet is a system for the management of patients undergoing a cancer therapy > 200 tables Well-suited for daily management of patients But: statistical analysis are expensive takes up to 12 hours tables are locked for that time run queries over weekend A DW approach reduced the runtime of the same queries to a few seconds (BSc-thesis of A. Heinisch) J. Gamper, Free University of Bolzano, DWDM 2012/13 29
30 Methodological/Design Framework Building a DW is a very complex task, which requires an accurate planning aimed at devising satisfactory answers to organizational and architectural questions. A large number of organizations lack experience and skills that are required to meet the challenges involved in DW projects. The reports of DW project failures state that a major cause lies in the absence of a global view of the design process: in other terms, in the absence of a design methodology. Methodologies are created by closely studying similar experiences and minimizing the risks for failure by basing new approaches on a constructive analysis of the mistakes made previously. J. Gamper, Free University of Bolzano, DWDM 2012/13 30
31 Many Ways not to Do/1 Trans. Trans. DM DM App App Trans. DM App J. Gamper, Free University of Bolzano, DWDM 2012/13 31
32 Many Ways not to Do/2 Trans. DM D- DM Trans. DW DM DM J. Gamper, Free University of Bolzano, DWDM 2012/13 32
33 Top-down Approach Analyze global business needs, plan how to develop a data warehouse, design it, and implement it as a whole This procedure is promising: it is based on a global picture of the goal to achieve, and in principle it ensures consistent, well integrated data warehouses. High-cost estimates with long-term implementations discourage company managers from embarking on these kind of projects. Analyzing and integrating all relevant sources at the same time is a very difficult task, even because it is not very likely that they are all available and stable at the same time. It is extremely difficult to forecast the specific needs of every department involved in a project, which can result in the analysis process coming to a standstill. Since no working system is going to be delivered in the short term, users cannot check for this project to be useful, so they lose trust and interest in it. J. Gamper, Free University of Bolzano, DWDM 2012/13 33
34 Bottom-up Approach DWs are incrementally built and several data marts are iteratively created. Each data mart is based on a set of facts that are linked to a specific department and that can be interesting for a user group Leads to concrete results in a short time Does not require huge investments Enables designers to investigate one area at a time Gives managers a quick feedback about the actual benefits of the system being built Keeps the interest for the project constantly high May determine a partial vision of the business domain J. Gamper, Free University of Bolzano, DWDM 2012/13 34
35 Top-down vs. Bottom-up Approach DM DM Trans. DW Top-down: 1. Design of DW 2. Design of DMs In-between : 1. Design of DW for DM1 2. Design of DM2 and integration with DW 3. Design of DM3 and integration with DW J. Gamper, Free University of Bolzano, DWDM 2012/13 35 DM Bottom-up: 1. Design of DMs 2. Integration of DMs in DW 3. Maybe no physical DW
36 Top-down vs. Bottom-up Approach DM DM Trans. DW Top-down: 1. Design of DW 2. Design of DMs In-between : 1. Design of DW for DM1 2. Design of DM2 and integration with DW 3. Design of DM3 and integration with DW J. Gamper, Free University of Bolzano, DWDM 2012/13 36 DM Bottom-up: 1. Design of DMs 2. Integration of DMs in DW 3. Maybe no physical DW
37 The Life-cycle/1 Goal setting and planning Infrastructure design set system goals, borders, and size select an approach for design and implementation estimate costs and benefits analyze risks and expectations examine the skills of the working team Design and developm. of data marts J. Gamper, Free University of Bolzano, DWDM 2012/13 37
38 The Life-cycle/2 Goal setting and planning Infrastructure design Design and developm. of data marts analyze and compare the possible architectural solutions assess the available technologies and tools create a preliminary plan of the whole system J. Gamper, Free University of Bolzano, DWDM 2012/13 38
39 The Life-cycle/3 Goal setting and planning Infrastructure design Design and developm. of data marts Every iteration causes a new data mart and new applications to be created and progressively added to the DW system J. Gamper, Free University of Bolzano, DWDM 2012/13 39
40 Data mart design phases Source analysis and integration db administrator Requirement analysis business user Conceptual design Workload and data volume Logical design designer ETL design Physical design J. Gamper, Free University of Bolzano, DWDM 2012/13 40
41 The First Data Mart is the one playing the most strategic role for the enterprise should be a backbone for the whole DW should lean on available and consistent data sources DM5 DM2 DM1 DM3 DM4 Source 3 Source 1 Source 2 J. Gamper, Free University of Bolzano, DWDM 2012/13 41
42 Summary BI is well-recognized and is a combination of a number of techniques to support decision making. DW is at the core of BI that provides a complete, consistent, subject-oriented and time-varying collection of the data; allows to separte OLTP from OLAP. Applications that use the DW include OLAP, data mining, visualization BI can provide many advantages to an organization Creates added value by transforming data into information Provides comprehensive knowledge about your business A good DW is a prerequisite for BI But, a DW is a means rather than a goal it is only a success if it is heavily used Following a clear design methodology is important. J. Gamper, Free University of Bolzano, DWDM 2012/13 42
Advanced Data Management Technologies
ADMT 2015/16 Unit 2 J. Gamper 1/44 Advanced Data Management Technologies Unit 2 Basic Concepts of BI and Data Warehousing J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Acknowledgements:
Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University
Bussiness Intelligence and Data Warehouse Schedule Bussiness Intelligence (BI) BI tools Oracle vs. Microsoft Data warehouse History Tools Oracle vs. Others Discussion Business Intelligence (BI) Products
Data Warehousing and Data Mining
Data Warehousing and Data Mining Part I: Data Warehousing Gao Cong [email protected] Slides adapted from Man Lung Yiu and Torben Bach Pedersen Course Structure Business intelligence: Extract knowledge
Data Warehousing. Overview, Terminology, and Research Issues. Joachim Hammer. Joachim Hammer
Data Warehousing Overview, Terminology, and Research Issues 1 Heterogeneous Database Integration Integration System World Wide Web Digital Libraries Scientific Databases Personal Databases Collects and
Overview. DW Source Integration, Tools, and Architecture. End User Applications (EUA) EUA Concepts. DW Front End Tools. Source Integration
DW Source Integration, Tools, and Architecture Overview DW Front End Tools Source Integration DW architecture Original slides were written by Torben Bach Pedersen Aalborg University 2007 - DWML course
Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006
Enterprise Data Warehouse (EDW) UC Berkeley Peter Cava Manager Data Warehouse Services October 5, 2006 What is a Data Warehouse? A data warehouse is a subject-oriented, integrated, time-varying, non-volatile
Lection 3-4 WAREHOUSING
Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing
MDM and Data Warehousing Complement Each Other
Master Management MDM and Warehousing Complement Each Other Greater business value from both 2011 IBM Corporation Executive Summary Master Management (MDM) and Warehousing (DW) complement each other There
An Introduction to Data Warehousing. An organization manages information in two dominant forms: operational systems of
An Introduction to Data Warehousing An organization manages information in two dominant forms: operational systems of record and data warehouses. Operational systems are designed to support online transaction
Data Warehouse: Introduction
Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,
OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA
OLAP and OLTP AMIT KUMAR BINDAL Associate Professor Databases Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age Information, which is created by data,
Enterprise Solutions. Data Warehouse & Business Intelligence Chapter-8
Enterprise Solutions Data Warehouse & Business Intelligence Chapter-8 Learning Objectives Concepts of Data Warehouse Business Intelligence, Analytics & Big Data Tools for DWH & BI Concepts of Data Warehouse
Data Warehousing and Data Mining in Business Applications
133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business
Introduction to Datawarehousing
DIPARTIMENTO DI INGEGNERIA INFORMATICA AUTOMATICA E GESTIONALE ANTONIO RUBERTI Master of Science in Engineering in Computer Science (MSE-CS) Seminars in Software and Services for the Information Society
Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1
Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics
A Survey on Data Warehouse Architecture
A Survey on Data Warehouse Architecture Rajiv Senapati 1, D.Anil Kumar 2 1 Assistant Professor, Department of IT, G.I.E.T, Gunupur, India 2 Associate Professor, Department of CSE, G.I.E.T, Gunupur, India
Data Warehousing Systems: Foundations and Architectures
Data Warehousing Systems: Foundations and Architectures Il-Yeol Song Drexel University, http://www.ischool.drexel.edu/faculty/song/ SYNONYMS None DEFINITION A data warehouse (DW) is an integrated repository
Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence
Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Appliances and DW Architectures John O Brien President and Executive Architect Zukeran Technologies 1 TDWI 1 Agenda What
[callout: no organization can afford to deny itself the power of business intelligence ]
Publication: Telephony Author: Douglas Hackney Headline: Applied Business Intelligence [callout: no organization can afford to deny itself the power of business intelligence ] [begin copy] 1 Business Intelligence
IST722 Data Warehousing
IST722 Data Warehousing Components of the Data Warehouse Michael A. Fudge, Jr. Recall: Inmon s CIF The CIF is a reference architecture Understanding the Diagram The CIF is a reference architecture CIF
Data Warehousing and Decision Support. Torben Bach Pedersen Department of Computer Science Aalborg University
Data Warehousing and Decision Support Torben Bach Pedersen Department of Computer Science Aalborg University Talk Overview Data warehousing and decision support basics Definition Applications Multidimensional
B.Sc (Computer Science) Database Management Systems UNIT-V
1 B.Sc (Computer Science) Database Management Systems UNIT-V Business Intelligence? Business intelligence is a term used to describe a comprehensive cohesive and integrated set of tools and process used
Business Intelligence: Effective Decision Making
Business Intelligence: Effective Decision Making Bellevue College Linda Rumans IT Instructor, Business Division Bellevue College [email protected] Current Status What do I do??? How do I increase
Data Mart/Warehouse: Progress and Vision
Data Mart/Warehouse: Progress and Vision Institutional Research and Planning University Information Systems What is data warehousing? A data warehouse: is a single place that contains complete, accurate
CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University
CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University Given today s business environment, at times a corporate executive
DATA WAREHOUSING AND OLAP TECHNOLOGY
DATA WAREHOUSING AND OLAP TECHNOLOGY Manya Sethi MCA Final Year Amity University, Uttar Pradesh Under Guidance of Ms. Shruti Nagpal Abstract DATA WAREHOUSING and Online Analytical Processing (OLAP) are
Jagir Singh, Greeshma, P Singh University of Northern Virginia. Abstract
224 Business Intelligence Journal July DATA WAREHOUSING Ofori Boateng, PhD Professor, University of Northern Virginia BMGT531 1900- SU 2011 Business Intelligence Project Jagir Singh, Greeshma, P Singh
Turkish Journal of Engineering, Science and Technology
Turkish Journal of Engineering, Science and Technology 03 (2014) 106-110 Turkish Journal of Engineering, Science and Technology journal homepage: www.tujest.com Integrating Data Warehouse with OLAP Server
www.ijreat.org Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 28
Data Warehousing - Essential Element To Support Decision- Making Process In Industries Ashima Bhasin 1, Mr Manoj Kumar 2 1 Computer Science Engineering Department, 2 Associate Professor, CSE Abstract SGT
OLAP. Business Intelligence OLAP definition & application Multidimensional data representation
OLAP Business Intelligence OLAP definition & application Multidimensional data representation 1 Business Intelligence Accompanying the growth in data warehousing is an ever-increasing demand by users for
Data Mining for Successful Healthcare Organizations
Data Mining for Successful Healthcare Organizations For successful healthcare organizations, it is important to empower the management and staff with data warehousing-based critical thinking and knowledge
Data Warehousing. Jens Teubner, TU Dortmund [email protected]. Winter 2015/16. Jens Teubner Data Warehousing Winter 2015/16 1
Jens Teubner Data Warehousing Winter 2015/16 1 Data Warehousing Jens Teubner, TU Dortmund [email protected] Winter 2015/16 Jens Teubner Data Warehousing Winter 2015/16 13 Part II Overview
Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya
Chapter 6 Basics of Data Integration Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. Concepts of data integration 2. Needs and advantages of using data
Master Data Management and Data Warehousing. Zahra Mansoori
Master Data Management and Data Warehousing Zahra Mansoori 1 1. Preference 2 IT landscape growth IT landscapes have grown into complex arrays of different systems, applications, and technologies over the
Part 22. Data Warehousing
Part 22 Data Warehousing The Decision Support System (DSS) Tools to assist decision-making Used at all levels in the organization Sometimes focused on a single area Sometimes focused on a single problem
Data warehouse Architectures and processes
Database and data mining group, Data warehouse Architectures and processes DATA WAREHOUSE: ARCHITECTURES AND PROCESSES - 1 Database and data mining group, Data warehouse architectures Separation between
Whitepaper. Data Warehouse/BI Testing Offering YOUR SUCCESS IS OUR FOCUS. Published on: January 2009 Author: BIBA PRACTICE
YOUR SUCCESS IS OUR FOCUS Whitepaper Published on: January 2009 Author: BIBA PRACTICE 2009 Hexaware Technologies. All rights reserved. Table of Contents 1. 2. Data Warehouse - Typical pain points 3. Hexaware
Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing
CHAPTER SIX DATA. Business Intelligence. 2011 The McGraw-Hill Companies, All Rights Reserved
CHAPTER SIX DATA Business Intelligence 2011 The McGraw-Hill Companies, All Rights Reserved 2 CHAPTER OVERVIEW SECTION 6.1 Data, Information, Databases The Business Benefits of High-Quality Information
Data Warehousing and OLAP
1 Data Warehousing and OLAP Hector Garcia-Molina Stanford University Warehousing Growing industry: $8 billion in 1998 Range from desktop to huge: Walmart: 900-CPU, 2,700 disk, 23TB Teradata system Lots
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria
Data Warehouse Design
Data Warehouse Design Modern Principles and Methodologies Matteo Golfarelli Stefano Rizzi Translated by Claudio Pagliarani Mc Grauu Hill New York Chicago San Francisco Lisbon London Madrid Mexico City
Understanding Data Warehousing. [by Alex Kriegel]
Understanding Data Warehousing 2008 [by Alex Kriegel] Things to Discuss Who Needs a Data Warehouse? OLTP vs. Data Warehouse Business Intelligence Industrial Landscape Which Data Warehouse: Bill Inmon vs.
<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server
Extending Hyperion BI with the Oracle BI Server Mark Ostroff Sr. BI Solutions Consultant Agenda Hyperion BI versus Hyperion BI with OBI Server Benefits of using Hyperion BI with the
8. Business Intelligence Reference Architectures and Patterns
8. Business Intelligence Reference Architectures and Patterns Winter Semester 2008 / 2009 Prof. Dr. Bernhard Humm Darmstadt University of Applied Sciences Department of Computer Science 1 Prof. Dr. Bernhard
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization
Outline Business Intelligence Roadmap: The Complete Project Lifecycle for Decision-Support Applications
Outline Business Intelligence Roadmap: The Complete Project Lifecycle for Decision-Support Applications Introduction to the BI Roadmap Business Intelligence Framework DW role in BI From Chaos to Architecture
Foundations of Business Intelligence: Databases and Information Management
Foundations of Business Intelligence: Databases and Information Management Problem: HP s numerous systems unable to deliver the information needed for a complete picture of business operations, lack of
OLAP Theory-English version
OLAP Theory-English version On-Line Analytical processing (Business Intelligence) [Ing.J.Skorkovský,CSc.] Department of corporate economy Agenda The Market Why OLAP (On-Line-Analytic-Processing Introduction
Fluency With Information Technology CSE100/IMT100
Fluency With Information Technology CSE100/IMT100 ),7 Larry Snyder & Mel Oyler, Instructors Ariel Kemp, Isaac Kunen, Gerome Miklau & Sean Squires, Teaching Assistants University of Washington, Autumn 1999
DATA WAREHOUSE CONCEPTS DATA WAREHOUSE DEFINITIONS
DATA WAREHOUSE CONCEPTS A fundamental concept of a data warehouse is the distinction between data and information. Data is composed of observable and recordable facts that are often found in operational
Week 13: Data Warehousing. Warehousing
1 Week 13: Data Warehousing Warehousing Growing industry: $8 billion in 1998 Range from desktop to huge: Walmart: 900-CPU, 2,700 disk, 23TB Teradata system Lots of buzzwords, hype slice & dice, rollup,
An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies
An Overview of Data Warehousing, Data mining, OLAP and OLTP Technologies Ashish Gahlot, Manoj Yadav Dronacharya college of engineering Farrukhnagar, Gurgaon,Haryana Abstract- Data warehousing, Data Mining,
Data Warehousing and OLAP Technology for Knowledge Discovery
542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories
Chapter 3 - Data Replication and Materialized Integration
Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 [email protected] Chapter 3 - Data Replication and Materialized Integration Motivation Replication:
A Comparative Study on Operational Database, Data Warehouse and Hadoop File System T.Jalaja 1, M.Shailaja 2
RESEARCH ARTICLE A Comparative Study on Operational base, Warehouse Hadoop File System T.Jalaja 1, M.Shailaja 2 1,2 (Department of Computer Science, Osmania University/Vasavi College of Engineering, Hyderabad,
Data Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc.
Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc. Introduction Abstract warehousing has been around for over a decade. Therefore, when you read the articles
14. Data Warehousing & Data Mining
14. Data Warehousing & Data Mining Data Warehousing Concepts Decision support is key for companies wanting to turn their organizational data into an information asset Data Warehouse "A subject-oriented,
Indexing Techniques for Data Warehouses Queries. Abstract
Indexing Techniques for Data Warehouses Queries Sirirut Vanichayobon Le Gruenwald The University of Oklahoma School of Computer Science Norman, OK, 739 [email protected] [email protected] Abstract Recently,
Hybrid Support Systems: a Business Intelligence Approach
Journal of Applied Business Information Systems, 2(2), 2011 57 Journal of Applied Business Information Systems http://www.jabis.ro Hybrid Support Systems: a Business Intelligence Approach Claudiu Brandas
Week 3 lecture slides
Week 3 lecture slides Topics Data Warehouses Online Analytical Processing Introduction to Data Cubes Textbook reference: Chapter 3 Data Warehouses A data warehouse is a collection of data specifically
BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT
BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on
Data Warehousing & OLAP
Data Warehousing & OLAP Motivation: Business Intelligence Customer information (customer-id, gender, age, homeaddress, occupation, income, family-size, ) Product information (Product-id, category, manufacturer,
Application Of Business Intelligence In Agriculture 2020 System to Improve Efficiency And Support Decision Making in Investments.
Application Of Business Intelligence In Agriculture 2020 System to Improve Efficiency And Support Decision Making in Investments Anuraj Gupta Department of Electronics and Communication Oriental Institute
DSS based on Data Warehouse
DSS based on Data Warehouse C_13 / 6.01.2015 Decision support system is a complex system engineering. At the same time, research DW composition, DW structure and DSS Architecture based on DW, puts forward
Outline. Data Warehousing. What is a Warehouse? What is a Warehouse?
Outline Data Warehousing What is a data warehouse? Why a warehouse? Models & operations Implementing a warehouse 2 What is a Warehouse? Collection of diverse data subject oriented aimed at executive, decision
Dr. Osama E.Sheta Department of Mathematics (Computer Science) Faculty of Science, Zagazig University Zagazig, Elsharkia, Egypt oesheta75@gmail.
Evaluating a Healthcare Data Warehouse For Cancer Diseases Dr. Osama E.Sheta Department of Mathematics (Computer Science) Faculty of Science, Zagazig University Zagazig, Elsharkia, Egypt [email protected]
Dimensional Modeling for Data Warehouse
Modeling for Data Warehouse Umashanker Sharma, Anjana Gosain GGS, Indraprastha University, Delhi Abstract Many surveys indicate that a significant percentage of DWs fail to meet business objectives or
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING
META DATA QUALITY CONTROL ARCHITECTURE IN DATA WAREHOUSING Ramesh Babu Palepu 1, Dr K V Sambasiva Rao 2 Dept of IT, Amrita Sai Institute of Science & Technology 1 MVR College of Engineering 2 [email protected]
Data warehouse and Business Intelligence Collateral
Data warehouse and Business Intelligence Collateral Page 1 of 12 DATA WAREHOUSE AND BUSINESS INTELLIGENCE COLLATERAL Brains for the corporate brawn: In the current scenario of the business world, the competition
Introduction to Data Warehousing. Ms Swapnil Shrivastava [email protected]
Introduction to Data Warehousing Ms Swapnil Shrivastava [email protected] Necessity is the mother of invention Why Data Warehouse? Scenario 1 ABC Pvt Ltd is a company with branches at Mumbai,
ETL and Advanced Modeling
ETL and Advanced Modeling 1. Extract-Transform-Load (ETL) The ETL process Building dimension and fact tables Extract, transform, load 2. Advanced Multidimensional Modeling Handling changes in dimensions
Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring
www.ijcsi.org 78 Meta-data and Data Mart solutions for better understanding for data and information in E-government Monitoring Mohammed Mohammed 1 Mohammed Anad 2 Anwar Mzher 3 Ahmed Hasson 4 2 faculty
DATA MINING AND WAREHOUSING CONCEPTS
CHAPTER 1 DATA MINING AND WAREHOUSING CONCEPTS 1.1 INTRODUCTION The past couple of decades have seen a dramatic increase in the amount of information or data being stored in electronic format. This accumulation
International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 5, Issue 4, April-2014 442 Over viewing issues of data mining with highlights of data warehousing Rushabh H. Baldaniya, Prof H.J.Baldaniya,
SAS BI Course Content; Introduction to DWH / BI Concepts
SAS BI Course Content; Introduction to DWH / BI Concepts SAS Web Report Studio 4.2 SAS EG 4.2 SAS Information Delivery Portal 4.2 SAS Data Integration Studio 4.2 SAS BI Dashboard 4.2 SAS Management Console
ETL-EXTRACT, TRANSFORM & LOAD TESTING
ETL-EXTRACT, TRANSFORM & LOAD TESTING Rajesh Popli Manager (Quality), Nagarro Software Pvt. Ltd., Gurgaon, INDIA [email protected] ABSTRACT Data is most important part in any organization. Data
Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina
Data Warehousing Read chapter 13 of Riguzzi et al Sistemi Informativi Slides derived from those by Hector Garcia-Molina What is a Warehouse? Collection of diverse data subject oriented aimed at executive,
Business Intelligence. 1. Introduction September, 2013.
Business Intelligence 1. Introduction September, 2013. The content of the first lecture Introduction to data warehousing and business intelligence Star join 2 Data hierarchy Strategical data Operational
Decision Support and Business Intelligence Systems. Chapter 1: Decision Support Systems and Business Intelligence
Decision Support and Business Intelligence Systems Chapter 1: Decision Support Systems and Business Intelligence Types of DSS Two major types: Model-oriented DSS Data-oriented DSS Evolution of DSS into
CONCEPTUAL FRAMEWORK OF BUSINESS INTELLIGENCE ANALYSIS IN ACADEMIC ENVIRONMENT USING BIRT
CONCEPTUAL FRAMEWORK OF BUSINESS INTELLIGENCE ANALYSIS IN ACADEMIC ENVIRONMENT USING BIRT Julaily Aida Jusoh, Norhakimah Endot, Nazirah Abd. Hamid, Raja Hasyifah Raja Bongsu, Roslinda Muda Faculty of Informatics,
A Critical Review of Data Warehouse
Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 95-103 Research India Publications http://www.ripublication.com A Critical Review of Data Warehouse Sachin
A Design and implementation of a data warehouse for research administration universities
A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon
INTRODUCTION TO BUSINESS INTELLIGENCE What to consider implementing a Data Warehouse and Business Intelligence
INTRODUCTION TO BUSINESS INTELLIGENCE What to consider implementing a Data Warehouse and Business Intelligence Summary: This note gives some overall high-level introduction to Business Intelligence and
Chapter 5. Warehousing, Data Acquisition, Data. Visualization
Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives
Importance or the Role of Data Warehousing and Data Mining in Business Applications
Journal of The International Association of Advanced Technology and Science Importance or the Role of Data Warehousing and Data Mining in Business Applications ATUL ARORA ANKIT MALIK Abstract Information
Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage
Moving Large Data at a Blinding Speed for Critical Business Intelligence A competitive advantage Intelligent Data In Real Time How do you detect and stop a Money Laundering transaction just about to take
Introduction to Business Intelligence
IBM Software Group Introduction to Business Intelligence Vince Leat ASEAN SW Group 2007 IBM Corporation Discussion IBM Software Group What is Business Intelligence BI Vision Evolution Business Intelligence
Applied Business Intelligence. Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA
Applied Business Intelligence Iakovos Motakis, Ph.D. Director, DW & Decision Support Systems Intrasoft SA Agenda Business Drivers and Perspectives Technology & Analytical Applications Trends Challenges
A Knowledge Management Framework Using Business Intelligence Solutions
www.ijcsi.org 102 A Knowledge Management Framework Using Business Intelligence Solutions Marwa Gadu 1 and Prof. Dr. Nashaat El-Khameesy 2 1 Computer and Information Systems Department, Sadat Academy For
Welcome to online seminar on. Oracle Agile PLM BI. Presented by: Rapidflow Apps Inc. January, 2011
Welcome to online seminar on Oracle Agile PLM BI Presented by: Rapidflow Apps Inc. January, 2011 Agenda Agile PLM BI Overview What is Agile BI? Who Needs Agile PLM BI? What does it offer? PLM Business
Data Warehouse Overview. Srini Rengarajan
Data Warehouse Overview Srini Rengarajan Please mute Your cell! Agenda Data Warehouse Architecture Approaches to build a Data Warehouse Top Down Approach Bottom Up Approach Best Practices Case Example
