January 2010. Fast-Tracking Data Warehousing & Business Intelligence Projects via Intelligent Data Modeling. Sponsored by:



Similar documents
ER/Studio Data Architect

Why Data Warehouse Projects Fail Using Schema Examination Tools to Ensure Information Quality, Schema Compliance, and Project Success

New Tools for Faster SQL Tuning and Analysis Embarcadero Technologies

Role-Based Benefits of Buying and Using ER/Studio Enterprise Portal

Using Database Monitoring Tools to Measure, Manage, and Prove SLA Compliance Embarcadero Technologies

Setting up IIS on Windows 7: Build Web Services with RAD Studio 2010

Tech Notes. Corporate Headquarters EMEA Headquarters Asia-Pacific Headquarters 100 California Street, 12th Floor San Francisco, California 94111

Selecting the Right Change Management Solution Key Factors to Consider When Evaluating Change Management Tools for Your Databases and Teams

The Role of the Analyst in Business Analytics. Neil Foshay Schwartz School of Business St Francis Xavier U

Managing Java EE Performance with Embarcadero s J Optimizer Request Analyzer

A Database Re-engineering Workbench

Software Development Predictions For 2009

Measuring and Monitoring the Quality of Master Data By Thomas Ravn and Martin Høedholt, November 2008

Considerations: Mastering Data Modeling for Master Data Domains

Reducing the Time and Costs Associated with Sarbanes-Oxley Compliance

Embarcadero DataU Conference. Data Governance. Francis McWilliams. Solutions Architect. Master Your Data

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

Best Practices and a Must Have Toolset for SOA Migration Projects

Embarcadero ToolCloud for XE Quick Start Guide. ToolCloud for Embarcadero XE Products Last Published May 5, 2010

Whitepaper Data Governance Roadmap for IT Executives Valeh Nazemoff

Foundations of Business Intelligence: Databases and Information Management

Implementing a Data Warehouse with Microsoft SQL Server

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

Course 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012

Technology in Action. Alan Evans Kendall Martin Mary Anne Poatsy. Eleventh Edition. Copyright 2015 Pearson Education, Inc.

MDM and Data Warehousing Complement Each Other

Implementing a Data Warehouse with Microsoft SQL Server 2012

Logical Modeling for an Enterprise MDM Initiative

Course Outline. Module 1: Introduction to Data Warehousing

Busting 7 Myths about Master Data Management

Implementing a Data Warehouse with Microsoft SQL Server

Course MIS. Foundations of Business Intelligence

!!!!! White Paper. Understanding The Role of Data Governance To Support A Self-Service Environment. Sponsored by

Embarcadero Technologies, with contributions from Ron Lewis, Senior Security Analyst, CDO Technologies

Data Integration Alternatives Managing Value and Quality

SQL Server 2012 Business Intelligence Boot Camp

Implementing a Data Warehouse with Microsoft SQL Server 2012

Implementing a SQL Data Warehouse 2016

Bringing agility to Business Intelligence Metadata as key to Agile Data Warehousing. 1 P a g e.

Data Modeling Basics

Implementing a Data Warehouse with Microsoft SQL Server MOC 20463

COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

White Paper. Thirsting for Insight? Quench It With 5 Data Management for Analytics Best Practices.

TRIMIT Fashion reviewed: Tailor made out of the box?

Implementing Oracle BI Applications during an ERP Upgrade

Data Integration Alternatives Managing Value and Quality

An Enterprise Data Hub, the Next Gen Operational Data Store

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

<Insert Picture Here> Extending Hyperion BI with the Oracle BI Server

POLAR IT SERVICES. Business Intelligence Project Methodology

East Asia Network Sdn Bhd

Cincom Business Intelligence Solutions

Business Intelligence In SAP Environments

Integrated Data Management: Discovering what you may not know

Implementing a Data Warehouse with Microsoft SQL Server

Business Intelligence

Bringing Big Data into the Enterprise

MIPRO s Business Intelligence Manifesto: Six Requirements for an Effective BI Deployment

Increase Business Intelligence Infrastructure Responsiveness and Reliability Using IT Automation

Embarcadero Rapid SQL

Turning Data into Knowledge: Creating and Implementing a Meta Data Strategy

Business Intelligence for Everyone

CHAPTER SIX DATA. Business Intelligence The McGraw-Hill Companies, All Rights Reserved

Ten Things You Need to Know About Data Virtualization

CONCEPTUALIZING BUSINESS INTELLIGENCE ARCHITECTURE MOHAMMAD SHARIAT, Florida A&M University ROSCOE HIGHTOWER, JR., Florida A&M University

Cloud Computing for Technology Tools

Data Quality Assessment. Approach

Embarcadero DB Change Manager 6.0 and DB Change Manager XE2

Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives

CREATING BUSINESS VALUE THROUGH INTEGRATION

Open Source Business Intelligence Costs and Benefits

An Enterprise Framework for Business Intelligence

White Paper What Solutions Architects Should Know About The TOGAF ADM

A Blueprint for Business Software Implementation Success

Integrating SAP and non-sap data for comprehensive Business Intelligence

A technical paper for Microsoft Dynamics AX users

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

Escape from Data Jail: Getting business value out of your data warehouse

Performance Tuning Essentials for Java Manage Application Speed, Scalability and Reliability Throughout the Development Process

Implementing a Data Warehouse with Microsoft SQL Server

Course 20463:Implementing a Data Warehouse with Microsoft SQL Server

A Design Technique: Data Integration Modeling

Master Your Data. Master Your Business. Empower your business with access to consolidated and reliable business-critical data

Implementing a Data Warehouse with Microsoft SQL Server 2014

ZAP Business Intelligence Application for Microsoft Dynamics

A McKnight Associates, Inc. White Paper: Effective Data Warehouse Organizational Roles and Responsibilities

Facilitating Efficient Data Management by Craig S. Mullins

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

Moving Large Data at a Blinding Speed for Critical Business Intelligence. A competitive advantage

Architected Blended Big Data with Pentaho

Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463)

Foundations of Business Intelligence: Databases and Information Management

DATA GOVERNANCE AND DATA QUALITY

RESEARCH NOTE TECHNOLOGY VALUE MATRIX: ANALYTICS

UNIFY YOUR (BIG) DATA

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

Extensibility of Oracle BI Applications

NewVantage Point: Principles for a Successful Data Strategy

Successful Outsourcing of Data Warehouse Support

Transcription:

Fast-Tracking Data Warehousing & Business Intelligence Projects via Intelligent Data Modeling January 2010 Claudia Imhoff, Ph.D Sponsored by:

Table of Contents Introduction... 3 What is a Data Model?... 3 Benefits of Data Models on BI Projects... 5 Getting Started... 6 Summary... 7 2

Introduction Why do we read about so many Business Intelligence (BI) projects missing the mark with their business users? Did the implementers misinterpret the requirements? Were the users unable to verbalize their needs accurately or clearly? Was the wrong technology deployed? Probably none of these caused the mismatch. Often it is simply a matter of not properly documenting the needed data and related business rules. In other words, the root cause is a poorly constructed or the complete lack of a data model for the application and therefore, no way to confirm the requirements as stated by the business users. At the core of any BI should be the ability to align business needs with the data infrastructure supporting them. This is almost impossible to do without a data model. Yet many BI implementers do not understand the need for or the benefits from these design components. This paper will examine the major benefits that data models have on BI environments. What is a Data Model? First let s define what a data model is. A data model is a blueprint of the data infrastructure for an application or environment. This blueprint translates business concepts into technical diagrams that record the fundamental aspects of the information needed for a particular BI environment. In IT application development, there are different levels of data models, depending on the audience for these models. Figure 1 depicts these different levels of models. These data models are used as specialized communication devices so that the business and IT can fully comprehend the BI environment s requirements for data and the business constraints on that data. There are four levels of data models each successive level inherits the information and constraints from the previous level but adds specific information required at that level. The first two levels of data models are enterprise standards and should be used as a starting point for any application. Therefore, they are labeled as Program Activities. The last two levels are specific to a coordinated set of projects under the program. An example would be the set of projects that create the BI environment. These models are labeled as project-specific activities. Subject Area Model A collection of the high-level subject areas 3

and their relationships. The subject areas are groupings of things of importance to the enterprise and usually number between 12-15 subject areas. Enterprise Data Model A representation of information used in an enterprise from a business perspective. This model is also called the Logical or Business Data Model. It is designed to be independent of functional application or physical implementation. There is only one enterprise data model in an organization. System Model A collection of the information being addressed by a specific system or application. It is the electronic representation of information and is independent of the specific technology to be used in its implementation. It is developed from a subset of the enterprise data model. Since a system or application may be implemented on multiple technologies (e.g., a data warehouse plus several dependent data marts), there may be multiple system models. Technology Model A collection of the information being addressed by a system and implemented on a specific platform. The technology model must take into account and document the specific hardware, operating system, DBMS, etc., requirements. Just like the system data model, there may be multiple technology models. That is the same system may reside on multiple technologies (e.g., a data warehouse that is physically implemented on different technologies) Figure 1: Data Model Levels 4

Benefits of Data Models on BI Projects So what are the benefits of using these data models? 1. Documentation As stated, data models are wonderful communication devices. Just as architectural blueprints are created with several levels of detail, so are data models for the BI environment. Each level produces a successive level of detail about the data. Architectural blueprints record the high level layout of the overall building all the way down to the detailed plumbing and electrical wiring for each room. In the same manner, the data models for the BI environment document the overall high level data requirements all the way down to the technical formats and constraints on each individual data attribute. This documentation is invaluable for ensuring that the BI application is correctly supported by the data infrastructure. 2. Business rule adherence Architectural blueprints also include constraints for the building (types of pipes and wires needed, how far apart components must be for safety, structural bearing walls, etc.) Data models also provide a similar record of all the business rules the enterprise must not violate. These can be as simple as an order must have a relationship with both a customer and a product to very complex rules surrounding compliance requirements or proper accounting procedures. These rules of the business are generally captured in the relationships between business entities such as between the customer, product, and order or within the domains of entities, their subtypes, and ultimate formats. Business rules are the checks or controls within the application that stop ETL processes from incorrectly loading the data to restricting analytic processes (or analysts!) from creating invalid comparisons, analyses, or aggregations in the BI environment. 3. Productivity A well-known best practice for BI environments is for each successive project to build upon the foundation created by the previous projects. In other words, the ETL processes, integrated data, and especially the data models should be reused where possible. Well-constructed data models are significant productivity tools that eliminate redundant design efforts, inconsistent data elements, inaccurate or incompatible business rules, etc. They promote significant increases to overall project productivity. We all 5

know that we tend to be better editors of an existing component than creators of something new. What better way to increase productivity than to start with an existing data model rather than a blank sheet of paper! 4. Easier maintenance/enhancement No matter how thoroughly you investigate the requirements for a BI application, I can guarantee that once it is deployed, the users of the application will figure out new and different ways of using it. This generally translates into enhancements or maintenance to the underlying data infrastructure and processes. Without a data model to act as a roadmap for these changes, they can be very detrimental to the overall performance and usability of the application. For example, the data model determines the optimal placement of new attributes or entities for the best possible response time and ease of use. 5. Enterprise-focus As mentioned in number 3, each BI project should build upon existing deployments. Another best practice is to ensure that the data models for each project maintain the enterprise view or at least highlight any exceptions to this view. The enterprise nature of the data models ensures that the data infrastructure can be reused by multiple departments or groups as well as multiple projects with minimal effort. It may mean that the company has to resolve its differences for such hot entities as customer or product just agreeing to their definitions can be challenging but the consistency and reliability obtained by this effort far outweighs the effort it takes to consolidation these differences. In the long run, the overall BI environment benefits in terms of data consistency and reliability. Getting Started If a subject area model or enterprise data model do not exist, then a best practice for BI states that the first BI project should create the subject area model (it is a very quick model to create) and only that portion of the enterprise model that directly supports the current BI project and no more. Each successive BI project can then build out more and more of the enterprise data model as the project creates its own system and technology models (see the back arrows in Figure 1). This practice of iteratively building out the two program models means that no one project is responsible for the creation of the entire enterprise data model (a significant effort to say the least). The risk is that the program models may undergo significant changes as successive 6

projects are implemented. These changes could impact the already implemented projects models as well. Another best practice for getting started is to start with the database schema of the existing operational or transaction (source) systems. It is possible to convert these designs into technology and system models. These can in turn be used as a starting point for the enterprise data model and subject area model. They may also serve as the starting point for the system and technology models of the data warehouse and/or data marts. Reusing their definitions and documentation can be a great leg up on the creation of all the levels of models and can ensure consistency from one environment to the other. But ensure that you have other sources of information for the data models in addition to the operational models, especially if your operational systems are packages like ERP systems. It may be that the business actually wants to change some of the rules embedded in the operational models but can t. Don t continue this frustration by continuing the poor business rule in the BI environment when it is not necessary. In any case, watch out for analysis paralysis. A data model is never really finished; it can always be polished and polished to the detriment of any project s time line. To avoid this over-design situation, you may want to time box the data model development activities. When the time is up, go with the model that you have. It may not be perfect but it will be good enough. Summary Given these major benefits of data models, it should be obvious that their usage translates not only into far superior BI applications and environments but also into significant cost reductions by eliminating data inconsistencies and design duplications. Data models also reduce costs by improving the maintenance of BI products and overall efficiency of deployments. Data models promote a common understanding between IT and the business and are the keys to deploying more successful projects. The likelihood that the BI environment will be adopted and used by the business community goes up significantly when both the business and IT have a thorough understanding of its infrastructure. And that is the whole point of a BI environment happy and satisfied business users making accurate and timely decisions for the enterprise based on reliable, documented information. 7

About Claudia Imhoff President, Intelligent Solutions Inc. A thought leader, visionary, and practitioner in the rapidly growing fields of business intelligence and customer focused-strategy Claudia Imhoff, Ph.D., is a popular and dynamic speaker and internationally recognized expert on analytical CRM, business intelligence, and the infrastructure to support these initiatives the Corporate Information Factory (CIF). Dr. Imhoff has coauthored five highly-regarded and popular books on these subjects and writes monthly columns and articles (totaling more than 100) for technical and business magazines. She has served on the Board of Advisors for DAMA International and was chosen by the DAMA organizations to receive the 1999 and 2005 Individual Achievement Awards. She is an advisor and a faculty member for The Data Warehousing Institute and serves as an advisor for several technology and commercial companies. Dr. Imhoff delivers keynote addresses at conferences sponsored by software companies and their user groups, The Data Warehousing Institute, The Economist, COMDEX, and many international organizations. She has appeared repeatedly on World Business Review, Microsoft s Getting Results programs, and web casts sponsored by DM Review, Better Management, and several technology vendors. She is a member of the Advisory Board of the Daniels School of Business at the University of Denver and is on several technology companies advisory councils. Dr. Imhoff founded Intelligent Solutions, Inc. (www.intelsols.com), a well respected Business Intelligence and CRM consulting and education firm in 1992. Her company has successfully implemented over 150 Corporate Information Factory architectures in all industry areas. About Embarcadero Embarcadero Technologies, Inc. is a leading provider of award-winning tools for application developers and database professionals so they can design systems right, build them faster and run them better, regardless of their platform or programming language. Ninety of the Fortune 100 and an active community of more than three million users worldwide rely on Embarcadero products to increase productivity, reduce costs, simplify change management and compliance and accelerate innovation. The company s 8

flagship tools include: Embarcadero Change Manager, Embarcadero RAD Studio, DBArtisan, Delphi, ER/Studio, JBuilder and Rapid SQL. Founded in 1993, Embarcadero is headquartered in San Francisco, with offices located around the world. Embarcadero is online at www.embarcadero.com. 9