Creating a Real Time Data Warehouse
|
|
|
- Juniper Curtis
- 10 years ago
- Views:
Transcription
1 Creating a Real Time Data Warehouse Joseph Guerra, SVP, CTO & Chief Architect David A. Andrews, Founder Introduction Business Intelligence (BI) technology has helped many organizations make better decisions and improve performance. Unfortunately, implementing BI does not automatically lead to better results. Too often, BI systems deliver only a small fraction of their potential value. Experience has shown that the way in which BI technology is implemented has a huge impact on results. One of the most important keys to success is the use of a data warehouse as the foundation for a BI solution. 700 West Johnson Avenue Cheshire, CT Copyright RapidDecision All trademarks are the property of their respective owners. The way in which the data is brought into the warehouse is extremely important. The most effective approach keeps the data in the warehouse closely synchronized with data from source databases something often referred to as real-time updating. Even though the advantages of real-time data warehousing have been known for some time surprisingly few working BI solutions employ it today. A key reason for limited use of this very valuable approach is a lack of understanding of what it is and what is involved in accomplishing it. Fortunately, improving technology and lessons from the experience of those that have successfully made it work have combined to make real-time data warehousing practical and affordable. This paper will explain real-time data warehousing in simple terms and will provide practical guidance on how to make it work. The advice offered here is not theoretical. It has been successfully used for many years in a wide variety of complex organizations
2 Creating a Real Time Data Warehouse 2013 Executive Summary The most effective BI solutions are based on data warehouses. The only other option using BI tools to obtain data directly from source applications has been shown countless times to simply not work acceptably. A data warehouse maintains a copy of data from sources that usually include the ERP software that runs business operations. A data warehouse runs on dedicated servers and organizes and formats the data in a way that facilitates analysis and reporting. In a perfect world the data in the warehouse would instantly reflect changes in source databases. In practice some delay is inevitable. Experience has shown that a delay of up to a few minutes in synchronization of a warehouse with its sources is almost always acceptable. Longer delays, however, can create problems. This paper will refer to any data warehouse that maintains a short and acceptable delay in synchronization as a real-time data warehouse. Techniques for achieving real-time synchronization have been available for some time. Unfortunately, they have not yet achieved widespread use. Applying them has become practical and affordable and thus should be the first option explored by any organization starting down the BI path. Retrofitting existing BI solutions with a real-time data warehouse is also an option worthy of examination. Most data warehouses in use today are not real-time. Instead they typically synchronize with data sources once a day, usually late at night. Data warehouses updated this way often use the brute force approach of re-copying everything in the source databases. This daily refresh approach takes a long time and can create a serious scheduling problem since the source servers normally need to stop all other activity while this is taking place. The only other option is to provide the excess processing power needed to accommodate the refresh. The technical term for the process used to achieve realtime synchronization is Change Data Capture or CDC. A number of ways to achieve CDC have been tried over the years. Only one approach has stood the test of time and has proven to be acceptable. It is based on obtaining data from log files created by the source computer s database management system. Other approaches that have been tried and found to be inadequate will be examined later in the more technical section of this paper. The best practice for BI is thus to use a data warehouse that employs log-based CDC for updates. Those evaluating data warehouse options therefore need to verify that any claims of a CDC capability are log-based and do not employ one of the inferior alternatives. Log-based CDC is important because it is the only updating technique that can insure that all changes to the source databases will be reflected in the data warehouse. Without this assurance the value of the data warehouse and the output it generates can be greatly diminished. Those using a data warehouse need to have confidence that its contents are complete and accurate. Many BI projects fail because users lose confidence in the data warehouse and stop using it. A growing number of BI systems produce output that is vital to the ongoing success of the organization. Financial results are often reported to the public and to regulators based on information in data warehouses. The updating technique for these warehouses must be unassailable, auditable and in compliance with Sarbanes- Oxley regulations. Log-based CDC is by far the best way to achieve this. The more detailed examination of how this approach works is presented later in the paper and will make it clear why this is the case. Most data warehouses are built using a type of software tool named for what it does: Extract, Transform and Load or ETL. Popular tools in this category include Informatica PowerCenter, IBM InfoSphere DataStage, SAP Business Objects Data Services, and Oracle Data Integrator. Other options are available from less well known vendors. All perform the basic functions of obtaining data from a source server (extraction), reformatting it (transformation) and updating of the resulting data warehouse (loading). 2 RapidDecision
3 2013 Creating a Real Time Data Warehouse Not all of these tools can perform log-based CDC on their own. ETL tools that do not support this function need to be combined with another software tool that adds this capability. The major vendors have tended to acquire software firms that offer a log reading capability and then either integrate it into their ETL tools or offer it as an optional complementary product. Those evaluating data warehouse offerings cannot therefore assume that vendors whose product line includes a log-based CDC capability will automatically include it in a proposed solution. One of the reasons that log-based CDC is not more widely used is that the use of this technique has implications on the way the data is organized and stored in the warehouse. That structure is referred to as the Data Model of the warehouse. If the Data Model does not follow certain design principles then the performance of the resulting data warehouse can be disappointing. Those about to undertake a BI project need to fully understand the impact that real-time updating will have on the Data Model design. If a warehouse is first implemented using daily batch updates then a significant redesign might later be necessary in order to make logbased CDC workable. The advantages of real-time updating An obvious advantage of real-time synchronization is that there is no need to reconcile differences between source applications and the warehouse. A data warehouse that does not appear to always be up to date tends to lose the confidence of the user community. Lost user confidence can lead to the failure of a BI effort. This is because the return on investment in BI is dependent on how many different things it is used for and their collective impact. Any loss of confidence in the quality or timeliness of the data on which output is based reduces the use of BI and thus its impact. A well designed log-based warehouse will be fully auditable. This makes it possible to create reports and other output that is provided to outsiders such as investors or regulators. The usage and thus value of a BI environment based on auditable data tends to be much higher than one whose data cannot be fully trusted. Another major potential benefit of BI is the elimination of debates over whose data is accurate. This benefit is hard to realize unless the foundation of the BI solution is a real-time data warehouse. BI systems can also contribute greatly to operational efficiency if they include the ability to initiate actionoriented alerts. Traditional applications often cannot instantly analyze the impact of unusual transactions such as a large rush order, a delivery delay, or a major purchase price variance. BI systems can be programmed to identify the need for action in many of these situations and to alert the appropriate personnel, often through dashboard displays. Without the support of a real-time data warehouse, it is not possible to create this type of action-oriented alert. Log-based CDC also represents the lowest impact method to update a data warehouse. Every other approach puts a greater burden on the servers supporting the source databases. Performance is thus improved and unnecessary costs are eliminated. Summary BI solutions that employ a real-time data warehouse with log-based updating will tend to deliver far more satisfactory results over time than those without it. Other approaches will not perform as well, may not be reliable enough to be auditable, and will create confusion at times due to timing differences. Proven techniques have been established to make realtime data warehousing practical and affordable. A small but growing number of pre-built log-based real-time data warehouses are available for purchase. If one is available for the sources you use then it should be seriously considered. When a pre-built solution is not available it is usually necessary to either custom-build a warehouse or engage others to create it. In either case it is important to involve experts in log-based CDC. It is also important to insure that the Data Model on which your data warehouse is created anticipates real-time updating. RapidDecision 3
4 Creating a Real Time Data Warehouse 2013 Examining real-time data warehousing in more detail Over time the highly effective use of BI is likely to become a competitive necessity for every complex organization. In anticipation of that, it is appropriate to set a goal of working towards real-time data warehousing. Achieving that goal may not be as difficult as it now appears. A more detailed understanding of exactly what is involved may be useful. Log-based Change Data Capture Every data base management system (DBMS) includes a mechanism to save data for backup and recovery and to support commitment control. All use a similar approach based on the creation of log files. Each time a change occurs in a database under its control, the DBMS insures that a complete record of the change is written to a log file. This file can be stored on devices that are separate from those used to store the source data so that if one of those devices or its server fails, the log remains available. The commitment control process allows databases to insure that data is never lost due to failure during the process of updating or otherwise changing data on the source system. The log contains records of changes that have been confirmed to be complete by the commitment control process. Log files contain a complete record of all changes that were made along with a date and time stamp indicating exactly when each change occurred. These log files make it possible to repair the source database in the event of failure. They are used in the process of recovering from a device failure (or software error) by restoring the data on a device to a copy made at a particular time and then re-processing transactions from the log files until the device is restored to the point it was when the failure occurred. The key point is that database log files are designed to reflect every possible change in the database. Creative programmers, hackers, hardware crashes or even cruel fate cannot get around making them the best possible source of data on which to base a CDC strategy. A mechanism similar to the one used for device recovery can therefore be used to create a data warehouse that is synchronized with its data sources without fear that changes will be missed. How it works An examination of the way log-based CDC works should make it clear why this approach is vastly superior to any alternative. The starting point is the ETL tool that is used to organize and manage the entire process of building data warehouses. Some ETL tools do not have the capability to perform the extract function when the source data comes from log files instead of the source database itself. Such tools need to be combined with another tool that does offer this capability if log-based CDC is going to be implemented. It should be noted that each DBMS has its own unique way of creating log files. A driver or software interface to each DBMS from which source data needs to be obtained will thus be needed. Commonly used DBMS s include Microsoft SQL Server, Oracle 11g, mysql and IBM DB2. Note that the version of DB2 used with IBM s ios differs considerably from the other versions and is not supported by some ETL tools. When a real-time data warehouse is being created the first step is a one-time process to populate the initial version of the warehouse. The ETL tool finds and extracts every data item from the source databases, processes all the necessary transformations and loads everything into the data warehouse. This start-up process can take hours depending on the size of the files involved and the power of the servers. It is important to note that this time consuming process only needs to happen once in a CDC-fed data warehouse. With the log-reading drivers in place the ETL/CDC tool is turned on. The log readers poll the log files and use them to identify all changes that have occurred. These changes are sent to the parts of the ETL tool that handle the transform and load functions. When the ETL tool is ready to handle the next wave of update transactions another polling cycle is initiated. The polling cycle 4 RapidDecision
5 2013 Creating a Real Time Data Warehouse typically takes place every few seconds and can be varied as appropriate. The log reading logic thus handles the extraction phase of the ETL process. It feeds change transactions to the transformation function. This is the step where source data is made more understandable and usable by BI analysis tools. Often this involves changing the names of data fields to make them both understandable and consistent. The final function of loading the data warehouse can then take place. The transformed data is normally organized in different ways than it was in source databases. Combinations (de-normalization) of tables are frequently performed to make access faster and easier. Design considerations Transformation and loading are complex procedures that take time and computer capacity. There can be occasions where an unusually heavy volume of change transactions can cause the ETL process to fall behind the log file. In these cases update transactions are not lost, just delayed. Designers of log-based CDC data warehouses need to determine how much server capacity to put in place to handle such intermittent bursts of change traffic in order to limit the duration and frequency of delays. Experience has shown that most data warehouse users will not notice or be concerned about synchronization delays of a minute or less. Most will tolerate longer delays if they occur infrequently. Real-time data warehouses should be designed to continuously monitor their own level of synchronization and alert administrators and users if pre-set service levels are not met. A well-designed log-based CDC system will have no noticeable impact on performance of the source servers from which data is being extracted. This is one of the most important benefits of this approach to CDC. In most cases it is very important to avoid slowing down the process of updating source servers since the transactions they handle are often vital to the ongoing operation of the entity. Log-based CDC is by far the best way to minimize impact on source transaction servers. Alternatives to log-based CDC Approaches other than log reading that have been tried, and found to be wanting include: Database triggers this DBMS feature invokes a prewritten routine each time a specific set of conditions are met, such as the addition or updating of a record in the database. In concept, the code that the trigger launches can be written to write a record of the transaction to a database table and the ETL tool can then poll those tables on a periodic basis. In practice, this approach is far from foolproof. For example triggers can regularly be deleted/disabled then re-added/re-enabled in the normal course of a business operation. Triggers also place a relatively high overhead burden on the source database server Message Queues a number of middleware products such as IBM MQ Series also appear to have the capacity to capture application, not database, changes and report them to an ETL tool. An obvious disadvantage is the cost of the license for these products. More importantly, message queues are not a completely reliable source of change information since they only know about data changes that are sent to them by the applications and not batch routines or manual updates to the database. Date and time stamps many ERP applications and other data sources maintain data fields within each record that indicate when it was last changed. An approach to CDC that has been tried a surprising number of times reads through the data records and looks for recent changes. The fatal flaw in this approach is that it relies on the programs that change data to unfailingly update this field. In addition, this approach can lose track of deletions since the entire record, including the time stamps, are gone. Sadly, some of the major BI vendors claim to update data warehouses in real-time but do so using an inferior approach to log-based CDC. Buyers need to look under the covers and verify that properly designed log-based CDC forms the foundation of any data warehouse before it is put in the hands of users. RapidDecision 5
6 Creating a Real Time Data Warehouse 2013 Case History RapidDecision The ideas presented above would have limited value if they had not been proven to work in practice. RapidDecision, a packaged software offering from Andrews Consulting Group, represents a proof-point that everything advocated in this paper works in practice. RapidDecision evolved out of the efforts of a development team under the leadership of Joseph Guerra (principal author of this paper) that was engaged to create a number of custom data warehouses. Over time products were created using the custom warehouses as their foundation. The RapidDecision design principles have thus successfully been applied many times, building warehouses for a variety of businesses in very different industries. RapidDecision is a pre-built real-time data warehouse built around a highly flexible and expandable data model. Packaged versions of it are sold for JD Edwards, PeopleSoft, Oracle E-Business and Lawson. Other sources are planned for development in the future. Custom versions have been created for Infor BPCS, MAPICS and Siebel. Every implementation of RapidDecision, including the early custom versions created more than a decade ago, has used log-based CDC as its synchronization approach. Early versions were based on the DataMirror Transformation Server product that has since been bought by IBM and is now sold as IBM InfoSphere CDC. Since 2003 RapidDecision has used SAP Business Objects Data Services to perform the ETL function and to handle log reading for SQL Server and Oracle DBMS sources. When source data resides on an IBM server using the ios version of DB2 then a third party log reader is included as well. Plans call for future support of other ETL tools that offer log-based CDC. A schedule for availability of other ETL options has not yet been announced. A unique feature of RapidDecision is called its heartbeat. It makes it possible for RapidDecision to continuously monitor the lag time between posting of changes to source logs and the completion of the resulting updates to the data warehouse. Alerts can be sent to administrators when pre-set limits are exceeded or when the updating process is interrupted for any reason. Immediate action can be taken in either case. An option is offered to buyers to have the monitoring and response function managed by the RapidDecision support staff. A number of RapidDecision installations have replaced failed data warehouse efforts. The real-time capability of RapidDecision was often the key reason it was selected. In every case, performance was improved dramatically. Using log-based real-time as the foundation for RapidDecision has made it less expensive and complex to install and operate than warehouses that use other approaches. RapidDecision typically can become fully operational within a few weeks, even for buyers implementing large numbers of RapidDecision data marts. The success of RapidDecision under many different conditions provides further confirmation of the viability of log-based CDC. The RapidDecision product is only a viable option for organizations that use one or more of the source applications that it supports. Those creating warehouses for data from other sources can still benefit from examining how RapidDecision was designed and implemented. More information about RapidDecision and the data warehouse design services of Andrews Consulting Group can be found at www. rapiddecision. net. 6 RapidDecision
7 2013 Creating a Real Time Data Warehouse About the Authors Joseph Guerra is one of the few people in the world to both deeply understand all facets of Business Intelligence and to also have contributed to the success of hundreds of successful installations. His combination of experience, expertise and influence are largely unmatched. His most significant accomplishment is the creation of RapidDecision, a software product that has set a new standard for pre-built data warehouses. Guerra has found ways to overcome challenges others assumed were insurmountable. He has discovered how to create real-time data warehouses that can be installed quickly, perform well, and grow without disruption. His ideas and opinions are widely sought by user groups, major BI vendors, industry analysts and many leading edge users of Business Intelligence technology. About Andrews Consulting Group This paper is one of a series covering various aspects of Business Intelligence. It was created by Andrews Consulting Group a Software and IT advisory business that has been implementing real-time data warehouses since More than 250 organizations are actively benefiting today from real-time data warehouses that it has helped implement. Other papers in the series will explore related issues such as: Introduction to Business Intelligence Why you need a Data Warehouse Turning the potential of BI into business results Which Business Intelligence Tool is Right? The best ways to deploy Business Intelligence Andrews Consulting Group (ACG) has been helping organizations make effective use of information technology since Few IT service businesses have In his role as Vice President and Chief Architect at Andrews Consulting Group, Guerra oversees all product development and a highly respected BI and data warehousing consulting group. He is the author of a number of white papers and frequently is asked to present at industry events. David Andrews founded Andrews Consulting Group in 1984 after 17 years managing the implementation of advanced IT systems. He is the author of the book Revolutionizing IT (John Wiley and Sons, September 2002) and has written more than 50 white papers. He has made presentations to audiences in 17 countries about the effective use of IT within mid-sized organizations and has been widely quoted in trade journals and the business press. achieved as high a level of client satisfaction over an extended period of time as ACG. One of the most effective ways in which ACG has helped its clients is in the creation of business intelligence solutions. Numerous customers have business intelligence solutions designed and installed by ACG. This experience allows ACG to implement sophisticated, but cost-effective BI solutions for new customers in as few as five days, far faster than is possible using any other approach. Andrews Consulting Group has been publishing white papers since 1987 when we were the first ones to accurately describe IBM s plans for introduction of the AS/400 (known by the code name Silverlake at the time). Since then, ACG has published over 60 industry reports with a total circulation of over one million copies. RapidDecision 7
8 RapidDecision Making Business Intelligence Work Best RapidDecision makes your business intelligence work best. It does this by specially preparing your data so that any business intelligence tool can use it. This means that business users can easily create ad hoc queries, perform complex analytics and use dashboards. The process of preparing data this way is known as a Data Warehouse. RapidDecision provides pre-built data warehouses for JD Edwards, PeopleSoft, Oracle E-Business and Lawson. Any Business Intelligence tool, for example those from SAP Business Objects, IBM Cognos, Oracle, Microsoft, MicroStrategy and others can only give you the best results when it works with a data warehouse. RapidDecision brings data warehouses within reach of all customers, and removes all the reporting and analysis limitations that characterize business intelligence when it runs without a data warehouse. To learn more about RapidDecision and how it can help your organization, visit us on the web: us at [email protected] or call us at x RapidDecision The Engine That Drives BI
RapidDecision EDW: THE BETTER WAY TO DATA WAREHOUSE
RapidDecision EDW: THE BETTER WAY TO DATA WAREHOUSE GET THE MOST COMPLETE, REAL-TIME VIEW OF YOUR BUSINESS DATA Data, data everywhere but no complete view or meaningful analysis in sight. Sound familiar?
ENABLING OPERATIONAL BI
ENABLING OPERATIONAL BI WITH SAP DATA Satisfy the need for speed with real-time data replication Author: Eric Kavanagh, The Bloor Group Co-Founder WHITE PAPER Table of Contents The Data Challenge to Make
Breadboard BI. Unlocking ERP Data Using Open Source Tools By Christopher Lavigne
Breadboard BI Unlocking ERP Data Using Open Source Tools By Christopher Lavigne Introduction Organizations have made enormous investments in ERP applications like JD Edwards, PeopleSoft and SAP. These
IBM WebSphere Cast Iron Cloud integration
IBM Cast Iron Cloud integration Integrate SugarCRM in days Highlights Speeds up time to implementation for SugarCRM integration projects with configuration, not coding approach Offers cost savings with
IBM WebSphere Cast Iron Cloud integration
IBM Cast Iron Cloud integration Integrate Microsoft Dynamics in days Highlights Speeds up time to implementation for Microsoft Dynamics integration projects with configuration, not coding approach Achieves
SAP BusinessObjects SOLUTIONS FOR ORACLE ENVIRONMENTS
SAP BusinessObjects SOLUTIONS FOR ORACLE ENVIRONMENTS BUSINESS INTELLIGENCE FOR ORACLE APPLICATIONS AND TECHNOLOGY SAP Solution Brief SAP BusinessObjects Business Intelligence Solutions 1 SAP BUSINESSOBJECTS
Overview and Frequently Asked Questions
Overview and Frequently Asked Questions OVERVIEW Oracle is pleased to announce that we have completed our acquisition of Siebel Systems and we are now operating as one. As the leader in customer relationship
SQL Maestro and the ELT Paradigm Shift
SQL Maestro and the ELT Paradigm Shift Abstract ELT extract, load, and transform is replacing ETL (extract, transform, load) as the usual method of populating data warehouses. Modern data warehouse appliances
IBM WebSphere Cast Iron Cloud integration
IBM Cast Iron Cloud integration Integrate salesforce.com in days Highlights Speeds up time to implementation for salesforce.com integration projects with configuration, not coding approach Offers cost
Business Intelligence for Everyone
Business Intelligence for Everyone Business Intelligence for Everyone Introducing timextender The relevance of a good Business Intelligence (BI) solution has become obvious to most companies. Using information
Contents. Financial Reporting & Analysis: Dynamically Leveraging Microsoft Excel Via Spreadsheet-Based Applications
Financial Reporting & Analysis: Dynamically Leveraging Microsoft Excel Via Spreadsheet-Based Applications Contents A Tactical Solution for an Age Old Challenge The Challenge 3 The Outlook 4 White Paper
WebSphere Cast Iron Cloud integration
Cast Iron Cloud integration Integrate in days Highlights Speeds up time to implementation for Cloud and on premise integration projects with configuration, not coding approach Offers cost savings with
How To Use Noetix
Using Oracle BI with Oracle E-Business Suite How to Meet Enterprise-wide Reporting Needs with OBI EE Using Oracle BI with Oracle E-Business Suite 2008-2010 Noetix Corporation Copying of this document is
Innovative technology for big data analytics
Technical white paper Innovative technology for big data analytics The HP Vertica Analytics Platform database provides price/performance, scalability, availability, and ease of administration Table of
Oracle BI Applications (BI Apps) is a prebuilt business intelligence solution.
1 2 Oracle BI Applications (BI Apps) is a prebuilt business intelligence solution. BI Apps supports Oracle sources, such as Oracle E-Business Suite Applications, Oracle's Siebel Applications, Oracle's
Business Intelligence Solution for Small and Midsize Enterprises (BI4SME)
Business Intelligence Solution for Small and Midsize Enterprises (BI4SME) Preface Not only large Enterprises can benefit from the advantages of Business Intelligence (BI) Solutions. BI4SME is a cost efficient,
IBM Cognos 8 Business Intelligence Reporting Meet all your reporting requirements
Data Sheet IBM Cognos 8 Business Intelligence Reporting Meet all your reporting requirements Overview Reporting requirements have changed dramatically in organizations. Organizations today are much more
Business Intelligence Solutions: Data Warehouse versus Live Data Reporting
Business Intelligence Solutions: Data Warehouse versus Live Data Reporting If you are a JD Edwards customer and have tried to improve reporting for your organization, you have probably realized that JD
Epicor Vantage GLOBAL ENTERPRISE RESOURCE PLANNING
Epicor Vantage GLOBAL ENTERPRISE RESOURCE PLANNING EPICOR VANTAGE Next Generation Manufacturing Software Epicor Software Corporation understands that you, like manufacturers worldwide, must identify, consider
Oracle BI Application: Demonstrating the Functionality & Ease of use. Geoffrey Francis Naailah Gora
Oracle BI Application: Demonstrating the Functionality & Ease of use Geoffrey Francis Naailah Gora Agenda Oracle BI & BI Apps Overview Demo: Procurement & Spend Analytics Creating a ad-hoc report Copyright
Extensibility of Oracle BI Applications
Extensibility of Oracle BI Applications The Value of Oracle s BI Analytic Applications with Non-ERP Sources A White Paper by Guident Written - April 2009 Revised - February 2010 Guident Technologies, Inc.
Bringing Big Data into the Enterprise
Bringing Big Data into the Enterprise Overview When evaluating Big Data applications in enterprise computing, one often-asked question is how does Big Data compare to the Enterprise Data Warehouse (EDW)?
Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics. An Oracle White Paper October 2013
An Oracle White Paper October 2013 Oracle Data Integrator 12c (ODI12c) - Powering Big Data and Real-Time Business Analytics Introduction: The value of analytics is so widely recognized today that all mid
Simplify Software as a Service (SaaS) integration
IBM Software WebSphere Thought Leadership White Paper Simplify Software as a Service (SaaS) integration By Simon Peel January 2011 2 Simplify Software as a Service (SaaS) integration Contents Introduction
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence
Introduction to Oracle Business Intelligence Standard Edition One Mike Donohue Senior Manager, Product Management Oracle Business Intelligence The following is intended to outline our general product direction.
Business-driven governance: Managing policies for data retention
August 2013 Business-driven governance: Managing policies for data retention Establish and support enterprise data retention policies for ENTER» Table of contents 3 4 5 Step 1: Identify the complete business
A McKnight Associates, Inc. White Paper: Effective Data Warehouse Organizational Roles and Responsibilities
A McKnight Associates, Inc. White Paper: Effective Data Warehouse Organizational Roles and Responsibilities Numerous roles and responsibilities will need to be acceded to in order to make data warehouse
DEMAND SMARTER, FASTER, EASIER BUSINESS INTELLIGENCE
DEMAND SMARTER, FASTER, EASIER BUSINESS INTELLIGENCE Join the New timextender removes all the expensive and time-consuming backend concerns typically associated with Business Intelligence. With one application
White Paper: Evaluating Big Data Analytical Capabilities For Government Use
CTOlabs.com White Paper: Evaluating Big Data Analytical Capabilities For Government Use March 2012 A White Paper providing context and guidance you can use Inside: The Big Data Tool Landscape Big Data
TECHNOLOGY BRIEF: CA ERWIN SAPHIR OPTION. CA ERwin Saphir Option
TECHNOLOGY BRIEF: CA ERWIN SAPHIR OPTION CA ERwin Saphir Option Table of Contents Executive Summary SECTION 1 2 Introduction SECTION 2: OPPORTUNITY 2 Modeling ERP Systems ERP Systems and Data Warehouses
TIBCO StreamBase High Availability Deploy Mission-Critical TIBCO StreamBase Applications in a Fault Tolerant Configuration
TIBCO StreamBase High Availability Deploy Mission-Critical TIBCO StreamBase Applications in a Fault Tolerant Configuration Richard Tibbetts, CTO, TIBCO StreamBase Table of Contents 3 TIBCO StreamBase High
BusinessObjects XI. New for users of BusinessObjects 6.x New for users of Crystal v10
BusinessObjects XI Delivering extreme Insight Bringing information to new users, in new ways, with unmatched simplicity and context. Broadest and deepest end user capabilities from reporting, to query
Business Usage Monitoring for Teradata
Managing Big Analytic Data Business Usage Monitoring for Teradata Increasing Operational Efficiency and Reducing Data Management Costs How to Increase Operational Efficiency and Reduce Data Management
Realizing the True Power of Insurance Data: An Integrated Approach to Legacy Replacement and Business Intelligence
Realizing the True Power of Insurance Data: An Integrated Approach to Legacy Replacement and Business Intelligence Featuring as an example: Guidewire DataHub TM and Guidewire InfoCenter TM An Author: Mark
Fiserv. Saving USD8 million in five years and helping banks improve business outcomes using IBM technology. Overview. IBM Software Smarter Computing
Fiserv Saving USD8 million in five years and helping banks improve business outcomes using IBM technology Overview The need Small and midsize banks and credit unions seek to attract, retain and grow profitable
RESEARCH NOTE TECHNOLOGY VALUE MATRIX: ANALYTICS
Document L59 RESEARCH NOTE TECHNOLOGY VALUE MATRIX: ANALYTICS THE BOTTOM LINE Organizations continue to invest in analytics in order to both improve productivity and enable better decision making. The
Turnkey Hardware, Software and Cash Flow / Operational Analytics Framework
Turnkey Hardware, Software and Cash Flow / Operational Analytics Framework With relevant, up to date cash flow and operations optimization reporting at your fingertips, you re positioned to take advantage
Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University
Bussiness Intelligence and Data Warehouse Schedule Bussiness Intelligence (BI) BI tools Oracle vs. Microsoft Data warehouse History Tools Oracle vs. Others Discussion Business Intelligence (BI) Products
<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise
Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise Business Intelligence is the #1 Priority the most important technology in 2007 is business intelligence
Top 10 Reasons for Using Disk-based Online Server Backup and Recovery
ADVISORY Top 10 Reasons for Using Disk-based Online Server Backup and Recovery INTRODUCTION Backup of vital company information is critical to a company s survival, no matter what size the company. Recent
A New Foundation For Customer Management
The Customer Data Platform: A New Foundation For Customer Management 730 Yale Avenue Swarthmore, PA 19081 [email protected] The Marketing Technology Treadmill Marketing automation. Inbound marketing.
HARNESS IT. An introduction to business intelligence solutions. THE SITUATION THE CHALLENGES THE SOLUTION THE BENEFITS
HARNESS IT. An introduction to business intelligence solutions. THE SITUATION THE CHALLENGES THE SOLUTION THE BENEFITS THE SITUATION Data is growing exponentially in size and complexity. Traditional analytics
ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION
ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION EXECUTIVE SUMMARY Oracle business intelligence solutions are complete, open, and integrated. Key components of Oracle business intelligence
A business intelligence agenda for midsize organizations: Six strategies for success
IBM Software Business Analytics IBM Cognos Business Intelligence A business intelligence agenda for midsize organizations: Six strategies for success A business intelligence agenda for midsize organizations:
Welcome. Changes and Choices
Welcome Changes and Choices Today s Session Thursday, February 23, 2012 Agenda 1. The Fillmore Group Introduction 2. Reasons to Implement Replication 3. IBM s Replication Options How We Got Here 4. The
White Paper May 2009. Seven reports every supply chain executive needs Supply Chain Performance Management with IBM
White Paper May 2009 Seven reports every supply chain executive needs Supply Chain Performance Management with IBM 2 Contents 3 Business problems 3 Business drivers 4 The solution IBM Cognos SCPM Seven
www.ducenit.com Analance Data Integration Technical Whitepaper
Analance Data Integration Technical Whitepaper Executive Summary Business Intelligence is a thriving discipline in the marvelous era of computing in which we live. It s the process of analyzing and exploring
ENTERPRISE EDITION ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR
ORACLE DATA INTEGRATOR ENTERPRISE EDITION KEY FEATURES AND BENEFITS ORACLE DATA INTEGRATOR ENTERPRISE EDITION OFFERS LEADING PERFORMANCE, IMPROVED PRODUCTIVITY, FLEXIBILITY AND LOWEST TOTAL COST OF OWNERSHIP
CASE STUDY LUMIDATA. SQL Toolbelt. Essential tools for SQL Server. 91% of Fortune 100 companies use Red Gate
CASE STUDY LUMIDATA SQL Toolbelt Essential tools for SQL Server 91% of Fortune 100 companies use Red Gate " If you work with SQL Server and don't have SQL Toolbelt, you're likely losing thousands of dollars
Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server
Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server Length : 5 Days Audience(s) : IT Professionals Level : 300 Technology : Microsoft SQL Server 2014 Delivery Method : Instructor-led
Ten Things You Need to Know About Data Virtualization
White Paper Ten Things You Need to Know About Data Virtualization What is Data Virtualization? Data virtualization is an agile data integration method that simplifies information access. Data virtualization
CHAPTER - 5 CONCLUSIONS / IMP. FINDINGS
CHAPTER - 5 CONCLUSIONS / IMP. FINDINGS In today's scenario data warehouse plays a crucial role in order to perform important operations. Different indexing techniques has been used and analyzed using
Implementing efficient system i data integration within your SOA. The Right Time for Real-Time
Implementing efficient system i data integration within your SOA The Right Time for Real-Time Do your operations run 24 hours a day? What happens in case of a disaster? Are you under pressure to protect
Business Intelligence and Service Oriented Architectures. An Oracle White Paper May 2007
Business Intelligence and Service Oriented Architectures An Oracle White Paper May 2007 Note: The following is intended to outline our general product direction. It is intended for information purposes
An Oracle White Paper March 2014. Best Practices for Real-Time Data Warehousing
An Oracle White Paper March 2014 Best Practices for Real-Time Data Warehousing Executive Overview Today s integration project teams face the daunting challenge that, while data volumes are exponentially
Implementing Oracle BI Applications during an ERP Upgrade
Implementing Oracle BI Applications during an ERP Upgrade Summary Jamal Syed BI Practice Lead Emerging solutions 20 N. Wacker Drive Suite 1870 Chicago, IL 60606 Emerging Solutions, a professional services
The ActiveBatch Integrated Jobs Library: Extensions Job Steps. The ActiveBatch Integrated Jobs Library: SSIS Job
IT organizations are managing an expanding array of applications, databases and technologies. Businesses are operating in a real-time world where IT demands are becoming increasingly complex. More advanced
TIBCO StreamBase High Availability Deploy Mission-Critical TIBCO StreamBase Applications in a Fault Tolerant Configuration
TIBCO StreamBase High Availability Deploy Mission-Critical TIBCO StreamBase s in a Fault Tolerant Configuration TIBCO STREAMBASE HIGH AVAILABILITY The TIBCO StreamBase event processing platform provides
Overview Western 12.-13.9.2012 Mariusz Gieparda
Overview Western 12.-13.9.2012 Mariusz Gieparda 1 Corporate Overview Company Global Leader in Business Continuity Easy. Affordable. Innovative. Technology Protection Operational Excellence Compliance Customer
Integrating data in the Information System An Open Source approach
WHITE PAPER Integrating data in the Information System An Open Source approach Table of Contents Most IT Deployments Require Integration... 3 Scenario 1: Data Migration... 4 Scenario 2: e-business Application
Chapter 3 - Data Replication and Materialized Integration
Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 [email protected] Chapter 3 - Data Replication and Materialized Integration Motivation Replication:
Key Attributes for Analytics in an IBM i environment
Key Attributes for Analytics in an IBM i environment Companies worldwide invest millions of dollars in operational applications to improve the way they conduct business. While these systems provide significant
SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box)
SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box) SQL Server White Paper Published: January 2012 Applies to: SQL Server 2012 Summary: This paper explains the different ways in which databases
Getting it Right: How to Find the Right BI Package for the Right Situation Norma Waugh. RMOUG Training Days February 15-17, 2011
Delivering Oracle Success Getting it Right: How to Find the Right BI Package for the Right Situation Norma Waugh RMOUG Training Days February 15-17, 2011 About DBAK Oracle solution provider Co-founded
Knowledge Base Data Warehouse Methodology
Knowledge Base Data Warehouse Methodology Knowledge Base's data warehousing services can help the client with all phases of understanding, designing, implementing, and maintaining a data warehouse. This
ORACLE UTILITIES ANALYTICS
ORACLE UTILITIES ANALYTICS TRANSFORMING COMPLEX DATA INTO BUSINESS VALUE UTILITIES FOCUS ON ANALYTICS Aging infrastructure. Escalating customer expectations. Demand growth. The challenges are many. And
The IBM Cognos Platform
The IBM Cognos Platform Deliver complete, consistent, timely information to all your users, with cost-effective scale Highlights Reach all your information reliably and quickly Deliver a complete, consistent
PATROL From a Database Administrator s Perspective
PATROL From a Database Administrator s Perspective September 28, 2001 Author: Cindy Bean Senior Software Consultant BMC Software, Inc. 3/4/02 2 Table of Contents Introduction 5 Database Administrator Tasks
Implementing a Data Warehouse with Microsoft SQL Server
Course Code: M20463 Vendor: Microsoft Course Overview Duration: 5 RRP: 2,025 Implementing a Data Warehouse with Microsoft SQL Server Overview This course describes how to implement a data warehouse platform
IBM InfoSphere Optim Test Data Management solution for Oracle E-Business Suite
IBM InfoSphere Optim Test Data Management solution for Oracle E-Business Suite Streamline test-data management and deliver reliable application upgrades and enhancements Highlights Apply test-data management
Lowering E-Discovery Costs Through Enterprise Records and Retention Management. An Oracle White Paper March 2007
Lowering E-Discovery Costs Through Enterprise Records and Retention Management An Oracle White Paper March 2007 Lowering E-Discovery Costs Through Enterprise Records and Retention Management Exponential
A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY
A TECHNICAL WHITE PAPER ATTUNITY VISIBILITY Analytics for Enterprise Data Warehouse Management and Optimization Executive Summary Successful enterprise data management is an important initiative for growing
Cincom Business Intelligence Solutions
CincomBI Cincom Business Intelligence Solutions Business Users Overview Find the perfect answers to your strategic business questions. SIMPLIFICATION THROUGH INNOVATION Introduction Being able to make
ORACLE BUSINESS INTELLIGENCE APPLICATIONS FOR JD EDWARDS ENTERPRISEONE
ORACLE BUSINESS INTELLIGENCE APPLICATIONS FOR JD EDWARDS ENTERPRISEONE Organizations with the ability to transform information into action enjoy a strategic advantage over their competitors. Reduced costs,
BENEFITS OF AUTOMATING DATA WAREHOUSING
BENEFITS OF AUTOMATING DATA WAREHOUSING Introduction...2 The Process...2 The Problem...2 The Solution...2 Benefits...2 Background...3 Automating the Data Warehouse with UC4 Workload Automation Suite...3
ORACLE DATA INTEGRATOR ENTERPRISE EDITION
ORACLE DATA INTEGRATOR ENTERPRISE EDITION ORACLE DATA INTEGRATOR ENTERPRISE EDITION KEY FEATURES Out-of-box integration with databases, ERPs, CRMs, B2B systems, flat files, XML data, LDAP, JDBC, ODBC Knowledge
60 TB of Savings in 4 Days: EMC IT s Informatica Data Archive Proof of Concept
60 TB of Savings in 4 Days: EMC IT s Informatica Data Archive Proof of Concept Applied Technology Abstract This white paper illustrates the ability to reduce the data growth challenge seen with EMC s Oracle
Perform-Tools. Powering your performance
Perform-Tools Powering your performance Perform-Tools With Perform-Tools, optimizing Microsoft Dynamics products on a SQL Server platform never was this easy. They are a fully tested and supported set
White Paper. 1 800 FASTFILE / www.ironmountain.ca Page 1
White Paper LIVEVAULT Top 10 Reasons for Using Online Server Backup and Recovery Introduction Backup of vital company information is critical to a company s survival, no matter what size the company. Recent
White Paper February 2009. IBM Cognos Supply Chain Analytics
White Paper February 2009 IBM Cognos Supply Chain Analytics 2 Contents 5 Business problems Perform cross-functional analysis of key supply chain processes 5 Business drivers Supplier Relationship Management
Understanding Oracle BI Applications
Understanding Oracle BI Applications Oracle BI Applications are a complete, end-to-end BI environment covering the Oracle BI EE platform and the prepackaged analytic applications. The Oracle BI Applications
Speeding ETL Processing in Data Warehouses White Paper
Speeding ETL Processing in Data Warehouses White Paper 020607dmxwpADM High-Performance Aggregations and Joins for Faster Data Warehouse Processing Data Processing Challenges... 1 Joins and Aggregates are
Before You Buy: A Checklist for Evaluating Your Analytics Vendor
Executive Report Before You Buy: A Checklist for Evaluating Your Analytics Vendor By Dale Sanders Sr. Vice President Health Catalyst Embarking on an assessment with the knowledge of key, general criteria
Integrating SAP and non-sap data for comprehensive Business Intelligence
WHITE PAPER Integrating SAP and non-sap data for comprehensive Business Intelligence www.barc.de/en Business Application Research Center 2 Integrating SAP and non-sap data Authors Timm Grosser Senior Analyst
