TDWI RESEARCH TDWI CHECKLIST REPORT. Big Data Analytics. By Wayne Eckerson. Sponsored by. tdwi.org
AUGUST

TABLE OF CONTENTS
FOREWORD
NUMBER ONE Educate your users about the drivers of big data analytics.
NUMBER TWO Determine the type of analytics you need.
NUMBER THREE Architect for big data analytics.
NUMBER FOUR Employ in-database analytics.
NUMBER FIVE Don't limit analytics to SQL.
NUMBER SIX Define your requirements before selecting products.
ABOUT OUR SPONSOR
ABOUT THE AUTHOR
ABOUT TDWI RESEARCH

1201 Monster Road SW, Suite 250, Renton, WA
tdwi.org

© 2010 by TDWI (The Data Warehousing Institute™), a division of 1105 Media, Inc. All rights reserved. Reproductions in whole or in part are prohibited except by written permission. Product and company names mentioned herein may be trademarks and/or registered trademarks of their respective companies.
FOREWORD
THE EVOLUTION OF BIG DATA ANALYTICS

There are two major trends causing organizations to rethink the way they approach analytics.

Big data. First, data volumes are exploding. More than a decade ago, I participated in the formation of the Data Warehouse Terabyte Club, which highlighted the few leading-edge organizations whose data warehouses had reached or exceeded a terabyte in size. Today, the notion of a terabyte club seems quaint, as many organizations have blasted through that threshold. In fact, it is now time to start a petabyte club, since a handful of companies, including Internet businesses, banks, and telecommunications companies, have publicly announced that their data warehouses will soon exceed a petabyte of data.

Deep analytics. Second, organizations want to perform deep analytics on these massive data warehouses. Deep analytics ranges from statistics such as moving averages, correlations, and regressions to complex functions such as graph analysis, market basket analysis, and tokenization. In addition, many organizations are embracing predictive analytics by using advanced machine learning algorithms, such as neural networks and decision trees, to anticipate behavior and events. Whereas in the past, organizations may have applied these types of analytics to a subset of data, today they want to analyze every transaction.

The reason: profits. For Internet companies, the goal is to gain insight into how people use their Web sites so they can enhance visitor experiences and provide advertisers with more granular targeted advertising. Telecommunications companies want to mine millions of call detail records to better predict customer churn and profitability. Retailers want to analyze detailed transactions to better understand customer shopping patterns, forecast demand, optimize merchandising, and increase the lift of their promotions. In all cases, there is an opportunity to cut costs, increase revenues, and gain a competitive advantage. Few industries today are immune to the siren song of analyzing big data.

This TDWI Checklist Report is designed to provide a basic set of guidelines for implementing big data analytics. The analytical techniques and data management structures of the past no longer work in this new era of big data. This report will help you take the first steps toward achieving a lasting competitive edge with analytics.

NUMBER ONE EDUCATE YOUR ORGANIZATION ABOUT THE DRIVERS OF BIG DATA ANALYTICS.

There are many reasons organizations are embracing big data analytics.

Data volumes. First, they are accumulating large volumes of data. According to TDWI Research, data warehouse data volumes are expanding rapidly. In 2009, 62% of organizations had less than 3 TB of data in their data warehouses. By 2012, 59% of those organizations estimate they will have more than 3 TB in their data warehouses and 34% said they would have more than 10 TB. (See Figure 1.)

[Figure 1. Current and projected data warehousing data volumes, based on 417 respondents. Source: TDWI Research.]

Data volumes are expanding because the price-performance of data management systems is rapidly improving. This enables organizations to collect more detailed transaction data, including Web clicks, point-of-sale records, and claims data. For example, a telecommunications company can now store call detail records for months or years instead of summarizing and archiving the records after a day or week. Analysts can use this historical detail to better understand traffic patterns and customer behavior, for instance.

In addition, there are new sources of data that organizations would like to bring into the analytical orbit, including social media data (e.g., blogs, tweets, online discussions), sensor data (e.g., RFID chips), GPS data, various devices or appliances (e.g., SmartMeters) that call home, and traditional unstructured data, such as audio, images, video, and text (e.g., Web sites, documents).

Business value. Second, organizations see value in analyzing this detailed information, which encourages them to collect more data. The more data an organization collects, the more patterns and insights it can mine.
For example, AT&T Mobility now calculates the profitability of its 80 million customers nightly, enabling the company to spot critical changes in customer behavior quickly and launch highly targeted marketing campaigns in response. "If you lost 1% of your customers yesterday, wouldn't you like to know today who they are and whether there might be a common theme behind the churn? We can now pinpoint those subscribers and work proactively to win them back," says a director of financial analysis at the company. "Ten years ago, we made assumptions on samples of data and based decisions on gut feel or someone's ability to argue an opinion. Now there is more precision."

Sustainable advantage. Finally, companies see big data analytics as one of the last frontiers of achieving competitive advantage. Analytics offers a sustainable advantage because it harnesses information and intelligence, things that are unique to each organization and cannot be commoditized. The popularity of the book Competing on Analytics by Tom Davenport and Jeanne Harris, which is targeted to business executives, has encouraged organizations to explore how to leverage analytics for competitive advantage.

NUMBER TWO DETERMINE THE TYPE OF ANALYTICS YOU NEED.

There is not a lot of agreement about the definition of analytics. That's because any common term gets hijacked by vendors, consultants, and experts to further their own interests. Part of the confusion stems from the fact that there are two types of analytics.

[Figure 2. Historically, there have been two waves of reporting, each followed by a wave of analytics. Reports are typically a jumping-off point for analytics. The figure plots four stages from the 1980s to the 2010s: reporting (What happened?), analysis (Why did it happen?), monitoring (What's happening?), and prediction (What will happen?).]

Exploration and analysis.
One type of analytics is exploration and analysis. This approach involves navigating historical data in a top-down, deductive manner. To start, an analyst needs to have an idea of what is causing a high-level trend or alert. In other words, the analyst must start with a hypothesis and deduce the cause by exploring the data with ad hoc query tools, OLAP tools, Excel, or SQL. Here, the burden is on the business analyst to sort through large volumes of data and find the needle in the haystack. This type of analytics has been around for a long time and constitutes the bulk of activity done by business analysts.

Prediction and optimization. Another type of analytics is prediction and optimization. Although the algorithms used to power these types of analyses have existed for decades, they have been implemented only by a small number of commercial organizations. Business users model historical data in a bottom-up, inductive manner. They apply data mining tools to create statistical models to identify patterns and trends in the data that can be used to predict the future and optimize business processes. Here, the process is inductive. Rather than starting with a hypothesis, you let the tools discover the trends, patterns, and outliers on your behalf. (However, in reality, it takes some knowledge of the business process and data to apply these tools with reliable accuracy.)
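The contrast between the two styles can be sketched in a few lines of Python. Everything here is an illustrative assumption rather than something taken from the report: the sales records are invented, and an ordinary least-squares line stands in for the statistical model that a data mining tool would build. The first function answers a question the analyst already has in mind; the second lets a fitted model project the trend forward.

```python
from collections import defaultdict

# Hypothetical monthly sales records: (month_index, region, units_sold).
SALES = [
    (1, "east", 100), (1, "west", 80),
    (2, "east", 110), (2, "west", 70),
    (3, "east", 120), (3, "west", 60),
    (4, "east", 130), (4, "west", 50),
]


def units_by_region(rows):
    """Exploration and analysis: slice the data to test a hypothesis,
    for example 'the decline is concentrated in one region'."""
    totals = defaultdict(int)
    for _month, region, units in rows:
        totals[region] += units
    return dict(totals)


def fit_line(points):
    """Prediction: fit y = slope * x + intercept by ordinary least squares."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def forecast(rows, region, month):
    """Project units sold for a region in a future month from its history."""
    points = [(m, u) for m, r, u in rows if r == region]
    slope, intercept = fit_line(points)
    return slope * month + intercept


if __name__ == "__main__":
    print(units_by_region(SALES))
    print(forecast(SALES, "west", 5))
```

Even in this toy case, the parenthetical caveat above applies: someone still has to decide that region and month are the right inputs before the model is of any use.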
NUMBER THREE ARCHITECT FOR BIG DATA ANALYTICS.

A key question facing organizations that want to compete on analytics is how to architect for big data analytics.

Bottlenecks. Today, most companies create data warehouses to store and process data for reporting and analytics. Unfortunately, most data warehouses are already tapped out: they have reached the maximum storage capacity available without an expensive upgrade and can't support complex ad hoc queries without wreaking havoc on performance. In addition, the underlying data warehousing platform (database, server, storage, and network) isn't scalable enough to support new sources of data (internal or external) and maintain adequate query performance.

Meanwhile, analysts typically circumvent the data warehouse when performing their analysis. They use Excel, SAS, or ad hoc query tools to pull data directly from internal and external systems (provided they have access) and load it into a desktop spreadsheet or database, or perhaps a client/server analytical workbench such as SAS. Because of the network bottleneck and limited processing power of desktop or client/server systems, analysts typically analyze only a subset of the data. This approach also forces them to spend precious time cleaning, integrating, and preparing data for analysis, tasks that DW professionals are paid to perform.

Big data analytics architecture. To avoid these limitations, companies need to create a scalable architecture that supports big data analytics from the outset and utilizes existing skills and infrastructure where possible. To do this, many companies are implementing new, specialized analytical platforms designed to accelerate query performance when running complex functions against large volumes of data. Compared to traditional query processing systems, they are easier to install and manage, offering a better total cost of ownership and sometimes costing as little as $10,000 per terabyte.

These systems come in a variety of flavors and sizes. There are data warehousing appliances, which are purpose-built, hardware-software solutions; massively parallel processing (MPP) databases running on commodity servers; columnar databases; and distributed file systems running MapReduce and other non-SQL types of data processing languages. Sometimes companies employ multiple types to address processing requirements. For instance, comScore, an online market research firm, uses Hadoop to acquire and transform Web log files and Aster Data's nCluster database for analysis.

Today, companies typically implement these types of analytical platforms to store and process data that is not currently stored in the data warehouse (e.g., sensor data, Web logs, sentiment data) or offload analytical processing from overloaded data warehouses. In some cases, companies are using these platforms as enterprise data warehouses running multiple workloads, although this is the exception today. These platforms enable users to run queries against all the data they want without having to download a subset of data across the network to their desktops or a local environment. They also dramatically improve query performance and free companies from having to extract, prepare, and manage this data.

Where the analytic platform offloads queries from the data warehouse, it houses a subset of the data in the data warehouse and functions as a dependent data mart or analytics server. Here, the data warehouse team will push or replicate the subset of data to the analytic platform and keep the two systems in sync. However, it's likely that the DW and the analytic platform manage two different data sets and don't require synchronization. The analytic platform will have its own data-loading and preparation processes that are tailored to the data set it manages. Although this requires the company to manage two analytical data platforms and processes, it improves overall efficiency by applying the optimal platform for the workload.
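For the dependent data mart case, where the data warehouse team replicates a subset of data to the analytic platform and keeps the two systems in sync, one common pattern is an incremental copy driven by a high-water mark. The sketch below is only an illustration under stated assumptions: two in-memory SQLite databases stand in for the warehouse and the analytic platform, and the `call_detail` table, its columns, and the `replicate_subset` helper are hypothetical names, not part of any product mentioned in this report.

```python
import sqlite3


def make_db():
    """Create an in-memory database with a hypothetical call_detail table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE call_detail ("
        "id INTEGER PRIMARY KEY, customer TEXT, duration_sec INTEGER)"
    )
    return conn


def replicate_subset(warehouse, analytics, since_id):
    """Copy warehouse rows with id greater than since_id to the analytic side.

    Returns the new high-water mark so the next run copies only newer rows.
    """
    rows = warehouse.execute(
        "SELECT id, customer, duration_sec FROM call_detail "
        "WHERE id > ? ORDER BY id",
        (since_id,),
    ).fetchall()
    # INSERT OR REPLACE keeps a rerun from failing on rows already copied.
    analytics.executemany(
        "INSERT OR REPLACE INTO call_detail (id, customer, duration_sec) "
        "VALUES (?, ?, ?)",
        rows,
    )
    analytics.commit()
    return rows[-1][0] if rows else since_id


if __name__ == "__main__":
    dw, mart = make_db(), make_db()
    dw.executemany(
        "INSERT INTO call_detail VALUES (?, ?, ?)",
        [(1, "a", 60), (2, "b", 120), (3, "a", 30)],
    )
    mark = replicate_subset(dw, mart, 0)
    print(mark, mart.execute("SELECT COUNT(*) FROM call_detail").fetchone()[0])
```

A real deployment would more likely rely on the platform's own bulk loader or replication facility; the point of the sketch is only that the analytic platform receives new rows incrementally instead of a full reload.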
NUMBER FOUR EMPLOY IN-DATABASE ANALYTICS.

As mentioned earlier, business analysts traditionally download data to desktops or local servers to explore, model, calculate, and score data using specialized analytical software, such as from SAS or SPSS. But moving data from source systems to a local environment and back again chews up a lot of networking resources and takes considerable time. As a result, analysts typically download a subset of data or create a sample from the larger data file. Although sampling is often a valid option, most modelers would prefer to work with all the data at a detailed level to optimize model accuracy.

Many companies are now rethinking traditional approaches to performing analytics. Instead of downloading data to local desktops or servers, they are running complex analytics in the database management system itself. This so-called "in-database analytics" minimizes or eliminates data movement, improves query performance, and optimizes model accuracy by enabling analytics to run against all data at a detailed level instead of against samples or summaries. This is particularly useful in the explore phase, when business analysts investigate data sets and prepare them for analytical processing, because now they can address all the data instead of a subset and leverage the processing power of a data center database to execute the transformations.

It's also extremely beneficial in the score stage, when the model or function is applied to all incoming records. With in-database analytics, scoring can execute automatically as new records enter the database rather than in a clumsy two-step process that involves exporting new records to another server and importing and inserting the scores into the appropriate records.

To support in-database analytics, ensure that your database platform makes it easy to embed analytical functions and can run them in an MPP environment with minimal or no rework. Many analytic platform vendors are making a concerted effort to make it easy for administrators to insert custom functions into the database, and many now bundle a variety of predefined functions with their products. In addition, some analytical software vendors, such as SAS Institute, have begun to translate their core analytical functions into SQL extensions or user-defined functions that run natively in various relational databases and analytical platforms.

It's important to recognize that there are different approaches to supporting in-database analytics and not all are created equal. For instance, you should ensure that the approach you choose offers an integrated development environment for development and testing, supports multiple programming languages, provides an extensible API, and integrates with database facilities such as fault tolerance and workload management. Also, understand where the logic runs (inside the database or in a separate execution container) because the architecture of the analytic plug-ins has a major impact on the stability and reliability of the database management system.

Traditionally, IT professionals write stored procedures (SPs) or user-defined functions (UDFs) that run within the database engine to apply complex functions to data. Although database vendors offer varying levels of support for UDFs and SPs, they are often single-threaded functions that don't support MPP workloads and are difficult to write and maintain. If a memory leak or other error occurs, it can bring down a database or corrupt data records. Due to the delicacy of writing functions directly against a database, many UDFs are written by experienced database specialists using database-specific languages.

Fortunately, there are new techniques that promise to make it possible for business analysts, rather than IT professionals, to custom-code database functions that run in a parallel environment. For example, MapReduce was pioneered by Google to run custom analytics against its massive distributed computing network so that it can better understand Web site activity and user behavior. Aster Data, which is sponsoring this report, offers a version of MapReduce that runs in its nCluster analytic database, supports multiple languages, and can be invoked via SQL. This makes it easier for developers to create custom functions that automatically run in a parallel environment without additional programming. And because MapReduce runs in a separate execution container, it offers better fault tolerance than previous methods of inserting custom functions into a database.
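To make the scoring idea concrete, here is a minimal sketch of a user-defined function that scores rows where they live instead of exporting them. It uses SQLite through Python's standard `sqlite3` module purely as a stand-in, so it runs in a single process and is not an MPP platform. The `customers` table, the `churn_score` function, and its coefficients are all invented for illustration and do not come from any vendor's product.

```python
import math
import sqlite3


def churn_score(minutes_used, support_calls):
    """Hypothetical logistic scoring function with made-up coefficients."""
    z = -1.0 - 0.01 * minutes_used + 0.8 * support_calls
    return 1.0 / (1.0 + math.exp(-z))


def build_demo_db():
    """Create an in-memory database holding a few invented customer rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE customers ("
        "id INTEGER PRIMARY KEY, minutes_used REAL, "
        "support_calls INTEGER, score REAL)"
    )
    conn.executemany(
        "INSERT INTO customers (id, minutes_used, support_calls) "
        "VALUES (?, ?, ?)",
        [(1, 500.0, 0), (2, 50.0, 4), (3, 200.0, 1)],
    )
    return conn


def score_in_database(conn):
    """Register the function with the engine and score every row in place."""
    # After registration, SQL statements can call churn_score by name.
    conn.create_function("churn_score", 2, churn_score)
    # One statement scores all rows; nothing is exported or re-imported.
    conn.execute(
        "UPDATE customers SET score = churn_score(minutes_used, support_calls)"
    )
    conn.commit()


if __name__ == "__main__":
    db = build_demo_db()
    score_in_database(db)
    for row in db.execute("SELECT id, score FROM customers ORDER BY id"):
        print(row)
```

The design point is the one made above: the function travels to the data. Whether such a function also runs in parallel, and whether a fault in it can affect the database engine, depends on the platform's architecture, which this sketch does not model.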
NUMBER FIVE DON'T LIMIT ANALYTICS TO SQL.

Many analytic computations are recursive in nature, which requires multiple passes through the database. Such computations are difficult to write in SQL and expensive to run in a database management system. Thus, today most analysts first run SQL queries to create a data set, which they download to another platform, and then run a procedural program written in Java, C, or some other language against the data set. Next, they often load the results of their analysis back into the original database.

This two-step process is time-consuming, expensive, and frustrating. One architect says, "I do not like having to switch back and forth between Java and SQL. It is especially frustrating when business logic is spread across the languages. If I can have one language that does it all, it's a big win."

Fortunately, techniques like MapReduce make it possible for business analysts, rather than IT professionals, to custom-code database functions that run in a parallel environment. Aster Data has integrated MapReduce into its SQL-MapReduce framework, which lets analysts write reusable functions in almost any language they desire (Python, Java, C, C++, Perl) and invoke them with simple SQL calls. And out of the box, Aster Data bundles a number of predefined MapReduce functions in its nCluster database management system, including market basket analysis, time-series analysis, sessionization, and various statistical functions. It also offers an API for customers and partners to create their own MapReduce functions and embed them in the database.

Clearly, as analytical tasks increase in complexity, developers will need to apply the appropriate tool for each task. No longer will SQL be the only hammer in a developer's arsenal. With embedded functions, new analytical databases will accelerate the development and deployment of complex analytics against big data.

NUMBER SIX DEFINE YOUR REQUIREMENTS BEFORE SELECTING PRODUCTS.

Know your users.
First, understand who's performing the analysis, what type of analysis it is, and how much data they require. If a power user wants to explore departmental data, then all they might need is an ad hoc query or OLAP tool and a data mart. If it's an IT person creating a complex standard report with sophisticated metrics and functions, then it's likely they can use a scalable BI tool running against an enterprise data warehouse. If a business analyst wants to run ad hoc queries or apply complex analytical functions against large volumes of detailed data without DBA assistance, then you probably need a specialized analytical database that supports analytical functions.

Performance and scalability. Second, understand your performance and scalability requirements. What query response times are required to make various types of analyses worth doing? If you have to wait days for a result set, then you either need to upgrade your existing data warehousing environment, offload these queries to a specialized analytical platform, or reduce the amount of data by aggregating data or reducing the time span of the analysis. Also, how many concurrent users do you need to support? Many BI tools and databases experience bottlenecks with memory or threading as the number of concurrent users increases. Adding another database platform adds complexity to an existing architecture, but it may be worth doing.

In-database analytics. Third, evaluate your need for in-database analytics. If complex models or analytics drive a critical portion of your business, then it's likely you can benefit from creating and scoring these models in the DW rather than a secondary system. If so, you'll need to evaluate the combination of your analytical software and DW database. If you are using an analytical package such as SAS, SPSS, or R, then understand which functions can run inside a database and how.
Do you have to rewrite functions in SQL or C, or can they be converted automatically to a plug-in function in your database of choice? If you are hand-coding functions, then evaluate whether the database supports the programming languages your analysts prefer. Finally, make sure the database can run the analytical functions in parallel without extra programming.

Other. Finally, investigate whether the analytic database integrates with existing tools in your environment, such as ETL, scheduling, and BI tools. If you plan to use it as an enterprise data warehouse replacement, find out how well it supports mixed workloads, including tactical queries, strategic queries, and inserts, updates, and deletes. Also, find out whether the system meets your data center standards for encryption, security, monitoring, backup/restore, and disaster recovery. Most important, you want to know whether or to what degree you will need to rewrite any existing applications to run on the new system.
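When evaluating whether hand-coded functions fit a platform, it helps to see what a non-SQL function of the kind discussed under Number Five looks like. The sketch below expresses sessionization, one of the predefined functions named there, as a map step, a grouping step, and a reduce step in plain Python. It is an assumption-laden illustration: it runs in one process, it is not Aster Data's SQL-MapReduce API, and the 30-minute inactivity gap and the event format are arbitrary choices made for the example.

```python
from collections import defaultdict

SESSION_GAP_SECONDS = 30 * 60  # assumed inactivity timeout, not from the report


def map_phase(events):
    """Map: emit one (user_id, timestamp) pair per click event."""
    for user_id, timestamp in events:
        yield user_id, timestamp


def shuffle(pairs):
    """Group mapped values by key, as a MapReduce framework would do."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(user_id, timestamps):
    """Reduce: split one user's clicks into sessions at each long gap."""
    sessions = []
    for ts in sorted(timestamps):
        if sessions and ts - sessions[-1][-1] <= SESSION_GAP_SECONDS:
            sessions[-1].append(ts)  # still within the current session
        else:
            sessions.append([ts])    # gap exceeded: start a new session
    return user_id, sessions


def sessionize(events):
    """Run map, shuffle, and reduce over (user_id, timestamp) events."""
    grouped = shuffle(map_phase(events))
    return dict(reduce_phase(user, stamps) for user, stamps in grouped.items())


if __name__ == "__main__":
    clicks = [("a", 0), ("a", 600), ("b", 100), ("a", 4000)]
    print(sessionize(clicks))
```

Each user's clicks are reduced independently of every other user's, which is what allows a framework to spread the reduce step across many nodes. The same ordered, stateful pass over each user's rows is awkward to express in plain SQL, which is the argument for not limiting analytics to SQL.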
ABOUT OUR SPONSOR

Aster Data is a proven leader in big data management and big data analysis for data-driven applications. Aster Data's nCluster is the first MPP data warehouse architecture that allows applications to be fully embedded within the database engine to enable ultra-fast, deep analysis of massive data sets. Aster Data's unique "applications-within" approach allows application logic to exist and execute with the data itself. Termed a "data-analytics server," Aster Data's solution effectively utilizes Aster's patent-pending SQL-MapReduce together with parallelized data processing and applications to address the big data challenge. Companies using Aster Data include Coremetrics, MySpace, comScore, Akamai, Full Tilt Poker, and ShareThis. Aster Data is headquartered in San Carlos, CA, and is backed by Sequoia Capital, JAFCO Ventures, IVP, and Cambrian Ventures, as well as industry visionaries such as David Cheriton, Ron Conway, and Rajeev Motwani.

ABOUT THE TDWI CHECKLIST REPORT SERIES

TDWI Checklist Reports provide an overview of success factors for specific projects in business intelligence, data warehousing, or related data management disciplines. Companies may use this overview to get organized before beginning a project or to identify goals and areas of improvement for current projects.

ABOUT THE AUTHOR

Wayne Eckerson is the director of TDWI Research. Eckerson is an industry analyst and educator who has covered DW and BI. Eckerson is the author of many in-depth, groundbreaking reports, a columnist for several business technology magazines, and a noted speaker and consultant. He is the author of Performance Dashboards: Measuring, Monitoring, and Managing Your Business (John Wiley & Sons, 2005) and the creator of TDWI's BI Maturity Model and Benchmarking Assessment service.

ABOUT TDWI RESEARCH

TDWI Research provides research and advice for BI professionals worldwide.
TDWI Research focuses exclusively on BI/DW issues and teams up with industry practitioners to deliver both broad and deep understanding of the business and technical issues surrounding the deployment of business intelligence and data warehousing solutions. TDWI Research offers reports, commentary, and inquiry services via a worldwide Membership program and provides custom research, benchmarking, and strategic planning services to user and vendor organizations.
Extend your analytic capabilities with SAP Predictive Analysis
September 9 11, 2013 Anaheim, California Extend your analytic capabilities with SAP Predictive Analysis Charles Gadalla Learning Points Advanced analytics strategy at SAP Simplifying predictive analytics
DATAMEER WHITE PAPER. Beyond BI. Big Data Analytic Use Cases
DATAMEER WHITE PAPER Beyond BI Big Data Analytic Use Cases This white paper discusses the types and characteristics of big data analytics use cases, how they differ from traditional business intelligence
In-Memory Analytics for Big Data
In-Memory Analytics for Big Data Game-changing technology for faster, better insights WHITE PAPER SAS White Paper Table of Contents Introduction: A New Breed of Analytics... 1 SAS In-Memory Overview...
Using Tableau Software with Hortonworks Data Platform
Using Tableau Software with Hortonworks Data Platform September 2013 2013 Hortonworks Inc. http:// Modern businesses need to manage vast amounts of data, and in many cases they have accumulated this data
INVESTOR PRESENTATION. First Quarter 2014
INVESTOR PRESENTATION First Quarter 2014 Note to Investors Certain non-GAAP financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences
2015 Ironside Group, Inc. 2
2015 Ironside Group, Inc. 2 Introduction to Ironside What is Cloud, Really? Why Cloud for Data Warehousing? Intro to IBM PureData for Analytics (IPDA) IBM PureData for Analytics on Cloud Intro to IBM dashdb
BEYOND BI: Big Data Analytic Use Cases
BEYOND BI: Big Data Analytic Use Cases Big Data Analytics Use Cases This white paper discusses the types and characteristics of big data analytics use cases, how they differ from traditional business intelligence
How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW
How to make BIG DATA work for you. Faster results with Microsoft SQL Server PDW Roger Breu PDW Solution Specialist Microsoft Western Europe Marcus Gullberg PDW Partner Account Manager Microsoft Sweden
Ten Mistakes to Avoid
EXCLUSIVELY FOR TDWI PREMIUM MEMBERS TDWI RESEARCH SECOND QUARTER 2014 Ten Mistakes to Avoid In Big Data Analytics Projects By Fern Halper tdwi.org Ten Mistakes to Avoid In Big Data Analytics Projects
Evolving Data Warehouse Architectures
Evolving Data Warehouse Architectures In the Age of Big Data Philip Russom April 15, 2014 TDWI would like to thank the following companies for sponsoring the 2014 TDWI Best Practices research report: Evolving
The 4 Pillars of Technosoft's Big Data Practice
beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft's Big Data Practice Overview Businesses have long managed
Introducing Oracle Exalytics In-Memory Machine
Introducing Oracle Exalytics In-Memory Machine Jon Ainsworth Director of Business Development Oracle EMEA Business Analytics 1 Copyright 2011, Oracle and/or its affiliates. All rights Agenda Topics Oracle
An Enterprise Framework for Business Intelligence
An Enterprise Framework for Business Intelligence Colin White BI Research May 2009 Sponsored by Oracle Corporation TABLE OF CONTENTS AN ENTERPRISE FRAMEWORK FOR BUSINESS INTELLIGENCE 1 THE BI PROCESSING
DATA REPLICATION FOR REAL-TIME DATA WAREHOUSING AND ANALYTICS
TDWI RESEARCH TDWI CHECKLIST REPORT DATA REPLICATION FOR REAL-TIME DATA WAREHOUSING AND ANALYTICS By Philip Russom Sponsored by tdwi.org APRIL 2012 TDWI CHECKLIST REPORT DATA REPLICATION FOR REAL-TIME
Discovering Business Insights in Big Data Using SQL-MapReduce
Discovering Business Insights in Big Data Using SQL-MapReduce A Technical Whitepaper Rick F. van der Lans Independent Business Intelligence Analyst R20/Consultancy July 2013 Sponsored by Copyright 2013
Information Architecture
The Bloor Group Actian and The Big Data Information Architecture WHITE PAPER The Actian Big Data Information Architecture Actian and The Big Data Information Architecture Originally founded in 2005 to
SQL Server 2005 Features Comparison
WHITE PAPER Business Intelligence Solutions from the Microsoft and Teradata Partnership
WHITE PAPER Business Intelligence Solutions from the Microsoft and Teradata Partnership Sponsored by: Microsoft and Teradata Dan Vesset October 2008 Brian McDonough Global Headquarters:
Unlock the business value of enterprise data with in-database analytics
Unlock the business value of enterprise data with in-database analytics Achieve better business results through faster, more accurate decisions White Paper Table of Contents Executive summary...1 How can
Solve your toughest challenges with data mining
IBM Software IBM SPSS Modeler Solve your toughest challenges with data mining Use predictive intelligence to make good decisions faster Solve your toughest challenges with data mining Imagine if you could
Picturing Performance: IBM Cognos dashboards and scorecards for retail
IBM Software Group White Paper Retail Picturing Performance: IBM Cognos dashboards and scorecards for retail 2 Picturing Performance: IBM Cognos dashboards and scorecards for retail Abstract More and more,
The Future of Data Management
The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class
SAS and Teradata Partnership
SAS and Teradata Partnership Ed Swain Senior Industry Consultant Energy & Resources [email protected] 1 Innovation and Leadership Teradata SAS Magic Quadrant for Data Warehouse Database Management
BIG Data Analytics Move to Competitive Advantage
BIG Data Analytics Move to Competitive Advantage where is technology heading today Standardization Open Source Automation Scalability Cloud Computing Mobility Smartphones/ tablets Internet of Things Wireless
How To Use HP Vertica OnDemand
Data sheet HP Vertica OnDemand Enterprise-class Big Data analytics in the cloud Enterprise-class Big Data analytics for any size organization Vertica OnDemand Organizations today are experiencing a greater
Technical white paper. R you ready? Turning big data into big value with the HP Vertica Analytics Platform and R
Technical white paper R you ready? Turning big data into big value with the HP Vertica Analytics Platform and R Table of contents Executive summary The data mining challenge Data mining implementation
Business Intelligence Solutions for Gaming and Hospitality
Business Intelligence Solutions for Gaming and Hospitality Prepared by: Mario Perkins Qualex Consulting Services, Inc. Suzanne Fiero SAS Objective Summary 2 Objective Summary The rise in popularity and
Ramesh Bhashyam Teradata Fellow Teradata Corporation [email protected]
Challenges of Handling Big Data Ramesh Bhashyam Teradata Fellow Teradata Corporation [email protected] Trend Too much information is a storage issue, certainly, but too much information is also
Investor Presentation. Second Quarter 2015
Investor Presentation Second Quarter 2015 Note to Investors Certain non-GAAP financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences
Are You Ready for Big Data?
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
Big Data on Microsoft Platform
Big Data on Microsoft Platform Prepared by GJ Srinivas Corporate TEG - Microsoft Page 1 Contents 1. What is Big Data?...3 2. Characteristics of Big Data...3 3. Enter Hadoop...3 4. Microsoft Big Data Solutions...4
BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES
BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES Relational vs. Non-Relational Architecture Relational Non-Relational Rational Predictable Traditional Agile Flexible Modern 2 Agenda Big Data
A Next-Generation Analytics Ecosystem for Big Data. Colin White, BI Research September 2012 Sponsored by ParAccel
A Next-Generation Analytics Ecosystem for Big Data Colin White, BI Research September 2012 Sponsored by ParAccel BIG DATA IS BIG NEWS The value of big data lies in the business analytics that can be generated
Up Your R Game. James Taylor, Decision Management Solutions Bill Franks, Teradata
Up Your R Game James Taylor, Decision Management Solutions Bill Franks, Teradata Today's Speakers James Taylor Bill Franks CEO Chief Analytics Officer Decision Management Solutions Teradata 7/28/14 3 Polling
EMC ADVERTISING ANALYTICS SERVICE FOR MEDIA & ENTERTAINMENT
EMC ADVERTISING ANALYTICS SERVICE FOR MEDIA & ENTERTAINMENT Leveraging analytics for actionable insight ESSENTIALS Put your Big Data to work for you Pick the best-fit, priority business opportunity and
5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
UNIFY YOUR (BIG) DATA
UNIFY YOUR (BIG) DATA ANALYTIC STRATEGY GIVE ANY USER ANY ANALYTIC ON ANY DATA Scott Gnau President, Teradata Labs [email protected] t Unify Your (Big) Data Analytic Strategy Technology excitement:
BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining
BUSINESS INTELLIGENCE Bogdan Mohor Dumitrita 1 Abstract A Business Intelligence (BI)-driven approach can be very effective in implementing business transformation programs within an enterprise framework.
BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata
BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING
Teradata's Big Data Technology Strategy & Roadmap
Teradata's Big Data Technology Strategy & Roadmap Artur Borycki, Director International Solutions Marketing 18 March 2014 Agenda > Introduction and level-set > Enabling the Logical Data Warehouse > Any
Five Technology Trends for Improved Business Intelligence Performance
TechTarget Enterprise Applications Media E-Book Five Technology Trends for Improved Business Intelligence Performance The demand for business intelligence data only continues to increase, putting BI vendors
AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW
AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zettabytes (10^21 bytes) this
Genesee Health System RFI-Business Intelligence & Analytics with Dashboard Reporting Questions and Answers
Genesee Health System RFI-Business Intelligence & Analytics with Dashboard Reporting Questions and Answers 1. Is there any other information required other than that listed in Section II? Respondents must
Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence
Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Appliances and DW Architectures John O'Brien President and Executive Architect Zukeran Technologies 1 TDWI 1 Agenda What
White Paper. EMC Isilon: A Scalable Storage Platform for Big Data. April 2014
White Paper EMC Isilon: A Scalable Storage Platform for Big Data By Nik Rouda, Senior Analyst and Terri McClure, Senior Analyst April 2014 This ESG White Paper was commissioned by EMC Isilon and is distributed
Parallel Data Warehouse
MICROSOFT'S ANALYTICS SOLUTIONS WITH PARALLEL DATA WAREHOUSE Parallel Data Warehouse Stefan Cronjaeger Microsoft May 2013 AGENDA PDW overview Columnstore and Big Data Business Intelligence Project Ability
Navigating Big Data business analytics
mwd a d v i s o r s Navigating Big Data business analytics Helena Schwenk A special report prepared for Actuate May 2013 This report is the third in a series and focuses principally on explaining what
1 Performance Moves to the Forefront for Data Warehouse Initiatives. 2 Real-Time Data Gets Real
Top 10 Data Warehouse Trends for 2013 What are the most compelling trends in storage and data warehousing that motivate IT leaders to undertake new initiatives? Which ideas, solutions, and technologies
The Next Wave of Data Management. Is Big Data The New Normal?
The Next Wave of Data Management Is Big Data The New Normal? Table of Contents Introduction 3 Separating Reality and Hype 3 Why Are Firms Making IT Investments In Big Data? 4 Trends In Data Management
Ten Mistakes to Avoid When Creating Performance Dashboards
Ten Mistakes to Avoid When Creating Performance Dashboards Wayne W. Eckerson Wayne W. Eckerson is the director of research and services for TDWI, a worldwide association of business intelligence and data
JAVASCRIPT CHARTING. Scaling for the Enterprise with Metric Insights. 2013 Copyright Metric Insights, Inc.
JAVASCRIPT CHARTING Scaling for the Enterprise with Metric Insights 2013 Copyright Metric Insights, Inc. A REVOLUTION IS HAPPENING... 3 Challenges... 3 Borrowing From The Enterprise BI Stack... 4 Visualization
Infor10 Corporate Performance Management (PM10)
Infor10 Corporate Performance Management (PM10) Deliver better information on demand. The speed, complexity, and global nature of today's business environment present challenges for even the best-managed
