Harnessing the power of advanced analytics with IBM Netezza



Similar documents
WHITE PAPER. Harnessing the Power of Advanced Analytics How an appliance approach simplifies the use of advanced analytics

High-Performance Business Analytics: SAS and IBM Netezza Data Warehouse Appliances

IBM Netezza High Capacity Appliance

Solutions for Communications with IBM Netezza Network Analytics Accelerator

Datalogix. Using IBM Netezza data warehouse appliances to drive online sales with offline data. Overview. IBM Software Information Management

IBM Netezza High-performance business intelligence and advanced analytics for the enterprise. The analytics conundrum

A financial software company

IBM Analytical Decision Management

IBM Netezza Analytics

IBM Cognos 10: Enhancing query processing performance for IBM Netezza appliances

Predictive analytics with System z

Con-way Freight. Leveraging best-of-breed business intelligence for customer satisfaction. Overview. Before: a company with a vision

Answers to Top BRMS Questions

Network Analytics: Turn Big Data into Big Opportunity

IBM SPSS Modeler Professional

Solve your toughest challenges with data mining

Solve your toughest challenges with data mining

Kelley Blue Book. Increases ad revenue with better, faster data analysis and ad price optimization. Overview. Analytics: the heart of KBB s strategy

IBM Netezza Analytics

The IBM Cognos Platform for Enterprise Business Intelligence

The top 10 secrets to using data mining to succeed at CRM

IBM Global Business Services Microsoft Dynamics AX solutions from IBM

Focus on the business, not the business of data warehousing!

Three proven methods to achieve a higher ROI from data mining

Big Data Defined Introducing DataStack 3.0

IBM Global Business Services Microsoft Dynamics CRM solutions from IBM

MATLAB in Business Critical Applications Arvind Hosagrahara Principal Technical Consultant

IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems

IBM Analytics. Just the facts: Four critical concepts for planning the logical data warehouse

Advanced In-Database Analytics

INVESTOR PRESENTATION. First Quarter 2014

Software development for the on demand enterprise. Building your business with the IBM Software Development Platform

2015 Ironside Group, Inc. 2

Customer Insight Appliance. Enabling retailers to understand and serve their customer

Better planning and forecasting with IBM Predictive Analytics

The IBM Cognos family

IBM Cognos Enterprise: Powerful and scalable business intelligence and performance management

Tapping the benefits of business analytics and optimization

IBM SPSS Modeler Premium

IBM Data Warehousing and Analytics Portfolio Summary

IBM InfoSphere Optim Test Data Management

IBM DB2 Near-Line Storage Solution for SAP NetWeaver BW

IBM SPSS Direct Marketing

ETPL Extract, Transform, Predict and Load

Name: Srinivasan Govindaraj Title: Big Data Predictive Analytics

Five predictive imperatives for maximizing customer value

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Solve Your Toughest Challenges with Data Mining

Getting the most out of big data

IBM Coremetrics Web Analytics

How To Use Big Data To Help A Retailer

SAS and Teradata Partnership

The IBM Solution Architecture for Energy and Utilities Framework

A business intelligence agenda for midsize organizations: Six strategies for success

IBM Business Analytics: Finance and Integrated Risk Management (FIRM) solution

IBM Software Enabling business agility through real-time process visibility

IBM System x reference architecture solutions for big data

In-Database Analytics

UNLEASHING THE VALUE OF THE TERADATA UNIFIED DATA ARCHITECTURE WITH ALTERYX

Working with telecommunications

Predictive Analytics

Optimize workloads to achieve success with cloud and big data

IBM Cognos Performance Management Solutions for Oracle

Making critical connections: predictive analytics in government

IBM Cognos Insight. Independently explore, visualize, model and share insights without IT assistance. Highlights. IBM Software Business Analytics

Beyond listening Driving better decisions with business intelligence from social sources

Predictive Maintenance for Government

BLACKICE ERA and PureData System for Analytics

IBM SmartCloud for Service Providers

Next Generation Data Warehousing Appliances

HP and Business Objects Transforming information into intelligence

SPSS Modeler Integration with IBM DB2 Analytics Accelerator

Simplifying development through activity-based change management

Advanced Big Data Analytics with R and Hadoop

Build an effective data integration strategy to drive innovation

ORACLE BUSINESS INTELLIGENCE, ORACLE DATABASE, AND EXADATA INTEGRATION

Manage student performance in real time

Driving workload automation across the enterprise

Lands End implements customer-centric strategies

INVESTOR PRESENTATION. Third Quarter 2014

Three Open Blueprints For Big Data Success

Age of Analytics: Competing in the 21 st Century

IBM Endpoint Manager for Server Automation

Managing your error-prone spreadsheets

Data-Driven Decisions: Role of Operations Research in Business Analytics

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

Automated file management with IBM Active Cloud Engine

Optimization applications in finance, securities, banking and insurance

Extend your analytic capabilities with SAP Predictive Analysis

Predictive Analytics for Donor Management

A Next-Generation Analytics Ecosystem for Big Data. Colin White, BI Research September 2012 Sponsored by ParAccel

ORACLE TAX ANALYTICS. The Solution. Oracle Tax Data Model KEY FEATURES

In-Memory Analytics for Big Data

UNIFY YOUR (BIG) DATA

Data virtualization: Delivering on-demand access to information throughout the enterprise

Transcription:

IBM Software Information Management White Paper Harnessing the power of advanced analytics with IBM Netezza How an appliance approach simplifies the use of advanced analytics

Harnessing the power of advanced analytics with IBM Netezza Contents 2 Introduction 3 What does IBM Netezza Analytics do for me? 4 How do we use IBM Netezza Analytics? Data preparation and transformation Creating and publishing a model Model Scoring Algorithm development 7 Summary 7 About Netezza, an IBM Company Introduction The IBM Netezza data warehouse appliance with IBM Netezza Analytics pushes the limits of analytics by fusing our groundbreaking data warehouse appliance with high performance, scalable analytics that can process massive data to solve complex problems orders of magnitude faster than typical solutions. IBM Netezza Analytics is an open and flexible advanced embedded analytics platform delivered with every IBM Netezza appliance that enables the development and deployment of in-database analytics to drive game changing results. By embedding analytics into our appliances and making them an integral part of the appliance, IBM has simplified the deployment of analytics while fully capitalizing on the processing throughput of IBM Netezza s underlying architecture. With IBM Netezza Analytics, parallelized analytics for data preparation, data mining, predictive modeling and optimization can exploit the IBM Netezza asymmetric massively parallel processing (AMPP ) architecture to achieve high throughput of advanced analytics on huge data. IBM Netezza data warehouse appliances can easily be extended to create your own powerful, advanced analytics and embed them into the appliance. Analytic applications, visualization tools and business intelligence tools can harness the parallelized advanced analytics via a variety of programming methods such as SQL, Java, MapReduce, Python, R, C, C++ and Fortran to deliver powerful, insightful analytics. The comprehensive advanced analytics environment makes it easy to have it your way using your preferred tools for ad-hoc analysis, prototyping and production deployment of advanced analytics. 2

Information Management Figure 1: IBM Netezza Analytics architecture What does IBM Netezza Analytics do for me? IBM Netezza Analytics exploits the convergence of huge data and parallel processing to enable high performance, scalable analytics applications. The Asymmetric Massively Parallel Processing (AMPP ) architecture of the IBM Netezza appliance enables analytics to capitalize on data and task parallelism to deliver blisteringly fast insightful results. By tightly integrating the analytics with our unique AMPP architecture, IBM has pushed the technology envelope to deliver faster performance with the ability to process data that was impossible to process previously. This scaling for both data volume and computational complexity allows statisticians, modelers and developers to use the right tool at the right time to discover high value opportunities. 3

Harnessing the power of advanced analytics with IBM Netezza Today it is commonplace to copy data from a data warehouse in order to process it for analytics which results in longer development cycle times and higher software maintenance costs. This includes programs to extract, transform and load data between the data warehouse and other platforms. IBM Netezza data warehouse appliance with IBM Netezza Analytics simplifies the process by moving the analytics processing next to the data inside the data warehouse, eliminating or significantly reducing these non-value added steps. By simplifying the environment, analysts can focus on higher value added activities such as modeling, simulation and optimization that can drive significant value for their organization. IBM Netezza Analytics incorporates a rich set of advanced analytics from IBM, our partners and the open source community. Application developers and modelers can use the right combination of advanced analytics for their unique situation to design and develop a comprehensive solution to their unique challenges. Companies can leverage the highperformance, parallelized analytics as a starting point to create their own secret sauce or competitive edge by creating their own high performance analytics or by leveraging the analytics to more quickly develop analytic applications. For example, in CRM where there are a huge number of transactions and a high degree of variability among customers, customer segmentation is often used to identify high profit customers and predictive modeling can be used to determine how to convert low profit customers into high profit customers. Modeling and optimization can be combined to determine the best design, messaging and promotion to offer to customers to increase response rates. Data mining and predictive modeling can be used to identify and sustain loyal customers as well as approaches to increase customer loyalty with the appropriate incentives. How do we use IBM Netezza Analytics? IBM Netezza Analytics is an advanced embedded analytic platform purpose built for serious analytics. IBM Netezza Analytics is used for: Building and deploying advanced analytic applications Leveraging parallel analytics via visualization or business intelligence tools Performing ad-hoc analysis especially on huge data or computational intensive problems Visualization and business intelligence tools leverage the analytics in IBM Netezza Analytics via SQL for targeted inquiries. However, application developers and modelers require more flexibility to develop applications or ad-hoc analysis including: Data preparation and transformation Model creation and publishing Model scoring Algorithm development Data preparation and transformation An analyst performing data prep and transformation activities can use IBM Netezza Analytics tools and utilities to aid in the process. Such tools include: Descriptive statistics Data cleansing Data transformation Feature selection and dimension reduction Statistical testing and utilities When data prep and transformation are necessary, IBM speeds up the process with high performance packages while scaling petabytes of user data that can be stored on the appliance. 4

Information Management Creating and publishing an analytic model Creating an analytic model is typically an iterative process. The data used in building the model may be prepared and transformed by the modeler (data miner, statistician or quantitative analyst) who prepares the data for the modeler. The modeler may start off performing an ad-hoc analysis by request of some executive. Once the results are reviewed, the modeler may create a refinement that is more of a prototype for eventual production deployment. The modeler oftentimes develops multiple models during this process and evaluates and compares the performance of each of the models before selecting a model for final deployment. At each stage in the iterative development cycle, the modeler may use different development tools and analytics depending on the evolving requirements and performance targets. IBM Netezza Analytics offers flexibility at each stage of this process. The modeler can create their model using one of several supported development environments such as Eclipse, R GUI or directly using: IBM s plug-in for Eclipse IBM s plug-in for the R GUI Partner s integrated development environment (IDE) Partner s analytic development environment (ADE ) to create an analytic workflow Any of the IBM Netezza supported programming languages SQL, Java, MapReduce, Python, R, Fortran, C or C++ The model can use any combination of advanced analytics including: Open source analytics R CRAN packages which are easily installed via IBM Netezza Analytics Partner analytics Industry proven analytics that can be leveraged in application development In-database analytics Our parallelized, high performance in-database analytics for data prep, data mining, predictive and geospatial analytics Once a model has been used to predict future outcomes, it can be used to automate optimal decisions by combining optimization along with the predictive modeling. By embedding optimization, the application can solve tough, complicated business problems such as: Minimizing shipping costs at the same time as minimizing inventory Minimizing capital investment at the same time as minimizing the risk of an entire portfolio Maximizing revenue at the same time as maximizing customer loyalty Minimizing marketing budgets at the same time as maximizing lead generation This is done while taking into account diverse real world constraints including budget, staffing, customer preferences and behaviors, company preferences and many others. 5

Harnessing the power of advanced analytics with IBM Netezza Model scoring Once a model is built, the model can be applied to the voluminous data typically associated with a predictive model. This process is referred to as scoring. Examples of scoring include applying the model to millions of records to: Detect fraud Calculate credit scores Re-price customer accounts Predict future customer purchases Decide who should be sent various marketing promotions Recommend whether credit lines should be increased/ decreased Conclude whether delinquent accounts should be sold off Determine product warranty strategies based on field support calls Determine the appropriate customer promotions to increase sales Algorithm development The algorithm developer has many development tool choices including integrated development environments (IDE) for a GUI-based approach to development or programming languages. IBM provides a plug-in for Eclipse, an open source IDE. Additionally, IBM partners provide IDEs that can speed up the algorithm development cycle. IBM supports the R GUI for development of algorithms based on the R language. IBM supports several programming languages for algorithm development including Java, MapReduce, Python, R, Fortran, C, and C++ as well as the User Defined Extensions. User Defined Extensions include: UDFs user defined functions process a row UDAs user defined aggregates process a group of rows UDTFs user defined table functions return any size table UDAPs user defined analytic processes that provide complete flexibility to perform one or more processing steps Stored Procedures used to group functions, aggregates and other logic into procedures After an algorithm is developed and tested it can be registered on the appliance which makes the algorithm as accessible as any other in-database analytic via the supported programming languages. 6

Information Management Summary The IBM Netezza data warehouse appliance with IBM Netezza Analytics is a powerful and extendable advanced analytics platform that simplifies the development, deployment and use of advanced analytics while delivering unmatched performance and scalability. IBM Netezza, partner, open source and your own advanced analytics can be combined and exploited via SQL, Java, MapReduce, Python, R, Fortran, C or C++ to deliver game changing results for your business by harnessing the high performance parallel architecture of the IBM Netezza data warehouse appliance. About IBM Netezza IBM Netezza pioneered the data warehouse appliance space by integrating database, server and storage into a single, easy to manage appliance that requires minimal set-up and ongoing administration while producing faster and more consistent analytic performance. The newest data warehouse appliance, the IBM Netezza 1000, simplifies business analytics dramatically by consolidating all analytic activity in the appliance, right where the data resides, for blisteringly fast performance. Visit netezza.com or thinking.netezza.com to see how our data warehouse appliance eliminates complexity at every step and allows you to drive true business value for your organization. About IBM Data Warehousing and Analytics Solutions IBM provides the broadest and most comprehensive portfolio of data warehousing, information management and business analytic software, hardware and solutions to help customers maximize the value of their information assets and discover new insights to make better and faster decisions and optimize their business outcomes. For more information To learn more about the IBM Data Warehousing and Analytics Solutions, please contact your IBM sales representative or visit http://www.ibm.com/software/data/netezza/ 7

Copyright IBM Corporation 2011 IBM Corporation Software Group Route 100 Somers, NY 10589 U.S.A. Produced in the United States of America July 2011 All Rights Reserved IBM, the IBM logo, ibm.com and Netezza are trademarks or registered trademarks of International Business Machines Corporation in the United States, other countries, or both. If these and other IBM trademarked terms are marked on their first occurrence in this information with a trademark symbol ( or ), these symbols indicate U.S. registered or common law trademarks owned by IBM at the time this information was published. Such trademarks may also be registered or common law trademarks in other countries. A current list of IBM trademarks is available on the Web at Copyright and trademark information at ibm.com/legal/ copytrade.shtml Microsoft and SQL Server are trademarks of Microsoft Corporation in the United States, other countries or both. Other company, product and service names may be trademarks or service marks of others. Please Recycle IMW14572-USEN-01