WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat



Similar documents
Quick Start. Creating a Scoring Application. RStat. Based on a Decision Tree Model

Operationalising Predictive Insights

Operationalising Predictive Insights

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

Azure Machine Learning, SQL Data Mining and R

Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP

KnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES

Easily Identify Your Best Customers

R Tools Evaluation. A review by Global BI / Local & Regional Capabilities. Telefónica CCDO May 2015

A fast, powerful data mining workbench designed for small to midsize organizations

2015 Workshops for Professors

STATISTICA. Financial Institutions. Case Study: Credit Scoring. and

Data Mining and Visualization

Grow Revenues and Reduce Risk with Powerful Analytics Software

New Work Item for ISO Predictive Analytics (Initial Notes and Thoughts) Introduction

Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone:

Data Mining + Business Intelligence. Integration, Design and Implementation

Didacticiel Études de cas

Get to Know the IBM SPSS Product Portfolio

Data Science with R. Introducing Data Mining with Rattle and R.

Dashboards, Scorecards, and Mashups Visual Insight Into Your Organization s Health and Performance

White Paper. Redefine Your Analytics Journey With Self-Service Data Discovery and Interactive Predictive Analytics

Bowerman, O'Connell, Aitken Schermer, & Adcock, Business Statistics in Practice, Canadian edition

Server Load Prediction

WebFOCUS InfoDiscovery

Our Raison d'être. Identify major choice decision points. Leverage Analytical Tools and Techniques to solve problems hindering these decision points

SAS ENTERPRISE MINER 5.3

How To Choose A Business Intelligence Toolkit

How to Optimize Your Data Mining Environment

SEIZE THE DATA SEIZE THE DATA. 2015

Data Mining. Dr. Saed Sayad. University of Toronto

IBM SPSS Direct Marketing

Predictive modelling around the world

WROX Certified Big Data Analyst Program by AnalytixLabs and Wiley

Oracle Data Mining Hands On Lab

High Performance Predictive Analytics in R and Hadoop:

Data analysis process

Data Mining. SPSS Clementine Clementine Overview. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine

Data Mining Using SAS Enterprise Miner Randall Matignon, Piedmont, CA

Pentaho Data Mining Last Modified on January 22, 2007

Data Visualization Techniques

IBM SPSS Modeler Premium

TIBCO Spotfire Business Author Essentials Quick Reference Guide. Table of contents:

How To Understand Data Mining In R And Rattle

What is Data Mining? MS4424 Data Mining & Modelling. MS4424 Data Mining & Modelling. MS4424 Data Mining & Modelling. MS4424 Data Mining & Modelling

Data Exploration Data Visualization

Comparative Analysis of the Main Business Intelligence Solutions

Predictive Modeling in Workers Compensation 2008 CAS Ratemaking Seminar

Business Intelligence and Process Modelling

Predictive Analytics Techniques: What to Use For Your Big Data. March 26, 2014 Fern Halper, PhD

Make Better Decisions Through Predictive Intelligence

KnowledgeSEEKER POWERFUL SEGMENTATION, STRATEGY DESIGN AND VISUALIZATION SOFTWARE

Achieve Better Insight and Prediction with Data Mining

Predictive Analytics Powered by SAP HANA. Cary Bourgeois Principal Solution Advisor Platform and Analytics

Consumption of OData Services of Open Items Analytics Dashboard using SAP Predictive Analysis

APPLICATION PROGRAMMING: DATA MINING AND DATA WAREHOUSING

About Dell Statistica

Data Preparation Part 1: Exploratory Data Analysis & Data Cleaning, Missing Data

Some vendors have a big presence in a particular industry; some are geared toward data scientists, others toward business users.

Better decision making under uncertain conditions using Monte Carlo Simulation

Machine Learning with MATLAB David Willingham Application Engineer

Exadata V2 + Oracle Data Mining 11g Release 2 Importing 3 rd Party (SAS) dm models

Prof. Pietro Ducange Students Tutor and Practical Classes Course of Business Intelligence

not possible or was possible at a high cost for collecting the data.

KnowledgeSEEKER Marketing Edition

Diagrams and Graphs of Statistical Data

Driving business intelligence to new destinations

OpenText Actuate Big Data Analytics 5.2

Up Your R Game. James Taylor, Decision Management Solutions Bill Franks, Teradata

Instructions for SPSS 21

Improve Results with High- Performance Data Mining

Oracle Data Miner (Extension of SQL Developer 4.0)

Advanced In-Database Analytics

Achieve Better Insight and Prediction with Data Mining

APPROACHABLE ANALYTICS MAKING SENSE OF DATA

Easily Identify the Right Customers

Data Mining is the process of knowledge discovery involving finding

Tax Fraud in Increasing

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments

Introduction to Big Data with Apache Spark UC BERKELEY

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Empowering the Masses with Analytics

Normality Testing in Excel

The Data Mining Process

AcademyR Course Catalog

Find the Hidden Signal in Market Data Noise

Application of SAS! Enterprise Miner in Credit Risk Analytics. Presented by Minakshi Srivastava, VP, Bank of America

Implementing Data Models and Reports with Microsoft SQL Server

Introduction Course in SPSS - Evening 1

Bill Burton Albert Einstein College of Medicine April 28, 2014 EERS: Managing the Tension Between Rigor and Resources 1

Big Data Analytics and Optimization

BIDM Project. Predicting the contract type for IT/ITES outsourcing contracts

Unit 1: Introduction to Quality Management

Making confident decisions with the full spectrum of analysis capabilities

IBM SPSS Direct Marketing 23

Advanced analytics at your hands

SQL Server 2005 Features Comparison

Transcription:

Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise system or application and enables simple and intuitive interaction with information. WebFOCUS RStat Predict the Future and Make Effective Decisions Today Traditional reporting provides a clear picture of the past, but has little power to shed light on the future. WebFOCUS RStat, the market s first fully integrated business intelligence (BI) and predictive analytics environment, bridges the gap between backward- and forwardfacing views of business operations by enabling the deployment of predictive models as scoring applications. WebFOCUS RStat Data mining extracts historical data and applies statistical techniques to build a model to predict an outcome. A scoring application supports decision-making by providing nontechnical users with an analytic resource for repeated use on new data sets. For example: A marketing executive can score new mailing lists to distinguish between good and bad prospects An insurance or bank officer can accurately determine if a client is a good risk A police department can be better prepared to prevent crime if they can predict when and where crimes are likely to occur Statisticians spend much of their time extracting and querying data. With WebFOCUS RStat, statisticians can leverage the same infrastructure that BI developers use to create queries and build models. They can then generate scoring routines from these models that can be used by the BI developers to build and deploy WebFOCUS scoring applications on any platform. This approach eliminates the need to work with multiple tools or platforms. RStat WebFOCUS RStat is a graphical user interface (GUI) front end to the R library of statistical functions from the R Systems open source project, which is maintained by a worldwide consortium of universities that teach statistics. Information Builders built RStat so that even business analysts without advanced statistical training can build statistical models. By building statistical scoring solutions into the WebFOCUS metadata, Information Builders provides a unique method for deploying them on any computer platform or operating system, e.g. Windows, Unix, Linux, Z/OS, and i Series, making RStat the simplest predictive analytics system to deploy in any application.

Data Preparation A user can access metadata, generate a query, create derived fields, and load the data into WebFOCUS RStat for data mining and statistical analysis. The RStat dialog shows how data can be sampled and variables can be assigned for modeling purposes. Full integration with Developer Studio and WebFOCUS Reporting Servers: Access more than 300 data sources for extracting data for modeling Access data from different servers Create new or leverage existing metadata and queries (FEXes) Use defines and computes to create new virtual fields in the modeling data set Apply advanced sampling techniques Create training and evaluation data sets Data mine and model using an easy and intuitive UI 2

Data Exploration The Explore tab allows users to examine the distribution and other statistics of the variables. The output window shows the summary statistics for the data set. The interactive graph window shows the analysis of the distribution for two variables along a third dimension (i.e. age and income sliced by gender). It also displays the different interactive options. Robust data exploration and descriptive statistics capabilities: Min, max, mean, quintiles, standard deviation, variance, standard error, LCL mean and UCL mean, counts, missing value counts, unique value counts, etc. Kurtosis and skewness Interactive graphs: Display multiple charts or multiple variables in matrix plots for comparison Group displayed variables by other variables for comparison Filter data dynamically to display details and sub-segments 3

Data Visualization A variety of charts and panels that can be generated using WebFOCUS RStat to display multiple variables or multiple charts in matrix plots, and group displayed variables by other variables for comparison. Create box, bar, or dot plots Generate mosaic, histogram, or cumulative charts Create Bendford charts Create interactive scatterplots, matrix plots, bar charts, time series, or parallel coordinates Build a hierarchical correlation dendogram Generate principal components charts Transformation types: Rescale: recenter, scale 0-to-1, median/mad, natural log, and matrix Impute: zero, mean, median, mode, and constant Binning: quantiles, K-means, and equal width Remap: indicator, join categories, categoric to numeric, and numeric to categoric 4

Hypothesis Testing and Clustering Shown here are the clustering analysis interface, the different methods available to build hierarchical clusters, and the dendogram and discriminant coordinates plots for the clusters. Hypothesis testing: T-test and F-test Kolmogorov-Smirnov and Wilcoxon tests Correlation analysis Clustering: K-means clustering Hierarchical clustering with dendogram plots or discriminant coordinates plots Association and market basket analysis 5

Model Building Shown here are the RStat modeling tab, the selection of the decision tree model, the output of the model (i.e. the description of the tree nodes and the rules), and the graphical display of the tree that shows how the data was classified into the end nodes. Robust models for prediction and classification: Decision tree GLM Boosting Random forests Support vector machine Neural network Linear regression Logistic regression Poisson regression Multinomial regression Survival Analysis (COXPH and Parametric) Model export: PMML export C export 6

Model Evaluation Evaluation of the decision tree model. The output window shows the error matrix, an analysis of how many of the records in the evaluation data set were correctly classified. The ROC and Lift charts are displayed for evaluating gains from applying the model. Immediately evaluate the model for performance Compare the performance of multiple models Built-in rules determine what evaluation methods apply Error matrix Lift, ROC, precision, sensitivity, and cost curves Risk curve Predicted vs. observed values curve Scoring data sets directly in RStat 7

Application Creation Scoring Routines can be incorporated into any WebFOCUS report or application. The Active Report shown here is an example of how RStat Predictive Modeling can put ad hoc analytics in the hands of operational users. Deploy scoring routines on any WebFOCUS server Access scoring routines using standard COMPUTE or DEFINE within a report or graph Provide ad hoc analytics with Active Reports Plot predictions on a map or graph Support real-time decision-making through KPI dashboards or transactional process flows Corporate Headquarters Two Penn Plaza, New York, NY 10121-2898 (212) 736-4433 Fax (212) 967-6406 DN7506080.0210 informationbuilders.com askinfo@informationbuilders.com Canadian Headquarters 150 York St., Suite 1000, Toronto, ON M5H 3S5 (416) 364-2760 Fax (416) 364-6552 For International Inquiries +1(212) 736-4433 Copyright 2010 by Information Builders. All rights reserved. [89] All products and product names mentioned in this publication are trademarks or registered trademarks of their respective companies. Printed in the U.S.A. on recycled paper