Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs



Similar documents
WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Azure Machine Learning, SQL Data Mining and R

Oracle Data Miner (Extension of SQL Developer 4.0)

SAP Predictive Analysis Installation

Quick Start. Creating a Scoring Application. RStat. Based on a Decision Tree Model

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

CUSTOMER Presentation of SAP Predictive Analytics

2015 Workshops for Professors

IBM SPSS Modeler 15 In-Database Mining Guide

Prerequisites. Course Outline

Advanced analytics at your hands

Advanced In-Database Analytics

StreamServe Persuasion SP5 Supported platforms and software

R Tools Evaluation. A review by Global BI / Local & Regional Capabilities. Telefónica CCDO May 2015

How To Test The Performance Of An Ass 9.4 And Sas 7.4 On A Test On A Powerpoint Powerpoint 9.2 (Powerpoint) On A Microsoft Powerpoint 8.4 (Powerprobe) (

What s New in AppliDis Fusion 4 Service Pack 1

Data Mining. SPSS Clementine Clementine Overview. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine

Some vendors have a big presence in a particular industry; some are geared toward data scientists, others toward business users.

COPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments

Table of Contents Release Notes 2013/04/08. Introduction in OS Deployment Manager. in Security Manager Known issues

Oracle Data Miner (Extension of SQL Developer 4.0)

Machine Learning with MATLAB David Willingham Application Engineer

Data Mining mit der JMSL Numerical Library for Java Applications

Cascading Pattern - How to quickly migrate Predictive Models (PMML) from SAS, R, Micro Strategies etc., onto Hadoop and deploy them at scale

Predictive Modeling Techniques in Insurance

About Dell Statistica

6.0, 6.5 and Beyond. The Future of Spotfire. Tobias Lehtipalo Sr. Director of Product Management

SEIZE THE DATA SEIZE THE DATA. 2015

White Paper. Redefine Your Analytics Journey With Self-Service Data Discovery and Interactive Predictive Analytics

Machine Learning Capacity and Performance Analysis and R

Active Directory Rights Management Services integration (AD RMS)

Big Data Analytics and Optimization

Automated Deployment of Oracle RAC Using Enterprise Manager Provisioning Pack

Introduction Predictive Analytics Tools: Weka

KNIME TUTORIAL. Anna Monreale KDD-Lab, University of Pisa

RED HAT ENTERPRISE VIRTUALIZATION FOR SERVERS: PRICING & LICENSING GUIDE

Created on May 20, 2015

How To Make A Credit Risk Model For A Bank Account

whitepaper Predictive Analytics with TIBCO Spotfire and TIBCO Enterprise Runtime for R

Predictive Analytics with TIBCO Spotfire and TIBCO Enterprise Runtime for R

Secure Agent Quick Start for Windows

Data Domain Profiling and Data Masking for Hadoop

Oracle Primavera P6 Enterprise Project Portfolio Management Performance and Sizing Guide. An Oracle White Paper October 2010

BIG DATA What it is and how to use?

Operationalise Predictive Analytics

SAP Crystal Reports & SAP HANA: Integration & Roadmap Kenneth Li SAP SESSION CODE: 0401

SAS Add-In 2.1 for Microsoft Office: Getting Started with Data Analysis

Application of Predictive Analytics for Better Alignment of Business and IT

The Data Mining Process

A fast, powerful data mining workbench designed for small to midsize organizations

Oracle EXAM - 1Z Oracle Weblogic Server 11g: System Administration I. Buy Full Product.

Altiris Client Management Suite

Installation Process

Data Mining + Business Intelligence. Integration, Design and Implementation

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

System Requirements for Dynamics GP 2013

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Welcome to the second half ofour orientation on Spotfire Administration.

SUSE Cloud Installation: Best Practices Using a SMT, Xen and Ceph Storage Environment

System Requirements. Version 8.2 November 23, For the most recent version of this document, visit our documentation website.

3. Where can I obtain the Service Pack 5 software?

Bringing Big Data Modelling into the Hands of Domain Experts

Enterprise Private Cloud Storage

Installing and Administering VMware vsphere Update Manager

Lavastorm Resolution Center 2.2 Release Frequently Asked Questions

1. Server Microsoft FEP Instalation

Name: Srinivasan Govindaraj Title: Big Data Predictive Analytics

Enhanced Diagnostics Improve Performance, Configurability, and Usability

Package acrm. R topics documented: February 19, 2015

TIBCO Spotfire Server Deployment and Administration

The power of IBM SPSS Statistics and R together

Pentaho Data Mining Last Modified on January 22, 2007

Architecting for the next generation of Big Data Hortonworks HDP 2.0 on Red Hat Enterprise Linux 6 with OpenJDK 7

Using VMware Player. VMware Player. What Is VMware Player?

Fast Analytics on Big Data with H20

Eaton NetWatch v5.0 NetWatch on Microsoft Hyper-V server 2008 Installation and configuration guide

Designing a Windows Server 2008 Applications Infrastructure

WHAT S NEW IN SAS 9.4

ANSYS EKM Overview. What is EKM?

Make Better Decisions Through Predictive Intelligence

vcloud Automation Center Support Matrix vcloud Automation Center 5.2

MEGA Web Application Architecture Overview MEGA 2009 SP4

Table of Contents. Introduction Prerequisites Installation Configuration Conclusion Recommended Reading...

XenClient Enterprise Synchronizer Installation Guide

Supported Platforms. HP Vertica Analytic Database. Software Version: 7.0.x

EnterpriseLink Benefits

Machine learning for algo trading

Windows Server 2008 R2 Hyper V. Public FAQ

Synchronizer Installation

Intelligent Power Protector User manual extension for Microsoft Virtual architectures: Hyper-V 6.0 Manager Hyper-V Server (R1&R2)

MS Implementing an Advanced Server Infrastructure

TIBCO Spotfire Business Author Essentials Quick Reference Guide. Table of contents:

NETWRIX FILE SERVER CHANGE REPORTER

Querying Databases Using the DB Query and JDBC Query Nodes

2012 LABVANTAGE Solutions, Inc. All Rights Reserved.

Transcription:

1.1 Introduction Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs For brevity, the Lavastorm Analytics Library (LAL) Predictive and Statistical Analytics Node Pack will be referred to in this document as the Node Pack. What is the Node Pack? The LAL Predictive and Statistical Analytics Node Pack is a set of nodes that allows business users to incorporate powerful statistical and predictive functionality within a Lavastorm Analytics Engine (LAE) graph or logical workflow without needing to use programming. What is included in the Node Pack? The Node Pack includes the following nodes: Linear Regression Enables users to model relationships between a dependent variable and a set of independent variables using a linear regression model. Predict Linear Regression Uses a linear regression model to predict the value of a dependent variable based on the specified values of the independent variables. Logistic Regression Enables users to model relationships between a binomial dependent variable and a set of independent variables using a logistic regression model. Predict Logistic Regression Uses a logistic (Generalized Linear Model) regression model to predict the probability of an outcome based on the specified values of the independent variables. K-Means Clustering Provides classification of data into a specified number of clusters. K-Means Advisor Helps users select the most appropriate number of clusters to use when performing K-Means classification. Hierarchical Clustering Classifies data by building a hierarchy of clusters. R Library Package Download Utility node that simplifies the download of CRAN Packages that are prerequisites for the correct operation of other nodes in the Statistical and Predictive Analytic Node Pack. Market Basket Analysis - Uses data mining to identify hidden relationships in data by identifying association rules. Page 1

Market Basket Miner Allows users to sort and filter rules that meet their specified criteria. Time Series Diagnostics Provides sets of diagnostic plots that help the user select the correct model to be used when creating a time series model and determine whether the model is a good fit for the data. Time Series Forecast Uses the Holt-Winters method to forecast a time series that can optionally contain a trend and seasonal variations, and allows the prediction of future values for the times series. Decision Forest Machine learning technique that uses an ensemble of decision trees to generate a model that can be used to classify data, perform regression modeling, or use an unsupervised learning mode to assess the importance of variables in determining the proximity of observations. Predict Decision Forest Uses a Random Forest model to predict the value or classification of a dependent variable based on the specified values of the independent variables. Is that the complete list? No, the above list reflects the set of nodes that are currently available in LAL 6.0.35.0 and LAL 5.1.35.0 releases. As with the existing LAL program, additional nodes will be added to the Node Pack in future LAL releases. Examples of these additional nodes include: Linear Regression Diagnostics Provides a set of diagnostic plots to aid the user in determining whether the model is a good fit for the data. Logistic Regression Diagnostics Provides a set of diagnostic plots to aid the user in determining whether the model is a good fit for the data. Quantile Analysis Uses regression to model the relationship between a dependent variable and a set of independent variables using a specified quantile function of the response. ARIMA Utilizes an AutoRegressive Integrated Moving Average technique to model time series data and forecast future values of the time series. How do I use the Node Pack? The nodes available in the Node Pack can be connected in a graph similar to any other node. The nodes can utilize prepared data originating from others nodes, and can also output data to other nodes for further analysis. Page 2

Figure 1: An example of a K-means clustering node using filtered data to prepare a clustered scatter plot and quick statistics. How is the node pack licensed? The nodes are part of the LAE Statistical and Predictive Node Pack. This is a premium product option that requires a separate Node Pack license in addition to the LAE license. Note: the LAE Statistical and Predictive Node Pack license does not provide access to the Power R node. A separate license is required to be able to use the Power R Node Pack. These two licenses are independent (i.e. they can be purchased separately with no dependence on the customer purchasing the other Node Pack license). Can I purchase the nodes individually? No, the nodes can only be purchased as part of the LAE Statistical and Predictive Node Pack. What are the supported LAE editions? The following LAE editions are supported: - Professional - Professional Plus - Team Server - Enterprise Server What platforms are supported? LAE 5.1.x Desktop Editions: - Microsoft Windows (64-bit): Windows 7 for x64; Windows 8.1 for x64 LAE 5.1.x Server Editions: - Microsoft Windows (64-bit): Windows Server 2008 - Oracle Linux: (x86-64; 64-bit): Enterprise 5 Page 3

- RedHat Linux (x86-64; 64-bit): Enterprise Linux 5, 6 - SUSE Linux Enterprise Server 11 Service Pack 2 Note: the R Library Download node is only supported on Windows platforms. LAE 6.0.x Desktop Editions: - Microsoft Windows (64-bit): Windows 7 for x64; Windows 8.1 for x64 LAE 6.0.x Server Editions: - Microsoft Windows (64-bit): Windows Server 2008 SP2; Windows Server 2012 R2 - Oracle Linux: (x86-64; 64-bit): Enterprise 6 - RedHat Linux (x86-64; 64-bit): Enterprise Linux 6, 7 - SUSE Linux Enterprise Server 11 Service Pack 2 Note: the R Library Download node is only supported on Windows platforms. What versions of LAE are supported? The nodes in the LAE Statistical and Predictive Node Pack are supported on LAE 6.0.x and LAE 5.1.x releases. A subset of the nodes in the Node Pack are also supported on LAE 5.0.x, see the Lavastorm Website for further details. What other related software available? The Power R node provides data scientists and other technical users with direct access to the power of TIBCO Enterprise Runtime for R, allowing them to us the R language to create custom statistical and predictive analytic functionality on an enterprise-grade and scalable platform. Can I use the Power R node if I purchase the Predictive and Statistical Analytics Node Pack? No. A separate Node Pack license is required to use the Power R node. Is the Node Pack based on Open Source R? No. The Node Pack is built on TIBCO Enterprise Runtime for R foundations, which is optimized for speed, scale, and stability. The majority of the Node Pack utilizes functionality that is natively supported by TIBCO Enterprise Runtime for R. Some functionality is based on the open source Comprehensive R Archive Network (see below for additional details). 1.2 Deploying and using the Node Pack How do customers get the software? Page 4

The Statistical and Predictive Node Pack will be available as part of the published LAL installer software that will be available on the Lavastorm Analytics website. Where a customer s LAE desktop installation is configured to automatically check for updates, they will be notified that the new software is available when their LAE installation checks for an update following the date at which it is publically released. 1.2.1 CRAN Packages Some of the nodes in the Node Pack use functionality provided by R packages. The prerequisite packages have an Open Source License and are available from the Comprehensive R Archive Network (or one of its mirror sites). As these packages are open source, they must not be distributed with LAE. However, for correct operation of the nodes in the Node Pack the prerequisite CRAN packages must be installed by the user on the machine hosting the LAE Server and installed into the file system location used by LAE. The R Library Download node provides a convenient mechanism by which non-technical users can download and install the prerequisite CRAN packages onto a Windows machine that hosts the LAE Server. The R Library Download node is not supported on Linux platforms. The R Library Download node requires access to the internet to download the prerequisite R binary packages from the CRAN repository. For Linux installations, it is necessary to install the packages by other means. Typically, CRAN packages for Linux systems are available from the CRAN repositories as source packages rather than in binary form. Open Source R (OS R) can be used to download and install the prerequisite packages. If OS R is installed on the machine hosting the LAE Server, when performing the installation, the installation directory can be set to that used by LAE for its package library location (i.e. <LAE Installation Directory> /LAE6.0/TERR/library). If the OS R installed on a separate machine is used to download and compile the prerequisite packages, is must have the same Linux configuration as the machine hosting the LAE server. Once the packages have been installed on the other machine, the packages can be copied to the directory used by LAE as the package library location (see above). Internet access will be required to download the source packages from the CRAN repository. The following CRAN packages are required by the Node Pack: - 'arules' - 'boot' - 'cluster' - 'gbm' - 'lattice' - Matrix - 'NbClust' - 'randomforest' - 'TTR' - 'zoo' Dependent packages are also required and should be automatically downloaded by OS R during the installation process. For further information on the package installation process, see the CRAN documentation (section 6.3): - http://cran.r-project.org/doc/manuals/r-release/r-admin.html Page 5

TIBCO Enterprise Runtime for R are either registered trademarks or trademarks of TIBCO Software Inc. and/or its subsidiaries in the United States and/or other countries Page 6