WHAT S NEW IN SAS 9.4



Similar documents
Hadoop & SAS Data Loader for Hadoop

and Hadoop Technology

ANALYTICS MODERNIZATION TRENDS, APPROACHES, AND USE CASES. Copyright 2013, SAS Institute Inc. All rights reserved.

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

QUEST meeting Big Data Analytics

9.4 Intelligence. SAS Platform. Overview Second Edition. SAS Documentation

I/O Considerations in Big Data Analytics

Comprehensive Analytics on the Hortonworks Data Platform

Big Data Analytics - Accelerated. stream-horizon.com

Document Type: Best Practice

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

Introduction to Big data. Why Big data? Case Studies. Introduction to Hadoop. Understanding Features of Hadoop. Hadoop Architecture.

Peers Techno log ies Pv t. L td. HADOOP

Bringing the Power of SAS to Hadoop. White Paper

Supported Platforms. HP Vertica Analytic Database. Software Version: 7.0.x

DATA VISUALIZATION: CONVERTING INFORMATION TO DECISIONS DAVID FRONING, PRINCIPAL PRODUCT MANAGER

MicroStrategy Course Catalog

Data processing goes big

Cost-Effective Business Intelligence with Red Hat and Open Source

<Insert Picture Here> Big Data

Supported Platforms. HP Vertica Analytic Database. Software Version: 7.1.x

HDP Hadoop From concept to deployment.

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Native Connectivity to Big Data Sources in MSTR 10

EMC Federation Big Data Solutions. Copyright 2015 EMC Corporation. All rights reserved.

What's New in SAS Data Management

Upcoming Announcements

TU04. Best practices for implementing a BI strategy with SAS Mike Vanderlinden, COMSYS IT Partners, Portage, MI

SAS. 9.4 Guide to Software Updates. SAS Documentation

HADOOP ADMINISTATION AND DEVELOPMENT TRAINING CURRICULUM

6.0, 6.5 and Beyond. The Future of Spotfire. Tobias Lehtipalo Sr. Director of Product Management

Constructing a Data Lake: Hadoop and Oracle Database United!

Paper SAS Techniques in Processing Data on Hadoop

Collaborative Big Data Analytics. Copyright 2012 EMC Corporation. All rights reserved.

Ad Hoc Analysis of Big Data Visualization

Oracle Big Data SQL Technical Update

EMC Greenplum Driving the Future of Data Warehousing and Analytics. Tools and Technologies for Big Data

Hadoop Introduction. Olivier Renault Solution Engineer - Hortonworks

Big Data Management and Security

Hadoop & Spark Using Amazon EMR

File S1: Supplementary Information of CloudDOE

Virtualizing Apache Hadoop. June, 2012

SAP Predictive Analytics 2.3 Supported Platforms (PAM)

Please give me your feedback

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

The Inside Scoop on Hadoop

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!

Table of Contents Introduction and System Requirements 9 Installing VMware Server 35

In-Memory Analytics for Big Data

MEGA Web Application Architecture Overview MEGA 2009 SP4

Greenplum Database. Getting Started with Big Data Analytics. Ofir Manor Pre Sales Technical Architect, EMC Greenplum

Supported Platforms HPE Vertica Analytic Database. Software Version: 7.2.x

Cisco Data Preparation

SAP Database Strategy Overview. Uwe Grigoleit September 2013

The Future of Data Management

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

Einsatzfelder von IBM PureData Systems und Ihre Vorteile.

YARN Apache Hadoop Next Generation Compute Platform

EMC Data Protection Advisor 6.0

Big Data Visualization and Dashboards

Embedded Analytics & Big Data Visualization in Any App

HDP Enabling the Modern Data Architecture

Hadoop and Map-Reduce. Swati Gore

ANALYTICS IN BIG DATA ERA

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

An Oracle White Paper November Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics

TE's Analytics on Hadoop and SAP HANA Using SAP Vora

IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems

Big Data Explained. An introduction to Big Data Science.

Cloudera Enterprise Data Hub. GCloud Service Definition Lot 3: Software as a Service

Apache Hadoop: Past, Present, and Future

An Integrated Analytics & Big Data Infrastructure September 21, 2012 Robert Stackowiak, Vice President Data Systems Architecture Oracle Enterprise

Implement Hadoop jobs to extract business value from large and varied data sets

Deploying Hadoop with Manager

A Study of Data Management Technology for Handling Big Data

Open Source Technologies on Microsoft Azure

Performance and Scalability Overview

BIG DATA USING HADOOP

Pro Apache Hadoop. Second Edition. Sameer Wadkar. Madhu Siddalingaiah

Big Data Course Highlights

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

System Requirements for SAS 9.4 Foundation for Linux for x64

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

SAP and Hortonworks Reference Architecture

Tap into Hadoop and Other No SQL Sources

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Dell In-Memory Appliance for Cloudera Enterprise

SAS 9.4 Intelligence Platform

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data

Performance and Scalability Overview

Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015

Extend your analytic capabilities with SAP Predictive Analysis

Roadmap Talend : découvrez les futures fonctionnalités de Talend

SAP Crystal Reports & SAP HANA: Integration & Roadmap Kenneth Li SAP SESSION CODE: 0401

Big Data Technologies Compared June 2014

Transcription:

WHAT S NEW IN SAS 9.4 PLATFORM, HPA & SAS GRID COMPUTING MICHAEL GODDARD CHIEF ARCHITECT SAS INSTITUTE, NEW ZEALAND

SAS 9.4 WHAT S NEW IN THE PLATFORM Platform update SAS Grid Computing update Hadoop support High Performance Analytics (HPA) update Deployment patterns

SAS PLATFORM UPDATE

SAS 9.4 CORE THEMES Enable Simplify Innovate Deployment Choices Provide quick, precise answers; reduce time to value Remove barriers of analytical knowledge and complexity; reduce risk Make analytics approachable and easy to integrate; support collaboration -On-premises -Hosted -Cloud

SAS 9.4 PLATFORM UPDATE SAS Metadata Server SAS Middle-Tier SAS Environment Manager Base/Foundation SAS enhancements Security Web Infrastructure Platform Data Server ODS, DS2 Hadoop

SAS 9.4 METADATA SERVER CLUSTERING What is a Metadata Server Cluster? Coordinated set of connected metadata servers Appear as a single standard metadata server to clients Benefits of Metadata Server Clustering Provide failure recovery for high availability Provides scalable performance for large deployments

SAS 9.4 METADATA HOW DOES CLUSTERING WORK A cluster is three or more metadata server nodes Each node is a full metadata server with a complete copy of all metadata One node is designated the master to coordinate the cluster All other nodes are slave nodes Clients connect to slave nodes Once connected it looks like a standard single server to the client

SAS 9.4 MIDDLE-TIER SAS WEB APPLICATION SERVER Reduce Cost Removal of third party application server requirement and associated costs Software + Support Reduce Cost & Complexity Deliver Higher Availability Expand Admin Capabilities Cloud-enabled Platform Reduce Complexity Simplified Support SAS provides enterprise-level support of your entire SAS deployment Simplified deployment and non-disruptive maintenance Expand Management Capabilities Increased ability to monitor and manage SAS servers, solutions and users Deliver Higher Availability Alternative to Remote Services Automated Clustering Support Deploy cloud-enabled SAS platform

SAS 9.4 MIDDLE-TIER SAS WEB APPLICATION SERVER Reduce Cost & Complexity Deliver Higher Availability Expand Admin Capabilities Cloud-enabled Platform Benefits of being embedded No additional cost Automated deployment Default clustering support Built-in integration support Non-disruptive maintenance Good Fit with Virtualization and Cloud Technologies Everything you need to build and run web and cloud applications is included in the box!

SAS 9.4 MIDDLE-TIER FEATURES Automated Deployment SAS Web Server SAS Web Application Server SAS Web Applications Middle Tier Clustering Built in support for Proxy server integration Security Integration Load Balancer Integration Single Point of Support for SAS Middle Tier Tuning Guide for SAS Middle Tier SAS Middle Tier Scripting Framework

SAS 9.4 SAS ENVIRONMENT MANAGER Embedded operational monitoring solution Proactive monitoring of platform health and availability Agent based, auto-discovery of resources Monitoring of log events and configuration changes Alerting and actions

SAS ENVIRONMENT MANAGER OVERVIEW

SAS ENVIRONMENT MANAGER ARCHITECTURE OVERVIEW Platform 1 ( machine ) Mid-Tier Tc Servers Object Spawner Agent Platform 2 ( machine 2 ) Management Server GUI Administration, Provisioning, Groups, Metrics, Alerts, Events, Logs, Agents CLI Open API RESTful Web GUI Dashboard Control Center Metadata tc S server Object Spawner Agent CMDB Service Database Inventory, Events, Alerts Upgradeable via plug-ins

SAS 9.4 BASE SAS SECURITY Lock-down Restrict access to only those host paths and files that are included in that server s list of permitted resources NOTE: Lock-down is not fully supported by all SAS tools and solutions. Metadata-bound libraries Universally enforce metadata-layer permission requirements for physical tables regardless of how a user requests access from SAS SAS/SECURE Automatically delivered with Base SAS at no additional cost* Advanced Encryption Standard (AES) with SAS/SECURE Encrypt SAS data on disks * United States export regulations on encryption software restrict access to SAS/SECURE software and related technical data. http://sww.sas.com/pub/onlinedoc/v8-0/prod/sashtml/shr/z0354312.htm

SAS 9.4 PLATFORM UPDATES SAS 9.4 added support: Java 7 RDBMS- POSTGRES (Web Infrastructure Platform Data Server) Windows 8, Windows 8.1 Windows Server 2008 SP2 Google Chrome Web Browser Internet Explorer 11 (IE11) - on Windows 7, Windows 8, Windows 8.1 SAS 9.4 does not support: Framework Data Server (Firebird) HP-UX PA-RISC (per HP s lifecycle) Linux 32-bit Windows: XP, Vista, Windows Server 2003 No direct migration from SAS 9.1.3 to SAS 9.4 SAS 9.4M1 is available now!

SAS 9.4 BASE SAS NEW OUTPUT DELIVERY SYSTEM (ODS) - DESTINATIONS ODS EPUB - Create SAS reports as e-books that can be read with Apple ibooks e-book reader on ipad and iphone. ODS POWERPOINT Create Microsoft PowerPoint slides that combine text and SAS reports. ODS HTML5 Create HTML5 output for SAS reports to support delivery to any web browser that is HTML5-compatible.

SAS GRID COMPUTING UPDATE

SAS GRID SUMMARY OF NEW FEATURES GRID OPTION SETS Workspace Servers launched using the grid Logging facility SASGSUB enhanced wait

SAS 9.3 MANAGING USERS AND APPLICATIONS Specific users Using specific applications Needing specific grid options + One set of grid options per SAS application server context = Multiple SAS application server contexts

SAS 9.3 AN EXAMPLE

SAS 9.4 GRID OPTION SETS Risk DIS SASGSUB DIS GRID OPTIONS SAS Options: -memsize 256 Resources: <none> Grid Options: queue=normal DIS SASGSUB OPTIONS SAS Options: -memsize 0 Resources: GSUB Grid Options: queue=night Finance SASGSUB

OPTION SETS THE SAME DEPLOYMENT IN 9.4 Using Grid Options Sets

OPTION SETS LOGICAL GRID SERVER DEFINITION Default options

SAS GRID WORKSPACE SERVERS LAUNCHED SAS 9.3 Implementation SAS 9.4 Implementation EG WORKSPACE SERVER EG/AMO Grid Macros GRID EG GRID WORKSPACE SERVER GRID SERVER

HADOOP SUPPORT

HADOOP SAS SUPPORT Support for Cloudera and HortonWorks Cloudera CDH 4.1.2, Hortonworks HDP 1.3 or 2.0 SAS/ACCESS to Hadoop Basic FILE/INFILE connectivity LIBNAME (DATA= and OUT= connectivity) PROC HADOOP SAS/ACESS to Cloudera Impala Scoring accelerator Code accelerator HPA SPD Engine with Hadoop Cloudera Impala is an open source Massively Parallel Processing (MPP) query engine that runs natively on Apache Hadoop.

HADOOP SAS WITH HADOOP ECOSYSTEM User Interface SAS Enterprise Guide SAS Data Integration SAS Enterprise Miner SAS Visual Analytics SAS User Metadata SAS Metadata Data Access Base SAS & SAS/ACCESS Interface to Hadoop In-Memory Data Access Data Processing HBase Pig MapReduce Hive SAS LASR Analytic Server & SAS High- Performance Analytics MPI Based File System HDFS

SAS 9.4 SAS AND HADOOP Use SPD Engine with Hadoop Read, write, and update data through the Hadoop Distributed File System (HDFS). Submit configuration properties to the Hadoop server. Why it matters Customers can take advantage of the benefits of Hadoop HDFS to store and access SAS data sets stored across their Hadoop clusters.

HIGH PERFORMANCE ANALYTICS

SAS HPA ENHANCEMENTS New product bundles SAS High-Performance Statistics SAS High-Performance Data Mining SAS High-Performance Text Mining SAS High-Performance Forecasting SAS High-Performance Econometrics SAS High-Performance Optimization Two operating modes SAS High-Performance Analytics Alongside mode SAS High-Performance Analytics Next to mode No longer need a dedicated appliance, HPA Appliance Asymmetric ( next to mode) and SMP HPA Procs

SAS HPA ASYMMETRIC MODEL CLIENT Asymmetric Data Loading direct into SAS In-Memory from Databases HADOOP GREENPLUM TERADATA ORACLE Exadata SAS Rack SAS In-Database embedded processing engines - For pulling data into the SAS HP Analytics nodes (directly through Asymmetric loading or indirectly using Hadoop as caching layer) - Improved features for processing large data volume - Used also for deploying models directly into Database for operational use

DEPLOYMENT PATTERNS

SHARED PLATFORM A SHARED INTEGRATED PLATFORM Shared environment for all users (analytics and VA) Distributed and non-distributed VA SAS Grid and non-grid environments SAS High Performance Analytics

SHARED PLATFORM A SHARED INTEGRATED PLATFORM From SAS 9.4 it has been possible to have a truly shared platform Distributed HPA proc execution supports RHEL and SUSE so only applies to SAS Grids on RHEL and SUSE Compute / GRID Environment Shared Metadata Compute / GRID Environment Shared Metadata Shared Middle-tier Shared Middle-tier SAS Visual Analytics (SMP) Environment SAS HPA Products (Asymmetric Model) SAS Visual Analytics (Distributed) Environment Big Data Cluster Hadoop Teradata Greenplum Oracle * Non-distributed or SMP VA is supported on Linux and Windows.

HIGH LEVEL ARCHITECT SAS VISUAL ANALYTICS 6.2+ AND SAS (E)BI SERVER SHARED METADATA SERVER FOR 9.4 ENVIRONMENT ONLY VA CLIENTS Desktop Web Mobile SAS VISUAL ANALYTICS ENVIRONMENT VA Server (LAX) Mid-Tier (SAS App Server) Workspace Server LASR Cluster SAS LASR Analytic Server Co-Located Data Storage Hadoop RDBMS Shared Metadata Server Metadata Server Nonrelational EBI CLIENTS Desktop Web SAS (E)BI Middle Tier Server (LAX* recommended) Mid-Tier (SAS App Server) SAS (E)BI Compute Server Workspace Server Stored Process Server ERP Click Stream (*) non-distributed VA can also be deployed on Windows OLAP Server PC Files

HIGH LEVEL ARCHITECTURE SAS GRID & HPA SERVER 0 SERVER 1 SERVER 2 SERVER 3 SERVER 4 SERVER n SAS Web Application Server SAS Grid Control Server SAS Grid Compute Node SAS Grid Compute Node SAS Grid Compute Node SAS Grid Compute Node SAS Web Applications SAS Web Infrastructure Platform SAS Metadata Server SAS Workspace Server / SAS Stored Process Server (SAS Analytics Products) SAS LASR TM ANALTIC SERVER (SAS High-Performance Analytics Products) SAS Environment Manager SAS High Performance Analytics Products (Root Node) SAS High Performance Analytics Products (Worker Node) SAS High Performance Analytics Products (Worker Node) SAS High Performance Analytics Products (Worker Node) SAS Web Infrastructure Data Server SAS HPA Deployment of Hadoop (Name Node) SAS HPA Deployment of Hadoop (Data Node) SAS HPA Deployment of Hadoop (Data Node) SAS HPA Deployment of Hadoop (Data Node)

Questions

THANK YOU