Platfora Deployment Planning Guide

Size: px
Start display at page:

Download "Platfora Deployment Planning Guide"

Transcription

1 Platfora Deployment Planning Guide Version 5.3 Copyright Platfora 2016 Last Updated: 5:30 p.m. June 27, 2016

2 Contents Document Conventions... 4 Contact Platfora Support...5 Copyright Notices... 5 Chapter 1: About Platfora Deployments... 7 Platfora Deployment Architectures...7 On-Premise Hadoop Deployments... 7 Amazon AWS Cloud Deployments...9 Google Cloud Platform Deployments Platfora Server Architecture...10 FAQs Platfora Deployments Chapter 2: Supported Environments and Versions...16 Chapter 3: System Requirements (On-Premise) Platfora Server Requirements...18 Hadoop Resource Requirements...19 Chapter 4: System Requirements (AWS Cloud)...21 Platfora EC2 Instance Requirements...21 Amazon EMR Instance Requirements...22 AWS Security Settings for Platfora...23 Amazon AWS Virtual Private Cloud (VPC) IAM User and IAM Roles for Platfora...24 EC2 Security Group Settings Chapter 5: System Requirements (GCP Cloud) Platfora Compute Engine Machine Requirements...30 Google Dataproc Machine Requirements...31 GCP Security Settings for Platfora Chapter 6: Port Configuration Requirements...34 Ports to Open on Platfora Nodes Ports to Open on Hadoop Nodes Chapter 7: Browser Requirements Appendix A: Hardware Specifications for Platfora Nodes... 38

3 Platfora Deployment Planning Guide - Contents Appendix B: EC2 Considerations for Platfora Instances Page 3

4 Preface This guide provides information about what you need to consider when deploying a new Platfora cluster. This guide is intended for system and Hadoop administrators who are responsible for procuring and managing server resources. Knowledge of Linux system administration, network administration and Hadoop administration is recommended. Document Conventions This documentation uses certain text conventions for language syntax and code examples. Convention Usage Example $ Command-line prompt - proceeds a command to be entered in a command-line terminal session. $ sudo Command-line prompt for a command that requires root permissions (commands will be prefixed with sudo). $ ls $ sudo yum install open-jdk-1.7 UPPERCASE italics [ ] (square brackets)... (elipsis) Function names and keywords are shown in all uppercase for readability, but keywords are caseinsensitive (can be written in upper or lower case). Italics indicate a usersupplied argument or variable. Square brackets denote optional syntax items. An elipsis denotes a syntax item that can be repeated any number of times. SUM(page_views) SUM(field_name) CONCAT(string_expression[,...]) CONCAT(string_expression[,...]) Page 4

5 Platfora Deployment Planning Guide - Introduction Contact Platfora Support For technical support, you can send an to: support@platfora.com Or visit the Platfora support site for the most up-to-date product news, knowledge base articles, and product tips. To access the support portal, you must have a valid support agreement with Platfora. Please contact your Platfora sales representative for details about obtaining a valid support agreement or with questions about your account. Copyright Notices Copyright Platfora Corporation. All rights reserved. Platfora believes the information in this publication is accurate as of its publication date. The information is subject to change without notice. THE INFORMATION IN THIS PUBLICATION IS PROVIDED AS IS. PLATFORA CORPORATION MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Use, copying, and distribution of any Platfora software described in this publication requires an applicable software license. Platfora, You Should Know, Interest Driven Pipeline, Fractal Cache, and Adaptive Job Synthesis are trademarks of the Platfora Corporation. Apache Hadoop and Apache Hive are trademarks of the Apache Software Foundation. All other trademarks used herein are the property of their respective owners. Embedded Software Copyrights and License Agreements Platfora contains the following open source and third-party proprietary software subject to their respective copyrights and license agreements: Apache Hive PDK dom4j freemarker GeoNames Google Maps API Apache Jandex Page 5

6 Platfora Deployment Planning Guide - Introduction Apache POI javassist javax.servlet Mortbay Jetty OWASP CSRFGuard 3 PostgreSQL JDBC Scala sjsxp : Unboundid Tableau jbcrypt SimpleSlider Page 6

7 Chapter 1 About Platfora Deployments Platfora runs on dedicated servers in the same network as your Hadoop deployment, which can be in an onpremise data center or in the cloud. Platfora uses the data processing services of Hadoop to process and prepare data for analysis. Platfora uses the data storage services of Hadoop to access the raw data and to store the output of the optimized data it prepares. This section explains how Platfora is deployed and the basics of the Platfora/ Hadoop server architecture. Topics: Platfora Deployment Architectures Platfora Server Architecture FAQs Platfora Deployments Platfora Deployment Architectures The Platfora software runs on a scale-out cluster of servers. These servers can be physical servers in an on-premise data center or virtual server instances in the cloud. Platfora uses native Hadoop protocols to connect to the distributed file system and data processing services of Hadoop. Platfora should be deployed on dedicated machines with low-latency connections to these Hadoop cluster services. This section explains how Platfora is deployed in your network environment, using either an on-premise, Google Dataproc cloud, or AWS cloud deployment of Hadoop. On-Premise Hadoop Deployments An on-premise Hadoop deployment means that you already have an existing Hadoop installation in your data center (either a physical data center or a virtual private cloud). Page 7

8 Platfora Deployment Planning Guide - About Platfora Deployments Platfora connects to the Hadoop cluster managed by your organization, and the majority of your organization's data is stored in the distributed file system of this primary Hadoop cluster. For on-premise Hadoop deployments, the Platfora servers should be on their own dedicated hardware co-located in the same data center as your Hadoop cluster. A data center can be a physical location with actual hardware resources, or a virtual private cloud environment with virtual server instances (such as Rackspace or Amazon EC2). Platfora recommends putting the Platfora servers on a network with at least 1 Gbps connectivity to the Hadoop nodes. Platfora users access the Platfora master node using an HTML5-compliant web browser. The Platfora master node accesses the HDFS NameNode and the MapReduce JobTracker or YARN Resource Manager using native Hadoop protocols. The Platfora worker nodes access the HDFS DataNodes directly. If using a firewall, Platfora recommends placing the Platfora servers on the same side of the firewall as your Hadoop cluster. Platfora software can run on a wide variety of server configurations on as little as one server or scale across multiple servers. Since Platfora runs best with all of the active lenses readily available in RAM, Platfora recommends obtaining servers optimized for higher RAM capacity and a minimum of 8 CPUs. Page 8

9 Platfora Deployment Planning Guide - About Platfora Deployments Amazon AWS Cloud Deployments An Amazon Web Services (AWS) cloud deployment means that you do not have a persistent Hadoop cluster. Instead, your organization uses Amazon S3 for raw data storage and Amazon EMR for ondemand Hadoop data processing. In an Amazon AWS cloud deployment, the Platfora server instances are deployed on dedicated, highmemory EC2 instances. Your organization s raw data is managed in Amazon's Simple Storage Service (S3). Platfora uses Amazon Elastic MapReduce (EMR) to run its data processing jobs (lens builds). The results of the lens build jobs are then written back to S3. Page 9

10 Platfora Deployment Planning Guide - About Platfora Deployments Google Cloud Platform Deployments A Google Cloud Platform (GCP) cloud deployment means that you do not have a persistent Hadoop cluster. Instead, your organization uses Google Cloud Storage for raw data storage and Google Cloud Dataproc for on-demand Hadoop data processing. In a Google Cloud Platform deployment, the Platfora server instances are deployed on dedicated, highmemory Google Compute Engine instances. Your organization s raw data is managed in Google's Cloud Storage. Platfora uses Google Cloud Dataproc to run its data processing jobs (lens builds). The results of the lens build jobs are then written back to Google Cloud Storage. Platfora Server Architecture Platfora connects to an existing Hadoop implementation, and makes the raw data residing in Hadoop accessible to users. The Platfora server has a number of services that work together with Hadoop's Page 10

11 Platfora Deployment Planning Guide - About Platfora Deployments services to access the raw data, prepare it for analysis, and present the results to users. This topic helps you understand the main components of the Platfora server architecture. The Platfora Master Node You can have a fully-functioning Platfora installation with just one node the master node. The master node manages the following Platfora services: Metadata Catalog - Platfora's metadata catalog holds all of the information about the data managed by Platfora (the datasets, lenses, vizboards and so on). The metadata catalog is a relational database that runs on the Platfora master node, but is accessed by all nodes in the Platfora cluster. Lens Builder - The lens builder interfaces with the data processing services of Hadoop. It translates data requests from the Platfora application into a series of custom MapReduce jobs, which it then submits to the Hadoop Job Tracker or Resource Manager for execution. After the requested data has been extracted and transformed in Hadoop, the job results are written back to the Hadoop file system in Platfora's proprietary file format called a lens. On-Disk Storage - Finished lenses are immediately copied from the Hadoop file system to on-disk storage of the Platfora nodes. The data of a lens is distributed across all of the available worker nodes in a Platfora cluster. In-Memory Query Engine - When users explore and analyze data in Platfora, they are actually generating queries that run against a lens. The result of a lens query is rendered as a visualization in Platfora. When users construct visualizations, they choose a lens to work with. Choosing a lens Page 11

12 Platfora Deployment Planning Guide - About Platfora Deployments loads its data into Platfora's in-memory query engine. The in-memory query engine has two kinds of processes that work on a query: 1. Query Coordinator - The query coordinator process runs on the master node only, and translates actions made in the Platfora application into queries. The coordinator sends the query to the workers for processing, then consolidates the partial results from each worker into a final result. 2. Query Worker - The query worker process typically runs on the worker nodes, but the master may also serve as a worker in some cases. A query worker process works on its portion of lens data for a given query. Web Application Server - Platfora's user interface runs as a web application in your network. Users connect to Platfora using any HTML5-compliant browser. Through the browser, users interact with data in Hadoop as easily as browsing a web site. The Platfora Worker Nodes The Platfora worker nodes are used to distribute lens storage capacity and query processing workload. As users work with more and bigger lenses in Platfora, more memory and processing power is needed to render visualizations quickly. Administrators can add additional worker nodes to scale up lens storage capacity and performance. By using the resources of multiple machines to store and process lens data, Platfora can handle true 'big data' query workloads. Page 12

13 Platfora Deployment Planning Guide - About Platfora Deployments FAQs Platfora Deployments Got questions about what you need to get Platfora up and running? Want to know how Platfora is deployed in your data center environment and how it works with Hadoop? This topic answers the most frequently asked questions (FAQs) about Platfora installation and deployment. What do I need before I can install Platfora? Before you can install Platfora, you will need: Hadoop Platfora needs access to an installed and running Hadoop cluster, or to a Google Cloud Platform account with Google Cloud Storage and Google Cloud Dataproc enabled, or to an Amazon Web Services (AWS) account with Amazon S3 (Simple Storage Service) and EMR (Elastic MapReduce) enabled. Linux Server(s) You will need one or more dedicated servers running a supported Linux operating system on which to install Platfora. The Platfora server(s) should be in the same data center (or region) as your Hadoop distribution, but not on the same machines. Platfora Binaries A Platfora customer support representative can give you the download link to the Platfora installation package for your chosen Hadoop distribution. Platfora provides both rpm and tar installer packages. Platfora License A Platfora customer support representative must issue you a license file. Trial period licenses are available upon request for pilot installations. Platfora Installation Guide You will need the Platfora installation guide that covers your specific Hadoop distribution. The setup steps vary slightly depending on the version of Hadoop you are using. What are the high-level steps involved in installing Platfora? Every Platfora installation involves these basic steps, although the details will vary slightly depending on the Hadoop distribution you are using: Configure Hadoop for Platfora Access Make sure that the Platfora server(s) can access your Hadoop services over the network and that Platfora has write access to a designated directory in the Hadoop file system. Obtain the required connection details for your Hadoop services (Platfora connects to Hadoop during setup). Install Prerequisites on all Platfora Nodes Make sure the Platfora servers have the required dependencies before installing Platfora. If using the rpm installer, Platfora provides a base package that includes the dependencies. If using the tar installer, you will need to manually install the dependent software yourself. Install the Platfora Software on the Master Install the Platfora binaries on the master node. Setup the Platfora Master Run the setup utility to configure the Platfora master server and connect it to your Hadoop services. Start Platfora After setup completes, start the Platfora server. You should now have a fullyfunctioning single-node Platfora installation. Page 13

14 Platfora Deployment Planning Guide - About Platfora Deployments Run Tests and Load the Tutorial Data After setup completes, you may want to run some tests to make sure that Platfora is properly configured and can access your Hadoop cluster. One way to test everything is to load the tutorial data that comes with your Platfora installation. This will put some data in Hadoop and build a small lens to make sure everything is working. Add Platfora Worker Nodes Once you have the Platfora master node up and running, you can use it to add Platfora worker nodes to the cluster. The master node is always used to install and manage the worker nodes. Is there a trial version of Platfora? Platfora does not currently have a trial version available for download. You can contact Platfora Customer Support to arrange for a pilot or trial installation. Why would I need multiple Platfora nodes? When users work with lens data in Platfora, that data is loaded into memory so that queries (vizzes) are fast and responsive. If there is more lens data than can fit into memory, then some queries may be slow or not be able to run at all. Adding more nodes to your Platfora cluster makes more disk, memory and CPU available to store and process lens data. How many Platfora nodes would I need? Platfora is intended for big data query workloads, and performs best when using the resources of multiple machines. Although you can have a fully-functioning Platfora installation with just one node, a multi-node installation is necessary for optimal performance and bigger lens sizes. The ideal number of Platfora nodes really depends on a lot of factors: lens size, lens quantity, data variety, and number of concurrent users (to name a few). Your Platfora account representative will help you determine the number of nodes that best fits your unique data requirements. You can also scale up your Platfora cluster as your data and usage grows. How does Platfora interact with Hadoop? Platfora uses the powerful distributed storage and processing features of Hadoop, but masks the complexity of working with HDFS and MapReduce by providing an easy-to-use web interface. Platfora uses Hadoop to access the raw data stored in its distributed file system (DFS) and makes the data visible to Platfora users. It uses the data processing services of Hadoop (MapReduce) to pull requested data and prepare it for analysis. The result of these processing jobs is the Platfora lens. Platfora lenses are stored in the Hadoop distributed file system, as well as copied over to the Platfora servers. Can Platfora connect to more than one source system? When you install Platfora, you connect it to one Hadoop distribution. This is the primary source system that Platfora uses to access the source data and process its lens builds. You can create data sources that point to external sources (such as a cloud storage service or a relational database). However, this external data must be pulled over to the primary Hadoop source system during Page 14

15 Platfora Deployment Planning Guide - About Platfora Deployments lens build processing. To avoid moving large amounts of data over the network, Platfora recommends using external data sources for smaller, supplemental datasets only. What does Platfora do to the data in Hadoop? Platfora reads the raw data, but does not edit, update, or delete it in place. It makes a copy of the requested portion of the data when it builds a lens, and does its lens processing on the copied data. Your original data remains intact and unaltered. How does Platfora keep my data secure? Platfora's role-based security allows you to control who can authenticate to the Platfora application and what actions they can perform. You can maintain user credentials within the Platfora application, or configure Platfora to use an external LDAP directory service to authenticate users. To authorize access to the raw data, you can either manage data access permissions within the Platfora application itself, or you can configure Platfora to use Kerberos authorization check the HDFS file system permissions. How does Platfora handle redundancy and high availability? Platfora relies on Hadoop for redundancy and high-availability of the raw data itself. The Platfora worker nodes are fully redundant and highly available. The worker nodes process the lens queries submitted to the Platfora application. Lens data is distributed and replicated across all of the worker nodes in the Platfora cluster. Depending on the number of worker nodes you have, you can lose a node and still continue processing queries without interruption of service. A redundant Platfora master node involves taking routine backups of the metadata catalog database so you can restore the master node if needed. Page 15

16 Chapter 2 Supported Environments and Versions This section lists the environments and versions that Platfora supports. Hadoop and Hive Versions This section lists the Hadoop distributions and versions that are compatible with the Platfora installation packages. If using Hive as a data source for Platfora, the version of Hive must be compatible with the version of Hadoop you are using. Hadoop Distro Version Hive Version M/R Version Platfora Package Cloudera 5 CDH YARN cdh52 CDH YARN cdh54 CDH YARN cdh54 CDH YARN cdh54 Hortonworks HDP 2.2.x YARN hadoop_2_6_0_hive_0_14_0 HDP 2.3.x YARN hadoop_2_7_1_hive_1_2_1 HDP 2.4.x YARN hadoop_2_7_1_hive_1_2_1 MapR MapR YARN mapr402 MapR YARN mapr41 MapR YARN mapr5 MapR YARN mapr51 Pivotal Labs PivotalHD YARN hadoop_2_6_0_hive_0_14_0 Amazon EMR (AMI 3.10.x) Google Dataproc (1.0) Hadoop YARN hadoop_2_4_0_hive_0_13_0 Hadoop YARN hadoop_2_7_2_hive_1_2_1 Page 16

17 Platfora Deployment Planning Guide - Supported Environments and Versions Operating Systems Operating System Supported Versions Red Hat Enterprise Linux 6.2, 6.3, 6.4, 6.5, and 6.6 CentOS 6.2, 6.3, 6.4, 6.5, and 6.6 Scientific Linux 6.2 Amazon Linux AMI AMI and AMI Ubuntu LTS Oracle Linux 6.x Web Browsers Web Browser Chrome Firefox Supported Versions Latest version (Evergreen) and three previous releases 25.0.x or higher Safari 6.1+ and 7.x Internet Explorer with the IE 11 (Windows 7, Windows 8, Windows 10) Compatibility View feature disabled IE 10 (Windows 7 and Windows 8) Platfora supports these web browsers on desktop machines only. Platfora recommends using a screen resolution width of 1400 pixels or greater for viewing some pages in the Platfora web application. Java java openjdk (recommended) Java Sun/Oracle Python Python 2.6.8, 2.7.1, 2.7.3, 2.7.4, 2.7.5, 2.7.6, 2.7.7, only Postgres Database PostgreSQL , (on Amazon AMI), 9.2.5, Page 17

18 Chapter 3 System Requirements (On-Premise) The Platfora software runs on a scale-out cluster of servers. You can install Platfora on a single node to start, and then scale up storage and processing capacity by adding additional nodes. Platfora requires access to an existing, compatible Hadoop implementation in order to start. Users then access the Platfora application using a compatible web browser client. This section describes the system requirements for on-premise deployments of the Platfora servers, Hadoop source systems, network connectivity, and web browser clients. Topics: Platfora Server Requirements Hadoop Resource Requirements Platfora Server Requirements Platfora recommends the following minimum system requirements for Platfora servers. For multi-node installations, the master server and all worker servers must be the same operating system (OS) and system configuration (same amount of memory, CPU, etc.). 64-bit Operating System or Amazon Machine Image (AMIs) CentOS (7.0 is not supported) RHEL (7.0 is not supported) Scientific Linux 6.2 Amazon Linux AMI Oracle Enterprise Linux 6.x Ubuntu LTS or higher Security-Enhanced Linux Software Java 1.7 Python 2.6.8, 2.7.1, through (3.0 not supported) PostgreSQL , 9.2.5, or 9.3 (master only) OpenSSL or higher 2 Unix Utilities rsync, ssh, scp, cp, tar, tail, sysctl, ntp, wget 1 If you wish to install Security-Enhanced Linux, refer to Platfora's Support site for installation instructions. 2 Only required if you want to enable SSL for secure communications between Platfora servers Page 18

19 Platfora Deployment Planning Guide - System Requirements (On-Premise) Memory 64 GB minimum, 256 recommended The server needs enough memory to accommodate actively used lens data. Additionally, it needs 1-2 GB reserved for normal operations and the lens query engine workspace. CPU Disk Network 8 cores minimum, 16 recommended All Platfora nodes (master or worker) require 300 MB for the Platfora installation. Every node requires high-speed local storage and a local disk cache configured as a single logical volume. Hardware RAID is recommended for the best performance. All nodes combined require appropriate free space for aggregated data structures (Platfora lenses). At a minimum, you will need twice the amount of disk space as the amount of system memory. The Platfora master node requires an additional, approximately 850 MB for metadata catalog (dataset definitions, vizboard and visualization definitions, lens definitions, etc.) 1 Gbps reliable network connectivity between Platfora master server and query processing servers 1 Gbps reliable network connectivity between Platfora master server and Hadoop NameNode and JobTracker/ResourceManager node Network bandwidth should be comparable to the amount of memory on the Platfora master server Hadoop Resource Requirements Platfora must be able to connect to an existing Hadoop installation. Platfora also requires permissions and resources in the Hadoop source system. This section describes the Hadoop resource requirements for Platfora. Platfora uses the remote Distributed File System (DFS) of the Hadoop cluster for persistent storage and as the primary data source. Optionally, you can also configure Platfora to use a Hive metastore server as a data source. Page 19

20 Platfora Deployment Planning Guide - System Requirements (On-Premise) Platfora uses the Hadoop MapReduce services to process data and build lenses. For larger lens builds to succeed, Platfora requires minimum resources on the Hadoop cluster for MapReduce tasks. DFS Disk Space DFS Permissions MapReduce Permissions Platfora requires a designated persistent storage directory in the remote distributed file system (DFS) with appropriate free space for Platfora system files and data structures (lenses). The location is configurable. The platfora system user needs read permissions to source data directories and files. The platfora system user needs write permissions to Platfora's persistent storage directory on DFS. The platfora system user needs to be added to the submit-jobs and administer-jobs access control list (or added to a group that has these permissions). DFS Resources Minimum Open File Limit = 5000 MapReduce Resources Minimum Memory for Task Processes = 1 GB Page 20

21 Chapter 4 System Requirements (AWS Cloud) This section describes the system requirements for customers who plan to use Amazon Web Services (AWS) as their installation environment for Platfora, and Simple Storage Service (S3) and Elastic MapReduce (EMR) and as their Hadoop distributed data storage and processing services. Topics: Platfora EC2 Instance Requirements Amazon EMR Instance Requirements AWS Security Settings for Platfora Platfora EC2 Instance Requirements Platfora recommends the following system requirements for Amazon EC2 instances that will serve as Platfora server nodes. For multi-node installations, the master server instance and all worker server instances must be the same configuration (same EC2 instance type, storage configuration, network configuration, etc.). Amazon Machine Images (AMIs) EC2 Instance Type Root Device Volume (EBS) Additional EBS Volumes Amazon Linux AMI x or higher Red Hat Enterprise Linux Ubuntu Server LTS or higher Small to Medium Lens Sizes: c3.8xlarge Medium to Large Lens Sizes, 10+ Platfora nodes: r3.8xlarge Medium to Large Lens Sizes, 1-9 Platfora nodes: i2.8xlarge Recommended Size = 1 TB Type = General Purpose (SSD) Optional. Additional EBS volumes can be attached to an EC2 instance after launch time, and can be used to increase lens cache storage capacity if needed. EBS volumes are less expensive than Instance Store volumes, and the data is persistent between shutdowns. Page 21

22 Platfora Deployment Planning Guide - System Requirements (AWS Cloud) Instance Store Volume (Ephemeral) Enhanced Networking EBS Optimized Instance Availability Zone Placement Group IAM User Other Required Software Required Unix Utilities Optional. You may choose to add instance store volumes for the Platfora lens cache instead of using EBS volumes. This costs more, but offers slightly faster performance. Instance store volumes can only be attached to an EC2 instance at launch time, and the data is not saved when the instance shuts down. The size of an instance store volume depends on the instance type: c3.8xlarge: 2 x 320 GB SSD (640 GB) r3.8xlarge: 2 x 320 GB SSD (640 GB) i2.8xlarge: 8 x 800 GB SSD (6400 GB) yes (requires use of VPC instead of EC2-Classic) yes (the 8xlarge instance types are EBS optimized instances by default) yes (use same zone for all nodes in the Platfora cluster) yes (use same placement group for all nodes in the Platfora cluster) yes (create a dedicated Platfora IAM User in your AWS account) Java 1.7 Python through (3.0 not supported) (master node only) PostgreSQL (AMZN), 9.2.5, or 9.3 OpenSSL or higher 3 rsync, ssh, scp, cp, tar, tail, sysctl, ntp, wget Amazon EMR Instance Requirements Platfora launches an Elastic MapReduce (EMR) cluster when it builds a lens. This section describes the recommended requirements for the EMR instances that are launched by Platfora. Amazon EMR is Hadoop as a web service. Platfora uses the EMR Hadoop cluster to process its lens builds. Since the EMR Hadoop cluster is only instantiated as needed, the source data does not reside in the Hadoop Distributed File System (HDFS) of the EMR Hadoop cluster. The source data is instead stored on Amazon S3. Data is copied from S3 to EMR for data processing, then the results are written back to S3 when the job completes. 3 Only required if you want to enable SSL for secure communications between Platfora servers Page 22

23 Platfora Deployment Planning Guide - System Requirements (AWS Cloud) At the start of a lens build job, the raw source data is copied from S3 to the local HDFS file system on the EMR nodes. The EMR instances must have enough local instance storage to support the input source dataset and the temporary workspace for intermediate lens build job results. Also consider that the local HDFS of the EMR cluster replicates the data to ensure redundancy and high availability during lens build processing. Platfora recommends the i2.4xlarge instance type for EMR data nodes and the m3.xlarge for the EMR name node. The i2.4xlarge offers a great balance between total local disk space, CPU power, and pernode memory size. Hadoop Version AMI Version Amazon EMR 3 (AMI 3.10) EMR NameNode Instance Type EMR DataNode Instance Type Number of EMR DataNodes m3.xlarge i2.4xlarge The number of nodes you will need to complete a lens build depends on the following factors: The size of the raw dataset in S3 that is considered as input to the lens build. The replication factor of HDFS. EMR clusters of 1-4 nodes have a replication factor of 1, 5-9 nodes have a replication factor of 2, and over 10 nodes have a replication factor of 3. Temporary work space for intermediate lens build results - about 20-30% of total disk space. AWS Security Settings for Platfora Amazon Web Services (AWS) has a number of security features that you can use to protect your AWS account and cloud server instances. This section contains security setting recommendations if you plan to use Amazon Elastic MapReduce (EMR) as the Hadoop implementation for your Platfora cluster. Amazon AWS Virtual Private Cloud (VPC) To use Amazon EMR for Hadoop data processing, Platfora must be able to launch an EMR cluster in a public subnet. Administrators do this by provisioning an Amazon VPC with a public subnet, and then specifying the subnet identifier in Platfora. Platfora must create the EMR cluster on an Internet-facing subnet to allow the AWS EMR Provisioning Service to reach the EMR cluster. Additionally, you must ensure the Platfora server can communicate with the Amazon EMR cluster. If the Platfora server is on the same subnet as the Amazon EMR cluster, this happens automatically. If Page 23

24 Platfora Deployment Planning Guide - System Requirements (AWS Cloud) the Platfora server and the EMR cluster are on different VPC subnets, then a route between the subnets needs to be added to the Route table(s) so that communication can occur between the two subnets. Also, if the VPC uses Access Control Lists (ACLs), then those ACLs must be modified to allow traffic from Platfora to Hadoop. The subnet identifier cannot exceed 255 characters in length. After the Amazon VPC has been provisioned, specify its subnet identifier in the platfora.emr.subnet.id Platfora configuration property. For more information on setting up and using an Amazon VPC with Amazon EMR, see docs.aws.amazon.com/elasticmapreduce/latest/developerguide/emr-plan-vpc-subnet.html. IAM User and IAM Roles for Platfora AWS Identity and Access Management (IAM) allows you to create users, groups, and roles to control access to AWS services and resources. Platfora recommends creating an IAM User account and two IAM Roles specifically for use by Platfora. Platfora uses a combination of an IAM User and IAM Roles to communicate with Amazon AWS and to create an EMR cluster. An Amazon AWS administrator needs to create a platfora IAM User and two IAM Roles specifically for use by Platfora. Then a Platfora system administrator needs to enter some information about that user and those roles in Platfora. The Platfora server uses security credentials of the platfora IAM User to request Amazon AWS to create an Amazon EMR cluster. Once that request is approved, the platfora IAM User then passes an IAM Role to actually launch an EMR cluster, and then uses another IAM Role to start EC2 instances in the EMR cluster. You must specify these roles in Platfora. For more details on creating the user and roles, see Create IAM User for Platfora and Create IAM Roles for Platfora. Create IAM User for Platfora The Amazon AWS administrator can create a new platfora user in the IAM Management Console of your AWS account. After creating the user, download the AWS credentials for this user. The Platfora Page 24

25 Platfora Deployment Planning Guide - System Requirements (AWS Cloud) system administrator will need the Access Key Id and Secret Access Key when you initialize Platfora for use with Amazon EMR. The security policy for the platfora IAM User must have (at a minimum) the permissions listed in the following sample policy: { "Version": " ", "Statement": [ { "Action": [ "iam:listroles", "iam:passrole", "elasticmapreduce:*", "s3:getbucketlocation", "s3:listallmybuckets" Page 25

26 Platfora Deployment Planning Guide - System Requirements (AWS Cloud) }, { ], "Effect": "Allow", "Resource": "*" "Effect": "Allow", "Action": [ "s3:listbucket" ], "Resource": [ "arn:aws:s3:::bucket_defined_in_core-site.xml", "arn:aws:s3:::datasource_bucket_1", "arn:aws:s3:::datasource_bucket_n" } ] }, { }, { } ] "Effect": "Allow", "Action": [ "s3:putobject", "s3:get*", "s3:deleteobject", ], "Resource": [ "arn:aws:s3:::bucket_defined_in_core-site.xml/*" ] "Effect": "Allow", "Action": [ "s3:get*" ], "Resource": [ "arn:aws:s3:::datasource_bucket_1/path/to/files/*", "arn:aws:s3:::datasource_bucket_n/*" ] Under Permissions for this user, attach a security policy that contains the permissions listed above. These permissions allow the platfora IAM User to pass an IAM Role to launch the EMR cluster, start an EMR cluster, and access S3 for source data during data ingest. Create IAM Roles for Platfora Amazon requires all AWS users to use IAM Roles to launch EMR clusters. One IAM Role is used to start the Amazon EMR service, and the other role is used by the EC2 instances in the EMR cluster. Amazon AWS offers some default IAM Roles for these services. However, Platfora recommends creating custom IAM Roles specifically for use by Platfora instead. Page 26

27 Platfora Deployment Planning Guide - System Requirements (AWS Cloud) The Amazon AWS administrator can create the IAM Roles in the IAM Management Console of your AWS account. Create a role for each of the following EMR cluster services, and specify them in Platfora using the specified configuration properties: Amazon EMR service (service role). In Amazon AWS, create an IAM Role and attach a security policy that contains at a minimum the permissions specified below. Enter this IAM Role name in the platfora.emr.service.role Platfora configuration property. The custom role you define corresponds to the default IAM Role Amazon offers called EMR_DefaultRole. EC2 instances (instance profile) in the Amazon EMR cluster. In Amazon AWS, create an IAM Role and attach a security policy that contains at a minimum the permissions specified below. Enter this IAM Role name in the platfora.emr.jobflow.role Platfora configuration property. The custom role you define corresponds to the default IAM Role Amazon offers called EMR_EC2_DefaultRole. The security policy for the Amazon EMR service (service role) IAM Role must have (at a minimum) the permissions listed in the following sample policy: { "Version": " ", "Statement": [ { "Action": [ "ec2:authorizesecuritygroupingress", "ec2:cancelspotinstancerequests", "ec2:createsecuritygroup", "ec2:createtags", "ec2:deletetags", "ec2:describe*", "ec2:modifyimageattribute", "ec2:modifyinstanceattribute", "ec2:requestspotinstances", "ec2:runinstances", "ec2:terminateinstances" ], "Effect": "Allow", "Resource": "*" }, { "Action": [ "iam:passrole", "iam:listrolepolicies", "iam:getrole", "iam:getrolepolicy", "iam:listinstanceprofiles" ], "Effect": "Allow", "Resource": "*" }, { "Effect": "Allow", "Action": [ Page 27

28 Platfora Deployment Planning Guide - System Requirements (AWS Cloud) } ] } "s3:get*" ], "Resource": "arn:aws:s3:::bucket_defined_in_core-site.xml/*" The security policy for the EC2 instances (instance profile) IAM Role must have (at a minimum) the permissions listed in the following sample policy: { "Version": " ", "Statement": [ { "Effect": "Allow", "Resource": "*", "Action": [ "ec2:describe*", "elasticmapreduce:describe*", "elasticmapreduce:listbootstrapactions", "elasticmapreduce:listclusters", "elasticmapreduce:listinstancegroups", "elasticmapreduce:listinstances", "elasticmapreduce:liststeps", "s3:listallmybuckets" ] }, { }, { }, { "Effect": "Allow", "Action": [ "s3:listbucket" ], "Resource": [ "arn:aws:s3:::bucket_defined_in_core-site.xml", "arn:aws:s3:::datasource_bucket_1", "arn:aws:s3:::datasource_bucket_n" ] "Effect": "Allow", "Action": [ "s3:putobject", "s3:get*", "s3:deleteobject" ], "Resource": [ "arn:aws:s3:::bucket_defined_in_core-site.xml/*", ] "Effect": "Allow", "Action": [ Page 28

29 Platfora Deployment Planning Guide - System Requirements (AWS Cloud) } ], ] } "s3:get*", "s3:list*" "Resource": [ "arn:aws:s3:::datasource_bucket_1/path/to/files/*", "arn:aws:s3:::datasource_bucket_n/*", "arn:aws:s3:::*elasticmapreduce/*" ] Verify that the permissions for and access to Amazon resources (especially S3) for the EC2 instances role are the same or greater than the permissions and access assigned to the platfora IAM User. For example, if the platfora IAM User can access an Amazon S3 bucket, but the EC2 instances role cannot, then lens builds that rely on that S3 bucket will fail. For more information on using IAM Roles for EMR, see ElasticMapReduce/latest/DeveloperGuide/emr-iam-roles.html. EC2 Security Group Settings EC2 security groups allow you to specify firewalling rules for your Amazon elastic cloud computing (EC2) server instances. EC2 security group rules are independent of, and in addition to, the software firewalling provided by the instance's operating system. Security groups must be defined before you create an EC2 instance. The security group configured for the Platfora server instance must permit connections from your user network to the Platfora web application server port (8001 by default). You also may want to open the EMR Hadoop ResourceManager and JobHistory web ports so that you can monitor and troubleshoot YARN jobs executed by Platfora. An example security group configuration for a Platfora server instance would look something like the following: Page 29

30 Chapter 5 System Requirements (GCP Cloud) This section describes the system requirements for customers who plan to use Google Cloud Platform (GCP) as their installation environment for Platfora, and Google Cloud Storage (GCS) and Cloud Dataproc and as their Hadoop distributed data storage and processing services. Topics: Platfora Compute Engine Machine Requirements Google Dataproc Machine Requirements GCP Security Settings for Platfora Platfora Compute Engine Machine Requirements Platfora recommends the following system requirements for Google Compute Engine machines that will serve as Platfora server nodes. For multi-node installations, the master machine and all worker machines must be the same configuration (same Compute Engine machine type, storage configuration, network configuration, etc.). Machine Boot Disk Operating System Compute Engine Machine Type Boot Disk Drive Additional Disks Debian GNU/Linux 8 (jessie) Debian GNU/Linux 7 (wheezy) CentOS 6 Ubuntu LTS Red Hat Enterprise Linux 6 Small to Medium Lens Sizes: Custom: 32 vcpus and 64 GB of Memory (RAM) Medium to Large Lens Sizes, 1+ Platfora nodes: n1-highmem-32 Recommended Size = 1 TB Type = SSD Persistent Disk Optional. Additional disks can be attached to a Compute Engine machine after launch time, and can be used to increase lens cache storage capacity if needed. Standard Persistent Disks are less expensive than SSD Persistent Disks, and the data is persistent between shutdowns. Page 30

31 Platfora Deployment Planning Guide - System Requirements (GCP Cloud) Zone Google Service Account Other Required Software Required Unix Utilities yes (use the same zone for all nodes in the Platfora cluster) yes (create a dedicated Service Account for Platfora in your Google Cloud Platform account) Java 1.7 Python through (3.0 not supported) (master node only) PostgreSQL 9.2.5, 9.2.7, or 9.3 OpenSSL or higher 4 rsync, ssh, scp, cp, tar, tail, sysctl, ntp, wget Google Dataproc Machine Requirements Platfora launches a Google Cloud Dataproc cluster when it builds a lens. This section describes the recommended requirements for the Dataproc machines that are launched by Platfora. Google Cloud Dataproc is Hadoop as a web service. Platfora uses the Dataproc Hadoop cluster to process its lens builds. Since the Dataproc Hadoop cluster is only instantiated as needed, the source data does not reside in the Hadoop Distributed File System (HDFS) of the Dataproc Hadoop cluster. The source data is instead stored on Google Cloud Storage (GCS). Data is copied from GCS to Dataproc for data processing, then the results are written back to GCS when the job completes. At the start of a lens build job, the raw source data is copied from GCS to the local HDFS file system on the Dataproc nodes. The Dataproc machines must have enough local machine storage to support the input source dataset and the temporary workspace for intermediate lens build job results. Also consider that the local HDFS of the Dataproc cluster replicates the data to ensure redundancy and high availability during lens build processing. Platfora recommends the n1-highmem-16 machine type for Dataproc data nodes and the n1-standard-4 for the Dataproc name node. The n1-highmem-16 machine type offers a great balance between total local disk space, CPU power, and per-node memory size. Hadoop Version Dataproc Software Version Dataproc NameNode Machine Type Dataproc DataNode Machine Type Dataproc 1.0 n1-standard-4 n1-highmem-16 4 Only required if you want to enable SSL for secure communications between Platfora servers Page 31

32 Platfora Deployment Planning Guide - System Requirements (GCP Cloud) Number of Dataproc DataNodes The number of nodes you will need to complete a lens build depends on the following factors: The size of the raw dataset in GCS that is considered as input to the lens build. The replication factor of HDFS. Dataproc clusters of 1-4 nodes have a replication factor of 1, 5-9 nodes have a replication factor of 2, and over 10 nodes have a replication factor of 3. Temporary work space for intermediate lens build results about 20-30% of total disk space. The number of worker nodes in a Dataproc cluster must be a value of two or higher. GCP Security Settings for Platfora Google Cloud Platform has a number of security features that you can use to protect your Google Cloud Platform account and cloud server machines. This section contains security setting recommendations if you plan to use Google Cloud Dataproc as the Hadoop implementation for your Platfora cluster. Google Cloud Service Account for Platfora A service account is a special Google account that can be used by applications to access Google services programmatically. To use any of the Google services (Dataproc, Storage, or BigQuery), you must create a Google service account in your Google Cloud Platform account that is used by Platfora. You will specify this service account for the Compute Engine machines used for the Platfora cluster. Platfora uses the service account when it accesses other Google services. At a minimum, the service account must meet the following requirements: Read access for every Google Cloud Storage bucket that Platfora needs to access. Write access to the Google Cloud Storage bucket where Platfora writes lens build files. Additionally, Google Cloud Platform creates all Dataproc clusters in the default service account. If you use Dataproc as your Hadoop environment, the default service account must have Edit permission to the Google Project. (This is required for Google Cloud Dataproc. Contact Google Support for any questions about this requirement.) Make sure that no Google Cloud Storage bucket access control lists (ACLs) prevent the Platfora service account from accessing the Storage bucket folders it needs. For more information on Google service accounts, see Page 32

33 Platfora Deployment Planning Guide - System Requirements (GCP Cloud) Google Cloud Subnetwork for Platfora Google Cloud Platform allows you to define a network in which all machine instances are located. You can segment the IP addresses in a GCP network into subnets, which GCP calls subnetworks. To use any of the Google services (Dataproc, Storage, or BigQuery), you must create a Google Cloud Platform subnetwork and use that subnetwork name when configuring Platfora. You must ensure the following are true: All nodes of the Platfora cluster are in the same subnetwork. The Dataproc cluster is configured to launch in the same subnetwork as the Platfora cluster. (platfora.gcp.dataproc.subnet.name configuration property) The Firewall rules in the subnetwork allow each node of the Platfora cluster to communicate with the other Platfora nodes and the nodes in the Dataproc cluster. For more information on Google networks, see networking#before-you-begin. Page 33

34 Chapter 6 Port Configuration Requirements You must open ports in the firewall of your Platfora nodes to allow client access and intra-cluster communications. You also must open ports within your Hadoop cluster to allow access from Platfora. This section lists the default ports required. Topics: Ports to Open on Platfora Nodes Ports to Open on Hadoop Nodes Ports to Open on Platfora Nodes Your Platfora master node must allow HTTP connections from your user network. All nodes must allow connections from the other Platfora nodes in a multi-node cluster. On Amazon EC2 instances, you must configure the port firewall rules on the Platfora server instances in addition to the EC2 Security Group Settings. Platfora Service Default Port Allow connections from Master Web Services Port (HTTP) Secure Master Web Services Port (HTTPS) Master Server Management Port Worker Server Management Port 8001 External user network Platfora worker servers localhost 8443 External user network Platfora worker servers localhost 8002 Platfora worker servers localhost 8002 Platfora master server other Platfora worker servers localhost Page 34

35 Platfora Deployment Planning Guide - Port Configuration Requirements Platfora Service Default Port Allow connections from Master Data Port 8003 Platfora worker servers localhost Spark UI 4040 External user network (optional for troubleshooting Spark jobs) Worker Data Port 8003 Platfora master server other Platfora worker servers localhost Master PostgreSQL Database Port 5432 Platfora worker servers localhost Spark Ephemeral Port Range Depends on the OS. For CentOS and Ubuntu, it is to All nodes in the Hadoop cluster, Dataproc cluster, or EMR cluster Ports to Open on Hadoop Nodes Platfora must be able to access certain services of your Hadoop cluster. This section lists the Hadoop services Platfora needs to access and the default ports for those services. Note that this only applies to on-premise Hadoop deployments or to self-managed Hadoop deployments in a virtual private cloud, not to Google Cloud Dataproc or Amazon Elastic MapReduce (EMR). Hadoop Service Default Ports by Hadoop Distro CDH, HDP, MapR Pivotal Allow connections from HDFS NameNode 8020 N/A Platfora master and worker servers HDFS DataNodes N/A Platfora master and worker servers MapRFS CLDB N/A 7222 Platfora master and worker servers MapRFS DataNodes N/A 5660 Platfora master and worker servers YARN ResourceManager Platfora master server Page 35

36 Platfora Deployment Planning Guide - Port Configuration Requirements Hadoop Service Default Ports by Hadoop Distro CDH, HDP, MapR Pivotal Allow connections from YARN ResourceManager Web UI External user network (optional for troubleshooting) YARN Job History Server Platfora master server YARN Job History Server Web UI External user network (optional for troubleshooting) YARN Application Master Depends on mapredsite.xml 5 Depends on mapredsite.xml 6 Platfora master server HiveServer Thrift Port Platfora master server Hive Metastore DB Port 7 Depends on the database used 8 Depends on the database used 9 Platfora master server Spark Server ephemeral port range ephemeral port range Platfora master server To limit the ephemeral port range, see your Linux operating system documentation about changing the net.ipv4.ip_local_port_range OS setting. 5 See yarn.app.mapreduce.am.job.client.port-range property in mapred-site.xml 6 See yarn.app.mapreduce.am.job.client.port-range property in mapred-site.xml 7 If connecting to Hive directly using JDBC 8 For example, MySQL is 3306, and Postgres is For example, MySQL is 3306, and Postgres is Page 36

Platfora Installation Guide

Platfora Installation Guide Platfora Installation Guide Version 5.0 For Amazon EMR Cloud Deployments Copyright Platfora 2015 Last Updated: 12:47 p.m. November 10, 2015 Contents Document Conventions... 5 Contact Platfora Support...6

More information

Platfora Installation Guide

Platfora Installation Guide Platfora Installation Guide Version 4.5 For On-Premise Hadoop Deployments Copyright Platfora 2015 Last Updated: 10:14 p.m. June 28, 2015 Contents Document Conventions... 5 Contact Platfora Support...6

More information

API Reference Guide. API Version 1. Copyright Platfora 2016

API Reference Guide. API Version 1. Copyright Platfora 2016 API Reference Guide API Version 1 Copyright Platfora 2016 Last Updated: 10:05 a.m. April 21, 2016 Contents Document Conventions... 5 Contact Platfora Support...6 Copyright Notices... 6 Chapter 1: Using

More information

Alfresco Enterprise on AWS: Reference Architecture

Alfresco Enterprise on AWS: Reference Architecture Alfresco Enterprise on AWS: Reference Architecture October 2013 (Please consult http://aws.amazon.com/whitepapers/ for the latest version of this paper) Page 1 of 13 Abstract Amazon Web Services (AWS)

More information

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and

More information

Supported Platforms. HP Vertica Analytic Database. Software Version: 7.0.x

Supported Platforms. HP Vertica Analytic Database. Software Version: 7.0.x HP Vertica Analytic Database Software Version: 7.0.x Document Release Date: 5/7/2014 Legal Notices Warranty The only warranties for HP products and services are set forth in the express warranty statements

More information

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments

Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2016 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and

More information

Supported Platforms HPE Vertica Analytic Database. Software Version: 7.2.x

Supported Platforms HPE Vertica Analytic Database. Software Version: 7.2.x HPE Vertica Analytic Database Software Version: 7.2.x Document Release Date: 2/4/2016 Legal Notices Warranty The only warranties for Hewlett Packard Enterprise products and services are set forth in the

More information

MySQL and Virtualization Guide

MySQL and Virtualization Guide MySQL and Virtualization Guide Abstract This is the MySQL and Virtualization extract from the MySQL Reference Manual. For legal information, see the Legal Notices. For help with using MySQL, please visit

More information

JAMF Software Server Installation and Configuration Guide for Linux. Version 9.2

JAMF Software Server Installation and Configuration Guide for Linux. Version 9.2 JAMF Software Server Installation and Configuration Guide for Linux Version 9.2 JAMF Software, LLC 2013 JAMF Software, LLC. All rights reserved. JAMF Software has made all efforts to ensure that this guide

More information

insync Installation Guide

insync Installation Guide insync Installation Guide 5.2 Private Cloud Druva Software June 21, 13 Copyright 2007-2013 Druva Inc. All Rights Reserved. Table of Contents Deploying insync Private Cloud... 4 Installing insync Private

More information

Supported Platforms. HP Vertica Analytic Database. Software Version: 7.1.x

Supported Platforms. HP Vertica Analytic Database. Software Version: 7.1.x HP Vertica Analytic Database Software Version: 7.1.x Document Release Date: 10/14/2015 Legal Notices Warranty The only warranties for HP products and services are set forth in the express warranty statements

More information

Hadoop & Spark Using Amazon EMR

Hadoop & Spark Using Amazon EMR Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?

More information

JAMF Software Server Installation Guide for Linux. Version 8.6

JAMF Software Server Installation Guide for Linux. Version 8.6 JAMF Software Server Installation Guide for Linux Version 8.6 JAMF Software, LLC 2012 JAMF Software, LLC. All rights reserved. JAMF Software has made all efforts to ensure that this guide is accurate.

More information

JAMF Software Server Installation and Configuration Guide for OS X. Version 9.0

JAMF Software Server Installation and Configuration Guide for OS X. Version 9.0 JAMF Software Server Installation and Configuration Guide for OS X Version 9.0 JAMF Software, LLC 2013 JAMF Software, LLC. All rights reserved. JAMF Software has made all efforts to ensure that this guide

More information

H2O on Hadoop. September 30, 2014. www.0xdata.com

H2O on Hadoop. September 30, 2014. www.0xdata.com H2O on Hadoop September 30, 2014 www.0xdata.com H2O on Hadoop Introduction H2O is the open source math & machine learning engine for big data that brings distribution and parallelism to powerful algorithms

More information

JAMF Software Server Installation and Configuration Guide for OS X. Version 9.2

JAMF Software Server Installation and Configuration Guide for OS X. Version 9.2 JAMF Software Server Installation and Configuration Guide for OS X Version 9.2 JAMF Software, LLC 2013 JAMF Software, LLC. All rights reserved. JAMF Software has made all efforts to ensure that this guide

More information

unisys Unisys Stealth(cloud) for Amazon Web Services Deployment Guide Release 1.0 January 2016 8205 5658-001

unisys Unisys Stealth(cloud) for Amazon Web Services Deployment Guide Release 1.0 January 2016 8205 5658-001 unisys Unisys Stealth(cloud) for Amazon Web Services Deployment Guide Release 1.0 January 2016 8205 5658-001 NO WARRANTIES OF ANY NATURE ARE EXTENDED BY THIS DOCUMENT. Any product or related information

More information

Using The Hortonworks Virtual Sandbox

Using The Hortonworks Virtual Sandbox Using The Hortonworks Virtual Sandbox Powered By Apache Hadoop This work by Hortonworks, Inc. is licensed under a Creative Commons Attribution- ShareAlike3.0 Unported License. Legal Notice Copyright 2012

More information

Partek Flow Installation Guide

Partek Flow Installation Guide Partek Flow Installation Guide Partek Flow is a web based application for genomic data analysis and visualization, which can be installed on a desktop computer, compute cluster or cloud. Users can access

More information

IBM Cloud Manager with OpenStack

IBM Cloud Manager with OpenStack IBM Cloud Manager with OpenStack Download Trial Guide Cloud Solutions Team: Cloud Solutions Beta cloudbta@us.ibm.com Page 1 Table of Contents Chapter 1: Introduction...3 Development cycle release scope...3

More information

JAMF Software Server Installation and Configuration Guide for Windows. Version 9.3

JAMF Software Server Installation and Configuration Guide for Windows. Version 9.3 JAMF Software Server Installation and Configuration Guide for Windows Version 9.3 JAMF Software, LLC 2014 JAMF Software, LLC. All rights reserved. JAMF Software has made all efforts to ensure that this

More information

CDH installation & Application Test Report

CDH installation & Application Test Report CDH installation & Application Test Report He Shouchun (SCUID: 00001008350, Email: she@scu.edu) Chapter 1. Prepare the virtual machine... 2 1.1 Download virtual machine software... 2 1.2 Plan the guest

More information

ArcGIS 10.3 Server on Amazon Web Services

ArcGIS 10.3 Server on Amazon Web Services ArcGIS 10.3 Server on Amazon Web Services Copyright 1995-2015 Esri. All rights reserved. Table of Contents Introduction What is ArcGIS Server on Amazon Web Services?............................... 5 Quick

More information

JAMF Software Server Installation and Configuration Guide for Linux. Version 9.0

JAMF Software Server Installation and Configuration Guide for Linux. Version 9.0 JAMF Software Server Installation and Configuration Guide for Linux Version 9.0 JAMF Software, LLC 2013 JAMF Software, LLC. All rights reserved. JAMF Software has made all efforts to ensure that this guide

More information

Deploying Cloudera CDH (Cloudera Distribution Including Apache Hadoop) with Emulex OneConnect OCe14000 Network Adapters

Deploying Cloudera CDH (Cloudera Distribution Including Apache Hadoop) with Emulex OneConnect OCe14000 Network Adapters Deploying Cloudera CDH (Cloudera Distribution Including Apache Hadoop) with Emulex OneConnect OCe14000 Network Adapters Table of Contents Introduction... Hardware requirements... Recommended Hadoop cluster

More information

EMC Data Protection Search

EMC Data Protection Search EMC Data Protection Search Version 1.0 Security Configuration Guide 302-001-611 REV 01 Copyright 2014-2015 EMC Corporation. All rights reserved. Published in USA. Published April 20, 2015 EMC believes

More information

Quick Start Guide for VMware and Windows 7

Quick Start Guide for VMware and Windows 7 PROPALMS VDI Version 2.1 Quick Start Guide for VMware and Windows 7 Rev. 1.1 Published: JULY-2011 1999-2011 Propalms Ltd. All rights reserved. The information contained in this document represents the

More information

Postgres Enterprise Manager Installation Guide

Postgres Enterprise Manager Installation Guide Postgres Enterprise Manager Installation Guide January 22, 2016 Postgres Enterprise Manager Installation Guide, Version 6.0.0 by EnterpriseDB Corporation Copyright 2013-2016 EnterpriseDB Corporation. All

More information

An Introduction to Cloud Computing Concepts

An Introduction to Cloud Computing Concepts Software Engineering Competence Center TUTORIAL An Introduction to Cloud Computing Concepts Practical Steps for Using Amazon EC2 IaaS Technology Ahmed Mohamed Gamaleldin Senior R&D Engineer-SECC ahmed.gamal.eldin@itida.gov.eg

More information

PLATFORA SOLUTION ARCHITECTURE

PLATFORA SOLUTION ARCHITECTURE WHITE PAPER PLATFORA SOLUTION ARCHITECTURE Implementing a Big Data Discovery Solution with Platfora WHITE PAPER PLATFORA SOLUTION ARCHITECTURE Implementing a Big Data Discovery Solution with Platfora INTRODUCTION

More information

OnCommand Cloud Manager 2.2 Storage System Management Guide

OnCommand Cloud Manager 2.2 Storage System Management Guide OnCommand Cloud Manager 2.2 Storage System Management Guide March 2016 215-10912_A0 doccomments@netapp.com Table of Contents 3 Contents Deciding whether to use this guide... 5 Product overview... 6 Cloud

More information

VMware vsphere Big Data Extensions Administrator's and User's Guide

VMware vsphere Big Data Extensions Administrator's and User's Guide VMware vsphere Big Data Extensions Administrator's and User's Guide vsphere Big Data Extensions 1.0 This document supports the version of each product listed and supports all subsequent versions until

More information

System Administration Training Guide. S100 Installation and Site Management

System Administration Training Guide. S100 Installation and Site Management System Administration Training Guide S100 Installation and Site Management Table of contents System Requirements for Acumatica ERP 4.2... 5 Learning Objects:... 5 Web Browser... 5 Server Software... 5

More information

OnCommand Performance Manager 1.1

OnCommand Performance Manager 1.1 OnCommand Performance Manager 1.1 Installation and Setup Guide For Red Hat Enterprise Linux NetApp, Inc. 495 East Java Drive Sunnyvale, CA 94089 U.S. Telephone: +1 (408) 822-6000 Fax: +1 (408) 822-4501

More information

USER CONFERENCE 2011 SAN FRANCISCO APRIL 26 29. Running MarkLogic in the Cloud DEVELOPER LOUNGE LAB

USER CONFERENCE 2011 SAN FRANCISCO APRIL 26 29. Running MarkLogic in the Cloud DEVELOPER LOUNGE LAB USER CONFERENCE 2011 SAN FRANCISCO APRIL 26 29 Running MarkLogic in the Cloud DEVELOPER LOUNGE LAB Table of Contents UNIT 1: Lab description... 3 Pre-requisites:... 3 UNIT 2: Launching an instance on EC2...

More information

INUVIKA OPEN VIRTUAL DESKTOP FOUNDATION SERVER

INUVIKA OPEN VIRTUAL DESKTOP FOUNDATION SERVER INUVIKA OPEN VIRTUAL DESKTOP FOUNDATION SERVER ARCHITECTURE OVERVIEW AND SYSTEM REQUIREMENTS Mathieu SCHIRES Version: 1.0.0 Published March 5, 2015 http://www.inuvika.com Contents 1 Introduction 3 2 Architecture

More information

Acronis and Acronis Secure Zone are registered trademarks of Acronis International GmbH.

Acronis and Acronis Secure Zone are registered trademarks of Acronis International GmbH. 1 Copyright Acronis International GmbH, 2002-2016 Copyright Statement Copyright Acronis International GmbH, 2002-2016. All rights reserved. Acronis and Acronis Secure Zone are registered trademarks of

More information

19.10.11. Amazon Elastic Beanstalk

19.10.11. Amazon Elastic Beanstalk 19.10.11 Amazon Elastic Beanstalk A Short History of AWS Amazon started as an ECommerce startup Original architecture was restructured to be more scalable and easier to maintain Competitive pressure for

More information

CDH 5 Quick Start Guide

CDH 5 Quick Start Guide CDH 5 Quick Start Guide Important Notice (c) 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, and any other product or service names or slogans contained in this

More information

VERSION 9.02 INSTALLATION GUIDE. www.pacifictimesheet.com

VERSION 9.02 INSTALLATION GUIDE. www.pacifictimesheet.com VERSION 9.02 INSTALLATION GUIDE www.pacifictimesheet.com PACIFIC TIMESHEET INSTALLATION GUIDE INTRODUCTION... 4 BUNDLED SOFTWARE... 4 LICENSE KEY... 4 SYSTEM REQUIREMENTS... 5 INSTALLING PACIFIC TIMESHEET

More information

AT&T CLOUD SERVICES. AT&T Synaptic Compute as a Service SM : How to Get Started. Version 2.0 January 2012

AT&T CLOUD SERVICES. AT&T Synaptic Compute as a Service SM : How to Get Started. Version 2.0 January 2012 Version 2.0 January 2012 AT&T CLOUD SERVICES AT&T Synaptic Compute as a Service SM : How to Get Started 2012 AT&T Intellectual Property. All rights reserved. Notice Copyright AT&T Intellectual Property.

More information

Cloudera Manager Installation Guide

Cloudera Manager Installation Guide Cloudera Manager Installation Guide Important Notice (c) 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, and any other product or service names or slogans contained

More information

Integrating SAP BusinessObjects with Hadoop. Using a multi-node Hadoop Cluster

Integrating SAP BusinessObjects with Hadoop. Using a multi-node Hadoop Cluster Integrating SAP BusinessObjects with Hadoop Using a multi-node Hadoop Cluster May 17, 2013 SAP BO HADOOP INTEGRATION Contents 1. Installing a Single Node Hadoop Server... 2 2. Configuring a Multi-Node

More information

RSA Authentication Manager 8.1 Virtual Appliance Getting Started

RSA Authentication Manager 8.1 Virtual Appliance Getting Started RSA Authentication Manager 8.1 Virtual Appliance Getting Started Thank you for purchasing RSA Authentication Manager 8.1, the world s leading two-factor authentication solution. This document provides

More information

Kony MobileFabric. Sync Windows Installation Manual - WebSphere. On-Premises. Release 6.5. Document Relevance and Accuracy

Kony MobileFabric. Sync Windows Installation Manual - WebSphere. On-Premises. Release 6.5. Document Relevance and Accuracy Kony MobileFabric Sync Windows Installation Manual - WebSphere On-Premises Release 6.5 Document Relevance and Accuracy This document is considered relevant to the Release stated on this title page and

More information

Quick Start Guide for Parallels Virtuozzo

Quick Start Guide for Parallels Virtuozzo PROPALMS VDI Version 2.1 Quick Start Guide for Parallels Virtuozzo Rev. 1.1 Published: JULY-2011 1999-2011 Propalms Ltd. All rights reserved. The information contained in this document represents the current

More information

PARALLELS SERVER BARE METAL 5.0 README

PARALLELS SERVER BARE METAL 5.0 README PARALLELS SERVER BARE METAL 5.0 README 1999-2011 Parallels Holdings, Ltd. and its affiliates. All rights reserved. This document provides the first-priority information on the Parallels Server Bare Metal

More information

VMware Identity Manager Connector Installation and Configuration

VMware Identity Manager Connector Installation and Configuration VMware Identity Manager Connector Installation and Configuration VMware Identity Manager This document supports the version of each product listed and supports all subsequent versions until the document

More information

How To Install Acronis Backup & Recovery 11.5 On A Linux Computer

How To Install Acronis Backup & Recovery 11.5 On A Linux Computer Acronis Backup & Recovery 11.5 Server for Linux Update 2 Installation Guide Copyright Statement Copyright Acronis International GmbH, 2002-2013. All rights reserved. Acronis and Acronis Secure Zone are

More information

Prepared for: How to Become Cloud Backup Provider

Prepared for: How to Become Cloud Backup Provider Prepared for: How to Become Cloud Backup Provider Contents Abstract... 3 Introduction... 3 Purpose... 3 Architecture... 4 Result... 4 Requirements... 5 OS... 5 Sizing... 5 Third-party software requirements...

More information

Interworks. Interworks Cloud Platform Installation Guide

Interworks. Interworks Cloud Platform Installation Guide Interworks Interworks Cloud Platform Installation Guide Published: March, 2014 This document contains information proprietary to Interworks and its receipt or possession does not convey any rights to reproduce,

More information

Platfora Big Data Analytics

Platfora Big Data Analytics Platfora Big Data Analytics ISV Partner Solution Case Study and Cisco Unified Computing System Platfora, the leading enterprise big data analytics platform built natively on Hadoop and Spark, delivers

More information

Eucalyptus 3.4.2 User Console Guide

Eucalyptus 3.4.2 User Console Guide Eucalyptus 3.4.2 User Console Guide 2014-02-23 Eucalyptus Systems Eucalyptus Contents 2 Contents User Console Overview...4 Install the Eucalyptus User Console...5 Install on Centos / RHEL 6.3...5 Configure

More information

MATLAB Distributed Computing Server Cloud Center User s Guide

MATLAB Distributed Computing Server Cloud Center User s Guide MATLAB Distributed Computing Server Cloud Center User s Guide How to Contact MathWorks Latest news: Sales and services: User community: Technical support: www.mathworks.com www.mathworks.com/sales_and_services

More information

Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros

Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros David Moses January 2014 Paper on Cloud Computing I Background on Tools and Technologies in Amazon Web Services (AWS) In this paper I will highlight the technologies from the AWS cloud which enable you

More information

Quick Deployment Step-by-step instructions to deploy Oracle Big Data Lite Virtual Machine

Quick Deployment Step-by-step instructions to deploy Oracle Big Data Lite Virtual Machine Quick Deployment Step-by-step instructions to deploy Oracle Big Data Lite Virtual Machine Version 3.0 Please note: This appliance is for testing and educational purposes only; it is unsupported and not

More information

Single Node Hadoop Cluster Setup

Single Node Hadoop Cluster Setup Single Node Hadoop Cluster Setup This document describes how to create Hadoop Single Node cluster in just 30 Minutes on Amazon EC2 cloud. You will learn following topics. Click Here to watch these steps

More information

Amazon Elastic Compute Cloud Getting Started Guide. My experience

Amazon Elastic Compute Cloud Getting Started Guide. My experience Amazon Elastic Compute Cloud Getting Started Guide My experience Prepare Cell Phone Credit Card Register & Activate Pricing(Singapore) Region Amazon EC2 running Linux(SUSE Linux Windows Windows with SQL

More information

http://docs.trendmicro.com

http://docs.trendmicro.com Trend Micro Incorporated reserves the right to make changes to this document and to the products described herein without notice. Before installing and using the product, please review the readme files,

More information

Leveraging SAP HANA & Hortonworks Data Platform to analyze Wikipedia Page Hit Data

Leveraging SAP HANA & Hortonworks Data Platform to analyze Wikipedia Page Hit Data Leveraging SAP HANA & Hortonworks Data Platform to analyze Wikipedia Page Hit Data 1 Introduction SAP HANA is the leading OLTP and OLAP platform delivering instant access and critical business insight

More information

Nasuni Filer Virtualization Getting Started Guide. Version 7.5 June 2016 Last modified: June 9, 2016 2016 Nasuni Corporation All Rights Reserved

Nasuni Filer Virtualization Getting Started Guide. Version 7.5 June 2016 Last modified: June 9, 2016 2016 Nasuni Corporation All Rights Reserved Nasuni Filer Virtualization Getting Started Guide Version 7.5 June 2016 Last modified: June 9, 2016 2016 Nasuni Corporation All Rights Reserved Document Information Nasuni Filer Virtualization Getting

More information

EMC Data Domain Management Center

EMC Data Domain Management Center EMC Data Domain Management Center Version 1.1 Initial Configuration Guide 302-000-071 REV 04 Copyright 2012-2015 EMC Corporation. All rights reserved. Published in USA. Published June, 2015 EMC believes

More information

How To Install An Aneka Cloud On A Windows 7 Computer (For Free)

How To Install An Aneka Cloud On A Windows 7 Computer (For Free) MANJRASOFT PTY LTD Aneka 3.0 Manjrasoft 5/13/2013 This document describes in detail the steps involved in installing and configuring an Aneka Cloud. It covers the prerequisites for the installation, the

More information

How to Install and Configure EBF15328 for MapR 4.0.1 or 4.0.2 with MapReduce v1

How to Install and Configure EBF15328 for MapR 4.0.1 or 4.0.2 with MapReduce v1 How to Install and Configure EBF15328 for MapR 4.0.1 or 4.0.2 with MapReduce v1 1993-2015 Informatica Corporation. No part of this document may be reproduced or transmitted in any form, by any means (electronic,

More information

System Requirements and Platform Support Guide

System Requirements and Platform Support Guide Foglight 5.6.7 System Requirements and Platform Support Guide 2013 Quest Software, Inc. ALL RIGHTS RESERVED. This guide contains proprietary information protected by copyright. The software described in

More information

docs.hortonworks.com

docs.hortonworks.com docs.hortonworks.com Hortonworks Data Platform : Automated Install with Ambari Copyright 2012-2015 Hortonworks, Inc. Some rights reserved. The Hortonworks Data Platform, powered by Apache Hadoop, is a

More information

Amazon EC2 Product Details Page 1 of 5

Amazon EC2 Product Details Page 1 of 5 Amazon EC2 Product Details Page 1 of 5 Amazon EC2 Functionality Amazon EC2 presents a true virtual computing environment, allowing you to use web service interfaces to launch instances with a variety of

More information

A Study of Data Management Technology for Handling Big Data

A Study of Data Management Technology for Handling Big Data Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 9, September 2014,

More information

Amazon Cloud Storage Options

Amazon Cloud Storage Options Amazon Cloud Storage Options Table of Contents 1. Overview of AWS Storage Options 02 2. Why you should use the AWS Storage 02 3. How to get Data into the AWS.03 4. Types of AWS Storage Options.03 5. Object

More information

Installation & Upgrade Guide

Installation & Upgrade Guide Installation & Upgrade Guide Document Release: September 2012 SnapLogic, Inc. 71 East Third Avenue San Mateo, California 94401 U.S.A. www.snaplogic.com Copyright Information 2011-2012 SnapLogic, Inc. All

More information

File S1: Supplementary Information of CloudDOE

File S1: Supplementary Information of CloudDOE File S1: Supplementary Information of CloudDOE Table of Contents 1. Prerequisites of CloudDOE... 2 2. An In-depth Discussion of Deploying a Hadoop Cloud... 2 Prerequisites of deployment... 2 Table S1.

More information

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE

ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE ENABLING GLOBAL HADOOP WITH EMC ELASTIC CLOUD STORAGE Hadoop Storage-as-a-Service ABSTRACT This White Paper illustrates how EMC Elastic Cloud Storage (ECS ) can be used to streamline the Hadoop data analytics

More information

VMware vcenter Log Insight Getting Started Guide

VMware vcenter Log Insight Getting Started Guide VMware vcenter Log Insight Getting Started Guide vcenter Log Insight 1.5 This document supports the version of each product listed and supports all subsequent versions until the document is replaced by

More information

Hadoop Installation MapReduce Examples Jake Karnes

Hadoop Installation MapReduce Examples Jake Karnes Big Data Management Hadoop Installation MapReduce Examples Jake Karnes These slides are based on materials / slides from Cloudera.com Amazon.com Prof. P. Zadrozny's Slides Prerequistes You must have an

More information

Every Silver Lining Has a Vault in the Cloud

Every Silver Lining Has a Vault in the Cloud Irvin Hayes Jr. Autodesk, Inc. PL6015-P Don t worry about acquiring hardware and additional personnel in order to manage your Vault software installation. Learn how to spin up a hosted server instance

More information

AppBoard TM 2.6. System Requirements. Technical Documentation. Version 2.6.0. July 2015

AppBoard TM 2.6. System Requirements. Technical Documentation. Version 2.6.0. July 2015 Technical Documentation AppBoard TM 2.6 System Requirements Version 2.6.0 July 2015 Edge Technologies 1881 Campus Commons Drive Suite 101 Reston, VA 20191 T 703.691.7900 F 703.691.4020 1.888.771.EDGE www.edge-technologies.com

More information

Rally Installation Guide

Rally Installation Guide Rally Installation Guide Rally On-Premises release 2015.1 rallysupport@rallydev.com www.rallydev.com Version 2015.1 Table of Contents Overview... 3 Server requirements... 3 Browser requirements... 3 Access

More information

VMware vcenter Log Insight Getting Started Guide

VMware vcenter Log Insight Getting Started Guide VMware vcenter Log Insight Getting Started Guide vcenter Log Insight 2.0 This document supports the version of each product listed and supports all subsequent versions until the document is replaced by

More information

Revolution R Enterprise 7 Hadoop Configuration Guide

Revolution R Enterprise 7 Hadoop Configuration Guide Revolution R Enterprise 7 Hadoop Configuration Guide The correct bibliographic citation for this manual is as follows: Revolution Analytics, Inc. 2014. Revolution R Enterprise 7 Hadoop Configuration Guide.

More information

docs.hortonworks.com

docs.hortonworks.com docs.hortonworks.com : Security Administration Tools Guide Copyright 2012-2014 Hortonworks, Inc. Some rights reserved. The, powered by Apache Hadoop, is a massively scalable and 100% open source platform

More information

AWS Directory Service. Simple AD Administration Guide Version 1.0

AWS Directory Service. Simple AD Administration Guide Version 1.0 AWS Directory Service Simple AD Administration Guide AWS Directory Service: Simple AD Administration Guide Copyright 2015 Amazon Web Services, Inc. and/or its affiliates. All rights reserved. Amazon's

More information

GeoCloud Project Report GEOSS Clearinghouse

GeoCloud Project Report GEOSS Clearinghouse GeoCloud Project Report GEOSS Clearinghouse Qunying Huang, Doug Nebert, Chaowei Yang, Kai Liu 2011.12.06 Description of Application GEOSS clearinghouse is a FGDC, GEO, and NASA project that connects directly

More information

THE HADOOP DISTRIBUTED FILE SYSTEM

THE HADOOP DISTRIBUTED FILE SYSTEM THE HADOOP DISTRIBUTED FILE SYSTEM Konstantin Shvachko, Hairong Kuang, Sanjay Radia, Robert Chansler Presented by Alexander Pokluda October 7, 2013 Outline Motivation and Overview of Hadoop Architecture,

More information

Resource Sizing: Spotfire for AWS

Resource Sizing: Spotfire for AWS Resource Sizing: for AWS With TIBCO for AWS, you can have the best in analytics software available at your fingertips in just a few clicks. On a single Amazon Machine Image (AMI), you get a multi-user

More information

Dell One Identity Cloud Access Manager 7.0.2. Installation Guide

Dell One Identity Cloud Access Manager 7.0.2. Installation Guide Dell One Identity Cloud Access Manager 7.0.2 2014 Dell Inc. ALL RIGHTS RESERVED. This guide contains proprietary information protected by copyright. The software described in this guide is furnished under

More information

Syncplicity On-Premise Storage Connector

Syncplicity On-Premise Storage Connector Syncplicity On-Premise Storage Connector Implementation Guide Abstract This document explains how to install and configure the Syncplicity On-Premise Storage Connector. In addition, it also describes how

More information

Cloudera Backup and Disaster Recovery

Cloudera Backup and Disaster Recovery Cloudera Backup and Disaster Recovery Important Note: Cloudera Manager 4 and CDH 4 have reached End of Maintenance (EOM) on August 9, 2015. Cloudera will not support or provide patches for any of the Cloudera

More information

How to Test Out Backup & Replication 6.5 for Hyper-V

How to Test Out Backup & Replication 6.5 for Hyper-V How to Test Out Backup & Replication 6.5 for Hyper-V Mike Resseler May, 2013 2013 Veeam Software. All rights reserved. All trademarks are the property of their respective owners. No part of this publication

More information

Apache Hadoop. Alexandru Costan

Apache Hadoop. Alexandru Costan 1 Apache Hadoop Alexandru Costan Big Data Landscape No one-size-fits-all solution: SQL, NoSQL, MapReduce, No standard, except Hadoop 2 Outline What is Hadoop? Who uses it? Architecture HDFS MapReduce Open

More information

Deploy Apache Hadoop with Emulex OneConnect OCe14000 Ethernet Network Adapters

Deploy Apache Hadoop with Emulex OneConnect OCe14000 Ethernet Network Adapters CONNECT - Lab Guide Deploy Apache Hadoop with Emulex OneConnect OCe14000 Ethernet Network Adapters Hardware, software and configuration steps needed to deploy Apache Hadoop 2.4.1 with the Emulex family

More information

F-Secure Messaging Security Gateway. Deployment Guide

F-Secure Messaging Security Gateway. Deployment Guide F-Secure Messaging Security Gateway Deployment Guide TOC F-Secure Messaging Security Gateway Contents Chapter 1: Deploying F-Secure Messaging Security Gateway...3 1.1 The typical product deployment model...4

More information

Verax Service Desk Installation Guide for UNIX and Windows

Verax Service Desk Installation Guide for UNIX and Windows Verax Service Desk Installation Guide for UNIX and Windows March 2015 Version 1.8.7 and higher Verax Service Desk Installation Guide 2 Contact Information: E-mail: sales@veraxsystems.com Internet: http://www.veraxsystems.com/

More information

Tableau Spark SQL Setup Instructions

Tableau Spark SQL Setup Instructions Tableau Spark SQL Setup Instructions 1. Prerequisites 2. Configuring Hive 3. Configuring Spark & Hive 4. Starting the Spark Service and the Spark Thrift Server 5. Connecting Tableau to Spark SQL 5A. Install

More information

Citrix XenServer Workload Balancing 6.5.0 Quick Start. Published February 2015 1.0 Edition

Citrix XenServer Workload Balancing 6.5.0 Quick Start. Published February 2015 1.0 Edition Citrix XenServer Workload Balancing 6.5.0 Quick Start Published February 2015 1.0 Edition Citrix XenServer Workload Balancing 6.5.0 Quick Start Copyright 2015 Citrix Systems. Inc. All Rights Reserved.

More information

Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)

Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud) Open Cloud System (Integration of Eucalyptus, Hadoop and into deployment of University Private Cloud) Thinn Thu Naing University of Computer Studies, Yangon 25 th October 2011 Open Cloud System University

More information

Benchmarking Sahara-based Big-Data-as-a-Service Solutions. Zhidong Yu, Weiting Chen (Intel) Matthew Farrellee (Red Hat) May 2015

Benchmarking Sahara-based Big-Data-as-a-Service Solutions. Zhidong Yu, Weiting Chen (Intel) Matthew Farrellee (Red Hat) May 2015 Benchmarking Sahara-based Big-Data-as-a-Service Solutions Zhidong Yu, Weiting Chen (Intel) Matthew Farrellee (Red Hat) May 2015 Agenda o Why Sahara o Sahara introduction o Deployment considerations o Performance

More information

RSA Authentication Manager 8.1 Setup and Configuration Guide. Revision 2

RSA Authentication Manager 8.1 Setup and Configuration Guide. Revision 2 RSA Authentication Manager 8.1 Setup and Configuration Guide Revision 2 Contact Information Go to the RSA corporate website for regional Customer Support telephone and fax numbers: www.emc.com/domains/rsa/index.htm

More information

LoadRunner and Performance Center v11.52 Technical Awareness Webinar Training

LoadRunner and Performance Center v11.52 Technical Awareness Webinar Training LoadRunner and Performance Center v11.52 Technical Awareness Webinar Training Tony Wong 1 Copyright Copyright 2012 2012 Hewlett-Packard Development Development Company, Company, L.P. The L.P. information

More information