The Easiest Way to Run Spark Jobs. How-To Guide
|
|
|
- Dominick Carr
- 10 years ago
- Views:
Transcription
1 The Easiest Way to Run Spark Jobs How-To Guide
2 The Easiest Way to Run Spark Jobs Recently, Databricks added a new feature, Jobs, to our cloud service. You can find a detailed overview of this feature in our blog about jobs. This feature allows you to programmatically run Spark jobs on Amazon s EC2 easier than ever before. This how-to will provide a quick tour of this feature. What is a Job? The job feature is very flexible. You can run a job not only on any Spark JAR, but also notebooks you have created with Databricks. In addition, notebooks can be used as scripts to create sophisticated pipelines. 2
3 How to run a Job? As shown below, Databricks offers an intuitive, easy to use interface to create a job. When creating a job, you will first need to specify the name of the job. By default, a job will use a new cluster size of 54GB each time it runs, however you will also have the option to change a few parameters of the cluster to fit your needs: Cluster Type: New or existing cluster. If you choose to use a new cluster, Databricks will also automatically tear down the cluster once the job is completed. Memory: Determines the performance of the job. Spot Instance: You can choose to use Spot Instances to reduce your costs. Next, you need to specify the notebook or the JAR you intend to run as a job, the input arguments of the job (both JARs and notebooks can take input arguments), and the job s configuration parameters: schedule, timeout, alerts, and the type of EC2 instances you would like the job to use. Next, we consider each of these configuration parameters in turn. 3
4 Scheduling You can run any job periodically, by simply specifying the starting time and the interval, as shown below. Timeout Optionally, you can set a timeout which specifies the time the job is allowed to run before being terminated. This feature is especially useful when handling runaway jobs, and to make sure an instance of a periodic job terminates before the next instance begins. If no timeout is specified and a job instance takes more than the scheduled period, no new instances are started before the current one terminates. Alerts When running production jobs, it is critical get alerts when any significant event occurs. Databricks allows you to specify the events you would like to be alerted about via when job starts, when it successfully finishes, or when it finishes with error. Resource type Finally, you can specify whether you would want to use spot or on-demand instances to run the job. 4
5 History and Results The Job UI provides an easy way to inspect the status of each run of a given job. The figure below shows the status of multiple runs of the same job. i.e., when each run starts, how long it takes, and if it has terminated successfully. By clicking on any of the Run x links, you can immediately see the output of the corresponding run including its output logs and errors, if any. The picture below shows the output of Run 6 above. 5
6 Similarly, the figure below shows the output of running a notebook as a job. Incidentally, the output is the same as running the notebook manually. Summary Databricks provides a powerful, yet easy to use feature to run not only Spark JARs compiled by any Spark install, but also notebooks created with Databricks. If you d like to run your own jobs with Databricks, you can evaluate Databricks with a trial account now. Additional Resoures Other Databricks Cloud how-tos can be found at: Analyzing Apache Access Logs with Databricks Cloud Evaluate Databricks Cloud with a trial account now: databricks.com/registration 6
Customer Case Study. Celtra
Customer Case Study Celtra Customer Case Study Celtra Benefits Increased the amount of ad-hoc analysis done six-fold, leading to better informed product design and quicker issue detection and resolution.
Customer Case Study. Automatic Labs
Customer Case Study Automatic Labs Customer Case Study Automatic Labs Benefits Validated product in days Completed complex queries in minutes Freed up 1 full-time data scientist Infrastructure savings
Customer Case Study. Sharethrough
Customer Case Study Customer Case Study Benefits Faster prototyping of new applications Easier debugging of complex pipelines Improved overall engineering team productivity Summary offers a robust advertising
How to Run Spark Application
How to Run Spark Application Junghoon Kang Contents 1 Intro 2 2 How to Install Spark on a Local Machine? 2 2.1 On Ubuntu 14.04.................................... 2 3 How to Run Spark Application on a
Toad for Oracle 12.5.1. Installation Guide
Toad for Oracle 12.5.1 Installation Guide Contents Installation Requirements 3 System Requirements 3 Required Windows Privileges 7 Install Toad 8 Install or Upgrade Toad 8 Install Toad on Citrix or Remote
Big Data on AWS. Services Overview. Bernie Nallamotu Principle Solutions Architect
on AWS Services Overview Bernie Nallamotu Principle Solutions Architect \ So what is it? When your data sets become so large that you have to start innovating around how to collect, store, organize, analyze
Deploying Microsoft Operations Manager with the BIG-IP system and icontrol
Deployment Guide Deploying Microsoft Operations Manager with the BIG-IP system and icontrol Deploying Microsoft Operations Manager with the BIG-IP system and icontrol Welcome to the BIG-IP LTM system -
Databricks. A Primer
Databricks A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to dramatically
TIBCO ActiveMatrix BusinessWorks Plug-in for Big Data Release Notes
TIBCO ActiveMatrix BusinessWorks Plug-in for Big Data Release Notes Software Release 6.1.0 August 2015 Two-Second Advantage 2 Important Information SOME TIBCO SOFTWARE EMBEDS OR BUNDLES OTHER TIBCO SOFTWARE.
Databricks. A Primer
Databricks A Primer Who is Databricks? Databricks vision is to empower anyone to easily build and deploy advanced analytics solutions. The company was founded by the team who created Apache Spark, a powerful
PaperStream Connect. Setup Guide. Version 1.0.0.0. Copyright Fujitsu
PaperStream Connect Setup Guide Version 1.0.0.0 Copyright Fujitsu 2014 Contents Introduction to PaperStream Connect... 2 Setting up PaperStream Capture to Release to Cloud Services... 3 Selecting a Cloud
CS15-319 / 15-619 Cloud Computing. Recitation 2 September 2 & 4, 2014
CS15-319 / 15-619 Cloud Computing Recitation 2 September 2 & 4, 2014 Administrative Issues Overview --TA hours, Piazza posts guidelines Last Week s Reflection --Project Primer, Module 1 This Week s Schedule
Hadoop Setup. 1 Cluster
In order to use HadoopUnit (described in Sect. 3.3.3), a Hadoop cluster needs to be setup. This cluster can be setup manually with physical machines in a local environment, or in the cloud. Creating a
How to Schedule Report Execution and Mailing
SAP Business One How-To Guide PUBLIC How to Schedule Report Execution and Mailing Release Family 8.8 Applicable Releases: SAP Business One 8.81 PL10 and PL11 SAP Business One 8.82 PL01 and later All Countries
Running Knn Spark on EC2 Documentation
Pseudo code Running Knn Spark on EC2 Documentation Preparing to use Amazon AWS First, open a Spark launcher instance. Open a m3.medium account with all default settings. Step 1: Login to the AWS console.
SAP CRM on SAP HANA Getting Started Today with 9 Easy Steps. May 2014
SAP CRM on SAP HANA Getting Started Today with 9 Easy Steps May 2014 SAP CRM on SAP HANA Overview Available on the SAP HANA Marketplace A 30 Day Free Trial Offer featuring a configurable SAP CRM on SAP
Fair Scheduler. Table of contents
Table of contents 1 Purpose... 2 2 Introduction... 2 3 Installation... 3 4 Configuration...3 4.1 Scheduler Parameters in mapred-site.xml...4 4.2 Allocation File (fair-scheduler.xml)... 6 4.3 Access Control
Cloud Performance Benchmark Series
Cloud Performance Benchmark Series Amazon Elastic Load Balancing (ELB) Md. Borhan Uddin Bo He Radu Sion ver. 0.5b 1. Overview Experiments were performed to benchmark the Amazon Elastic Load Balancing (ELB)
SCALING USER-SESSIONS FOR LOAD TESTING OF INTERNET APPLICATIONS
SCALING USER-SESSIONS FOR LOAD TESTING OF INTERNET APPLICATIONS Benjamin Houdeshell IS809 5/14/2014 Background/Motivation Performance/load testing research concerned with the simulation of users behavior
Automating Administration with SQL Agent
Automating Administration with SQL Agent Automating Administration with SQL Agent Objectives Configure SQL Server Agent. Set SQL Server Agent properties. Configure a fail-safe operator. Create operators.
Ali Ghodsi Head of PM and Engineering Databricks
Making Big Data Simple Ali Ghodsi Head of PM and Engineering Databricks Big Data is Hard: A Big Data Project Tasks Tasks Build a Hadoop cluster Challenges Clusters hard to setup and manage Build a data
Automated Performance Testing of Desktop Applications
By Ostap Elyashevskyy Automated Performance Testing of Desktop Applications Introduction For the most part, performance testing is associated with Web applications. This area is more or less covered by
MATLAB Distributed Computing Server Licensing Guide
MATLAB Distributed Computing Server Licensing Guide How to Contact MathWorks Latest news: www.mathworks.com Sales and services: www.mathworks.com/sales_and_services User community: www.mathworks.com/matlabcentral
MONITORING PERFORMANCE IN WINDOWS 7
MONITORING PERFORMANCE IN WINDOWS 7 Performance Monitor In this demo we will take a look at how we can use the Performance Monitor to capture information about our machine performance. We can access Performance
AdWhirl Open Source Server Setup Instructions
AdWhirl Open Source Server Setup Instructions 11/09 AdWhirl Server Setup Instructions The server runs in Amazon s web cloud. To set up the server, you need an Amazon Web Services (AWS) account and the
Uptime Infrastructure Monitor. Installation Guide
Uptime Infrastructure Monitor Installation Guide This guide will walk through each step of installation for Uptime Infrastructure Monitor software on a Windows server. Uptime Infrastructure Monitor is
PC Monitor Enterprise Server. Setup Guide
PC Monitor Enterprise Server Setup Guide Prerequisites Server Requirements - Microsoft Windows Server 2008 R2 or 2012-2GB RAM - IIS 7.5 or IIS 8.0 (with ASP.NET 4.0 installed) - Microsoft SQL Server 2008
Scalable Architecture on Amazon AWS Cloud
Scalable Architecture on Amazon AWS Cloud Kalpak Shah Founder & CEO, Clogeny Technologies [email protected] 1 * http://www.rightscale.com/products/cloud-computing-uses/scalable-website.php 2 Architect
www.basho.com Technical Overview Simple, Scalable, Object Storage Software
www.basho.com Technical Overview Simple, Scalable, Object Storage Software Table of Contents Table of Contents... 1 Introduction & Overview... 1 Architecture... 2 How it Works... 2 APIs and Interfaces...
Renderbot Tutorial. Intro to AWS
Renderbot Tutorial Thanks for choosing to render your Blender projects in the cloud using Renderbot. This guide will introduce Amazon AWS, walk you through the setup process, and help you render your first
Talend Real-Time Big Data Sandbox. Big Data Insights Cookbook
Talend Real-Time Big Data Talend Real-Time Big Data Overview of Real-time Big Data Pre-requisites to run Setup & Talend License Talend Real-Time Big Data Big Data Setup & About this cookbook What is the
TSM Studio Server User Guide 2.9.0.0
TSM Studio Server User Guide 2.9.0.0 1 Table of Contents Disclaimer... 4 What is TSM Studio Server?... 5 System Requirements... 6 Database Requirements... 6 Installing TSM Studio Server... 7 TSM Studio
Backup Utility. Specify when the backup utility runs. You may configure the backup program to run at regular intervals or at a specified time.
Backup Utility 3501175 ver.01 The Buffalo Backup Utility is software to back up and restore data from your PC. To use, select the source drives to be backed up and a target location for backup data. Features
QUICK START GUIDE. Cloud based Web Load, Stress and Functional Testing
QUICK START GUIDE Cloud based Web Load, Stress and Functional Testing Performance testing for the Web is vital for ensuring commercial success. JAR:Load is a Web Load Testing Solution delivered from the
UBUNTU DISK IO BENCHMARK TEST RESULTS
UBUNTU DISK IO BENCHMARK TEST RESULTS FOR JOYENT Revision 2 January 5 th, 2010 The IMS Company Scope: This report summarizes the Disk Input Output (IO) benchmark testing performed in December of 2010 for
COURSE CONTENT Big Data and Hadoop Training
COURSE CONTENT Big Data and Hadoop Training 1. Meet Hadoop Data! Data Storage and Analysis Comparison with Other Systems RDBMS Grid Computing Volunteer Computing A Brief History of Hadoop Apache Hadoop
Making big data simple with Databricks
Making big data simple with Databricks We are Databricks, the company behind Spark Founded by the creators of Apache Spark in 2013 Data 75% Share of Spark code contributed by Databricks in 2014 Value Created
Introduction. Symbol Script Timeout Setting. Sample MES Custom Code in Symbol Script. Application Versions. Sample Code
Tech Note 1006 Managing Symbol Script Timeouts When Using WW MES API Pop-up Dialogs All Tech Notes, Tech Alerts and KBCD documents and software are provided "as is" without warranty of any kind. See the
Hadoop & Spark Using Amazon EMR
Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?
A. Aiken & K. Olukotun PA3
Programming Assignment #3 Hadoop N-Gram Due Tue, Feb 18, 11:59PM In this programming assignment you will use Hadoop s implementation of MapReduce to search Wikipedia. This is not a course in search, so
USER CONFERENCE 2011 SAN FRANCISCO APRIL 26 29. Running MarkLogic in the Cloud DEVELOPER LOUNGE LAB
USER CONFERENCE 2011 SAN FRANCISCO APRIL 26 29 Running MarkLogic in the Cloud DEVELOPER LOUNGE LAB Table of Contents UNIT 1: Lab description... 3 Pre-requisites:... 3 UNIT 2: Launching an instance on EC2...
Informatica Cloud & Redshift Getting Started User Guide
Informatica Cloud & Redshift Getting Started User Guide 2014 Informatica Corporation. No part of this document may be reproduced or transmitted in any form, by any means (electronic, photocopying, recording
EMC ViPR Controller Add-in for Microsoft System Center Virtual Machine Manager
EMC ViPR Controller Add-in for Microsoft System Center Virtual Machine Manager Version 2.3 Installation and Configuration Guide 302-002-080 01 Copyright 2013-2015 EMC Corporation. All rights reserved.
Deploying the BIG-IP LTM with the Cacti Open Source Network Monitoring System
DEPLOYMENT GUIDE Deploying the BIG-IP LTM with the Cacti Open Source Network Monitoring System Version 1.0 Deploying F5 with Cacti Open Source Network Monitoring System Welcome to the F5 and Cacti deployment
Amazon S3 Cloud Backup Solution Contents
Contents 1. Overview... 2 2. Preparation... 2 2-1. Register an AWS account... 2 2-2. Thecus NAS F/W 2.03.01 (Thecus OS 5.0)... 2 3. Backup NAS data to the Amazon S3 cloud... 2 3-1. The Backup Menu... 2
Configuring WMI Performance Monitors
Configuring WMI Performance Monitors With WMI, WhatsUp Gold Premium Edition monitors and sends alerts based on performance counters that are reported from Microsoft Windows devices. The data collected
AWS Account Setup and Services Overview
AWS Account Setup and Services Overview 1. Purpose of the Lab Understand definitions of various Amazon Web Services (AWS) and their use in cloud computing based web applications that are accessible over
Release Notes for McAfee(R) VirusScan(R) Enterprise for Linux Version 1.9.0 Copyright (C) 2014 McAfee, Inc. All Rights Reserved.
Release Notes for McAfee(R) VirusScan(R) Enterprise for Linux Version 1.9.0 Copyright (C) 2014 McAfee, Inc. All Rights Reserved. Release date: August 28, 2014 This build was developed and tested on: -
Hadoop Installation MapReduce Examples Jake Karnes
Big Data Management Hadoop Installation MapReduce Examples Jake Karnes These slides are based on materials / slides from Cloudera.com Amazon.com Prof. P. Zadrozny's Slides Prerequistes You must have an
CSE 344 Introduction to Data Management. Section 9: AWS, Hadoop, Pig Latin TA: Yi-Shu Wei
CSE 344 Introduction to Data Management Section 9: AWS, Hadoop, Pig Latin TA: Yi-Shu Wei Homework 8 Big Data analysis on billion triple dataset using Amazon Web Service (AWS) Billion Triple Set: contains
SUSE Manager in the Public Cloud. SUSE Manager Server in the Public Cloud
SUSE Manager in the Public Cloud SUSE Manager Server in the Public Cloud Contents 1 Instance Requirements... 2 2 Setup... 3 3 Registration of Cloned Systems... 6 SUSE Manager delivers best-in-class Linux
Microsoft HPC. V 1.0 José M. Cámara ([email protected])
Microsoft HPC V 1.0 José M. Cámara ([email protected]) Introduction Microsoft High Performance Computing Package addresses computing power from a rather different approach. It is mainly focused on commodity
How to create Event Filters directly from the Event Viewer
How to create Event Filters directly from the Event Viewer Event Filters determine the action that SNMPc takes when a trap is received or an event is triggered. SNMPc 7.0 supports the ability to create
Getting Started with StoreGrid Cloud
Getting Started with StoreGrid Cloud This document describes the steps to quickly sign up and start backing up your data to StoreGrid Cloud. I. Signing Up 1. Go to http://storegridcloud.vembu.com and select
Using The Hortonworks Virtual Sandbox
Using The Hortonworks Virtual Sandbox Powered By Apache Hadoop This work by Hortonworks, Inc. is licensed under a Creative Commons Attribution- ShareAlike3.0 Unported License. Legal Notice Copyright 2012
Unified Batch & Stream Processing Platform
Unified Batch & Stream Processing Platform Himanshu Bari Director Product Management Most Big Data Use Cases Are About Improving/Re-write EXISTING solutions To KNOWN problems Current Solutions Were Built
Are You Ready for the Holiday Rush?
Are You Ready for the Holiday Rush? Five Survival Tips Written by Joseph Palumbo, Cloud Usability Team Leader Are You Ready for the Holiday Rush? Five Survival Tips Cover Table of Contents 1. Vertical
CycleServer Grid Engine Support Install Guide. version 1.25
CycleServer Grid Engine Support Install Guide version 1.25 Contents CycleServer Grid Engine Guide 1 Administration 1 Requirements 1 Installation 1 Monitoring Additional OGS/SGE/etc Clusters 3 Monitoring
Amazon Web Services (AWS) Setup Guidelines
Amazon Web Services (AWS) Setup Guidelines For CSE6242 HW3, updated version of the guidelines by Diana Maclean [Estimated time needed: 1 hour] Note that important steps are highlighted in yellow. What
Using SUSE Studio to Build and Deploy Applications on Amazon EC2. Guide. Solution Guide Cloud Computing. www.suse.com
Using SUSE Studio to Build and Deploy Applications on Amazon EC2 Guide Solution Guide Cloud Computing Cloud Computing Solution Guide Using SUSE Studio to Build and Deploy Applications on Amazon EC2 Quickly
Databricks Cloud Platform Native REST API 1.1
Databricks Cloud Platform Native REST API 1.1 Overview This document describes the Databricks Native API that can be used by third party applications to interact with the Spark clusters managed by Databricks
vcenter Operations Management Pack for SAP HANA Installation and Configuration Guide
vcenter Operations Management Pack for SAP HANA Installation and Configuration Guide This document supports the version of each product listed and supports all subsequent versions until a new edition replaces
Databricks Cloud Platform Native REST API 1.0
Databricks Cloud Platform Native REST API 1.0 Overview This document describes the Databricks Native API that can be used by third party applications to interact with the Spark clusters managed by Databricks
Version 2.0.2 English 06.08.2015
Version 2.0.2 English 06.08.2015 This HOWTO describes how you can virtualize the IAC-BOX on Hyper-V. Please note the reference table of the minimum hardware requirements. Contents... 1 1. Hints... 2 2.
Cloudera Manager Training: Hands-On Exercises
201408 Cloudera Manager Training: Hands-On Exercises General Notes... 2 In- Class Preparation: Accessing Your Cluster... 3 Self- Study Preparation: Creating Your Cluster... 4 Hands- On Exercise: Working
Silect Software s MP Author
Silect MP Author for Microsoft System Center Operations Manager Silect Software s MP Author User Guide September 2, 2015 Disclaimer The information in this document is furnished for informational use only,
Problems and Measures Regarding Waste 1 Management and 3R Era of public health improvement Situation subsequent to the Meiji Restoration
How to configure an Advanced Expert Probe as NetFlow Collector
created by: Rainer Bemsel Version 1.0 Dated: Jan/31/2012 There are two types of NetFlow collectors in Observer. In most cases, it will likely be the NetFlow Trending collector being configured. The Trending
Monitoring Oracle Enterprise Performance Management System Release 11.1.2.3 Deployments from Oracle Enterprise Manager 12c
Monitoring Oracle Enterprise Performance Management System Release 11.1.2.3 Deployments from Oracle Enterprise Manager 12c This document describes how to set up Oracle Enterprise Manager 12c to monitor
Building Success on Acquia Cloud:
Building Success on Acquia Cloud: 10 Layers of PaaS TECHNICAL Guide Table of Contents Executive Summary.... 3 Introducing the 10 Layers of PaaS... 4 The Foundation: Five Layers of PaaS Infrastructure...
Openbravo ERP Magento. Installation Guide
Openbravo ERP Magento Installation Guide Version 2.0 9 th of December 2010 Copyright 2010, Transitional Data Services, All Rights Reserved No part of this document may be reproduced or transmitted in any
AWS Data Pipeline. Developer Guide API Version 2012-10-29
AWS Data Pipeline Developer Guide Amazon Web Services AWS Data Pipeline: Developer Guide Amazon Web Services What is AWS Data Pipeline?... 1 How Does AWS Data Pipeline Work?... 1 Pipeline Definition...
User Manual: Using Hadoop with WS-PGRADE. workflow.
User Manual: Using Hadoop with WS-PGRADE workflows December 9, 2014 1 About This manual explains the configuration of a set of workflows that can be used to submit a Hadoop job through a WS-PGRADE portal.
WordPress Security Scan Configuration
WordPress Security Scan Configuration To configure the - WordPress Security Scan - plugin in your WordPress driven Blog, login to WordPress as administrator, by simply entering the url_of_your_website/wp-admin
AutoMerge Online Service Configuration for MS CRM 2013
AutoMerge Online Service Configuration for MS CRM 2013 Version 2013.3, December 2013 Online Service Configuration Guide (How to configure AutoMerge Online Service for MS CRM 2013) The content of this document
Spark Application Carousel. Spark Summit East 2015
Spark Application Carousel Spark Summit East 2015 About Today s Talk About Me: Vida Ha - Solutions Engineer at Databricks. Goal: For beginning/early intermediate Spark Developers. Motivate you to start
MyCloudLab: An Interactive Web-based Management System for Cloud Computing Administration
MyCloudLab: An Interactive Web-based Management System for Cloud Computing Administration Hoi-Wan Chan 1, Min Xu 2, Chung-Pan Tang 1, Patrick P. C. Lee 1 & Tsz-Yeung Wong 1, 1 Department of Computer Science
Xopero Backup Build your private cloud backup environment. Getting started
Xopero Backup Build your private cloud backup environment Getting started 07.05.2015 List of contents Introduction... 2 Get Management Center... 2 Setup Xopero to work... 3 Change the admin password...
Amazon Elastic MapReduce. Jinesh Varia Peter Sirota Richard Cole
Amazon Elastic MapReduce Jinesh Varia Peter Sirota Richard Cole Start End From IDE Command line Web Console Notify Input Data Get Results Start End From IDE Command line Web Console AWS EC2 Instance Notify
Getting Started with Database-as-a-Service
Getting Started with Database-as-a-Service VMware vfabric Data Director 2.0 This document supports the version of each product listed and supports all subsequent versions until the document is replaced
BASICS OF SCALING: LOAD BALANCERS
BASICS OF SCALING: LOAD BALANCERS Lately, I ve been doing a lot of work on systems that require a high degree of scalability to handle large traffic spikes. This has led to a lot of questions from friends
SAP Predictive Analysis Installation
SAP Predictive Analysis Installation SAP Predictive Analysis is the latest addition to the SAP BusinessObjects suite and introduces entirely new functionality to the existing Business Objects toolbox.
Microsoft Windows PowerShell v2 For Administrators
Course 50414B: Microsoft Windows PowerShell v2 For Administrators Course Details Course Outline Module 1: Introduction to PowerShell the Basics This module explains how to install and configure PowerShell.
How to create a load testing environment for your web apps using open source tools by Sukrit Dhandhania
How to create a load testing environment for your web apps using open source tools by Sukrit Dhandhania Open source load testing for web putting demand on an application and measuring its response see
How Exclaimer Mail Archiver Works
How Exclaimer Mail Archiver Works www.exclaimer.com How Exclaimer Mail Archiver Works In this document, you ll learn how email is archived, stored and searched by Exclaimer Mail Archiver. We ll look at
Building 1000 node cluster on EMR Manjeet Chayel
Building 1000 node cluster on EMR Manjeet Chayel What is EMR? Amazon Elas+c MapReduce Hadoop- as- a- service Map- Reduce engine What is EMR? Integrated with tools Massively parallel Integrated to AWS services
DocuShare Installation Guide
DocuShare Installation Guide Publication date: February 2011 This document supports DocuShare Release 6.6.1 Prepared by: Xerox Corporation DocuShare Business Unit 3400 Hillview Avenue Palo Alto, California
SQL Server Solutions GETTING STARTED WITH. SQL Safe Backup
SQL Server Solutions GETTING STARTED WITH SQL Safe Backup Purpose of this document Due to its depth and potential for customization, there are often parts of SQL Safe Backup that are overlooked during
How to Prepare for the Upgrade to Microsoft Dynamics CRM 2013 (On-premises)
How to Prepare for the Upgrade to Microsoft Dynamics CRM 2013 (On-premises) COMPANY: Microsoft Corporation RELEASED: September 2013 VERSION: 1.0 Copyright This document is provided "as-is". Information
EWeb: Highly Scalable Client Transparent Fault Tolerant System for Cloud based Web Applications
ECE6102 Dependable Distribute Systems, Fall2010 EWeb: Highly Scalable Client Transparent Fault Tolerant System for Cloud based Web Applications Deepal Jayasinghe, Hyojun Kim, Mohammad M. Hossain, Ali Payani
Ankush Cluster Manager - Hadoop2 Technology User Guide
Ankush Cluster Manager - Hadoop2 Technology User Guide Ankush User Manual 1.5 Ankush User s Guide for Hadoop2, Version 1.5 This manual, and the accompanying software and other documentation, is protected
DocuShare Installation Guide
DocuShare Installation Guide Publication date: May 2009 This document supports DocuShare Release 6.5/DocuShare CPX Release 6.5 Prepared by: Xerox Corporation DocuShare Business Unit 3400 Hillview Avenue
ThermIT: Setting up Alarms
ThermIT: Setting up Alarms Contents Introduction... 2 Alarm Types... 2 Main Module Alarms:... 2 Linked Sensor Alarms... 2 Temperature Alarms:... 3 How to find the alarm page... 3 Alarm overview... 4 Main
How To Choose A Data Flow Pipeline From A Data Processing Platform
S N A P L O G I C T E C H N O L O G Y B R I E F SNAPLOGIC BIG DATA INTEGRATION PROCESSING PLATFORMS 2 W Fifth Avenue Fourth Floor, San Mateo CA, 94402 telephone: 888.494.1570 www.snaplogic.com Big Data
Continuous Integration and Bamboo. Ryan Cutter CSCI 5828 2012 Spring Semester
Continuous Integration and Bamboo Ryan Cutter CSCI 5828 2012 Spring Semester Agenda What is CI and how can it help me? Fundamentals of CI Fundamentals of Bamboo Configuration / Price Quick example Features
