Hadoop Applications on High Performance Computing. Devaraj Kavali
1 Hadoop Applications on High Performance Computing Devaraj Kavali
2 About Me
- Apache Hadoop Committer
- YARN/MapReduce Contributor
- Senior Software Corporation
3 Agenda
- Objectives
- HDFS Applications with HPC File Systems
- YARN Application
- MapReduce Job
- HPC Schedulers
- YARN Protocols
- Log Aggregation
- Shuffle Implementation
- Q&A
4 Objectives
- Use an existing HPC cluster to run Hadoop applications
- Use any HPC file system, such as Lustre, PVFS, or IBRIX Fusion
- Use any HPC scheduler, such as Slurm, Moab, or PBS Pro
- Combine Hadoop workloads with HPC workloads
- No code changes to existing Hadoop (HDFS/YARN/MR) applications
- Minimal Hadoop configuration changes
5 HDFS Applications Using HPC File Systems
[Diagram: an HDFS Application reaches the HPC FS through an adapter class.]
    public class <HPC>Adapter extends AbstractFileSystem { .. }
6 Hadoop Configurations for File System
    <property>
      <name>fs.defaultFS</name>
      <value>${hpc-uri}:///</value>
    </property>
    <property>
      <name>fs.AbstractFileSystem.${hpc-uri}.impl</name>
      <value>hpcfilesystemadapter</value>
    </property>
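A minimal sketch of how a path's scheme could be resolved to an adapter class through the two properties above. It uses a plain `java.util.Map` in place of Hadoop's `Configuration`, and the `lustre` scheme and the `com.example.LustreAdapter` class name are hypothetical placeholders, not names from the slides.

```java
import java.net.URI;
import java.util.HashMap;
import java.util.Map;

class AdapterLookup {
    // Builds the configuration key for a scheme, following the
    // fs.AbstractFileSystem.<scheme>.impl naming pattern.
    static String implKey(String scheme) {
        return "fs.AbstractFileSystem." + scheme + ".impl";
    }

    // Returns the adapter class name configured for the URI's scheme.
    // Paths without a scheme fall back to the scheme of fs.defaultFS.
    static String resolveAdapter(Map<String, String> conf, URI uri) {
        String scheme = uri.getScheme();
        if (scheme == null) {
            scheme = URI.create(conf.get("fs.defaultFS")).getScheme();
        }
        String impl = conf.get(implKey(scheme));
        if (impl == null) {
            throw new IllegalArgumentException("No adapter configured for scheme: " + scheme);
        }
        return impl;
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        conf.put("fs.defaultFS", "lustre:///");                   // hypothetical scheme
        conf.put(implKey("lustre"), "com.example.LustreAdapter"); // hypothetical class
        System.out.println(resolveAdapter(conf, URI.create("/user/data/input.txt")));
        System.out.println(resolveAdapter(conf, URI.create("lustre:///scratch/job1")));
    }
}
```

Both lookups resolve to the same adapter, which is why an application that uses unqualified paths needs no code change once the default file system points at the HPC scheme.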
7 YARN Application
[Diagram: Client, Resource Manager, and two Node Managers; one Node Manager hosts two Containers, the other hosts the App Master and a Container.]
8 YARN Application with HPC Scheduler
[Diagram: Client, HPC Scheduler Master, and two HPC Scheduler Slaves; one slave hosts two Containers, the other hosts the App Master and a Container.]
9 YARN Application Submission with HPC Scheduler
[Diagram. Components: Yarn Application, Yarn Client, HPC Application Client Protocol Impl, HPC Scheduler, Application Master (with AMRM Client and NM Client), HPC Application Master Protocol Impl, HPC Container Management Protocol Impl, Container, Yarn Child.]
Steps as numbered on the slide:
1. submitApplication()
2. submit()
3. run
4. Launch
5. Allocate
6. Run Child Task
7. Launch
8. Report Progress
10 MapReduce Job with HPC Scheduler
[Diagram. Components: Job Client, Job Submitter, LocalClientProtocolProvider, Local Job Runner, YarnClientProtocolProvider, YARN Runner, ResourceMgrDelegate, Yarn Client, ApplicationClientProtocol Proxy, Resource Manager, HPCApplicationClientProtocolImpl, HPC Scheduler, Job History Server.]
Steps as numbered on the slide:
1. submit()
2. submitJobInternal()
3. submitJob()
4. submit()
5. submit()
6. submit()
7. run()
11 YARN Protocols Configurations
RPC class configuration:
    <property>
      <description>RPC class implementation</description>
      <name>yarn.ipc.rpc.class</name>
      <value>HadoopYarnHPCRPC</value>
    </property>
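12 YARN Protocols Configurations
    import java.net.InetSocketAddress;
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.yarn.api.ApplicationClientProtocol;
    import org.apache.hadoop.yarn.api.ApplicationMasterProtocol;
    import org.apache.hadoop.yarn.api.ContainerManagementProtocol;
    import org.apache.hadoop.yarn.ipc.HadoopYarnProtoRPC;

    // Returns HPC-backed implementations for the three YARN protocols and
    // defers to the default protobuf RPC for everything else.
    public class HadoopYarnHPCRPC extends HadoopYarnProtoRPC {
      @Override
      public Object getProxy(Class protocol, InetSocketAddress address,
          Configuration conf) {
        Object proxy;
        if (protocol == ApplicationClientProtocol.class) {
          proxy = new HPCApplicationClientProtocolImpl(conf);
        } else if (protocol == ApplicationMasterProtocol.class) {
          proxy = new HPCApplicationMasterProtocolImpl(conf);
        } else if (protocol == ContainerManagementProtocol.class) {
          proxy = new HPCContainerManagementProtocolImpl(conf);
        } else {
          proxy = super.getProxy(protocol, address, conf);
        }
        return proxy;
      }
    }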
13 Application Client Protocol
YARN application client protocol flow
[Diagram. Components: Yarn Application Client, RMClient, RMApplicationClientProtocolProxy, Resource Manager.]
1. getNewApplication()
2. submitApplication()
3. forceKillApplication()
4. getClusterMetrics()
5. getClusterNodes()
6. getQueueInfo()
..
14 Application Client Protocol
YARN application client HPC scheduler flow
[Diagram. Components: Yarn Application Client, RMClient, HPCApplicationClientProtocolImpl, HPC Scheduler.]
1. Allocate Resources cmd
2. Submit Job (Batch) cmd
3. Cancel Job cmd
4. Cluster Info cmd
5. Get Jobs Report cmd
..
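The mapping from client protocol calls to scheduler commands can be sketched as follows. This is an illustration only: it assumes Slurm as the scheduler, the pairing of calls with `salloc`, `sbatch`, `scancel`, `sinfo`, and `squeue` is a guess at what such an implementation might issue, and `SlurmCommandMapper` and `ClientCall` are hypothetical names. The sketch only builds argument lists and does not run anything.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

class SlurmCommandMapper {
    // Hypothetical labels for the client protocol calls listed on the slides.
    enum ClientCall {
        GET_NEW_APPLICATION, SUBMIT_APPLICATION, FORCE_KILL_APPLICATION,
        GET_CLUSTER_NODES, GET_APPLICATION_REPORT
    }

    // Translates one protocol call into the scheduler command line that an
    // HPC-backed implementation might issue (assumed mapping, Slurm only).
    static List<String> toCommand(ClientCall call, String arg) {
        switch (call) {
            case GET_NEW_APPLICATION:    // allocate resources; arg = node count
                return Arrays.asList("salloc", "--nodes=" + arg);
            case SUBMIT_APPLICATION:     // submit batch job; arg = script path
                return Arrays.asList("sbatch", arg);
            case FORCE_KILL_APPLICATION: // cancel job; arg = job id
                return Arrays.asList("scancel", arg);
            case GET_CLUSTER_NODES:      // cluster info; arg unused
                return Collections.singletonList("sinfo");
            case GET_APPLICATION_REPORT: // job report; arg = job id
                return Arrays.asList("squeue", "-j", arg);
            default:
                throw new IllegalArgumentException("Unsupported call: " + call);
        }
    }

    public static void main(String[] args) {
        System.out.println(toCommand(ClientCall.SUBMIT_APPLICATION, "launch_am.sh"));
        System.out.println(toCommand(ClientCall.FORCE_KILL_APPLICATION, "1234"));
    }
}
```

Keeping the translation in one place means a different scheduler (Moab, PBS Pro) would only need a different mapper, while the YARN client code stays unchanged.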
15 Application Client Protocol
APIs for interaction
1. getNewApplication(): the interface used by clients to obtain a new ApplicationId for submitting new applications.
2. submitApplication(): the interface used by clients to submit a new application to the ResourceManager.
3. forceKillApplication(): the interface used by clients to request that the ResourceManager abort a submitted application.
4. getClusterMetrics()
5. getClusterNodes()
6. getQueueInfo()
16 Application Master Protocol
YARN application master flow diagram
[Diagram. Components: Application Master, RMClient, RMApplicationMasterServiceProtocolProxy, Resource Manager.]
1. registerApplicationMaster()
2. allocate()
3. finishApplicationMaster()
17 Application Master Protocol
HPC scheduler application master flow diagram
[Diagram. Components: Application Master, NMClient, HPCApplicationMasterProtocolImpl, HPC Scheduler.]
1. Cluster Info cmd
2. Resource Allocation cmd
3. Finish Tasks cmds
18 Application Master Protocol
APIs for interaction
1. registerApplicationMaster(): the interface used by a new ApplicationMaster to register with the ResourceManager.
2. allocate(): the main interface between an ApplicationMaster and the ResourceManager.
3. finishApplicationMaster(): the interface used by an ApplicationMaster to notify the ResourceManager about its completion (success or failure).
19 Container Management Protocol
YARN container management flow
[Diagram. Components: Application Master, NMClient, NMContainerManagementProtocolProxy, Node Manager.]
1. startContainers()
2. stopContainers()
3. getContainerStatuses()
20 Container Management Protocol
HPC scheduler task management flow
[Diagram. Components: Application Master, NMClient, HPCContainerManagementProtocolImpl, HPC Scheduler.]
1. Start Containers cmd
2. Stop Containers cmd
3. Get Container Statuses cmd
21 Container Management Protocol
APIs for interaction
1. startContainers(): the ApplicationMaster uses this interface to give a NodeManager a list of StartContainerRequests, so that the NodeManager starts the Containers allocated to it.
2. stopContainers(): the ApplicationMaster uses this interface to ask a NodeManager to stop a list of Containers allocated to it.
3. getContainerStatuses(): the API used by the ApplicationMaster to request the current statuses of Containers from the NodeManager.
22 YARN Log Aggregation
Log aggregation by Node Manager:
    <property>
      <name>yarn.log-aggregation-enable</name>
      <value>true</value>
    </property>
Log aggregation with HPC scheduler:
Issue an HPC scheduler command that runs on all nodes where the application's tasks executed, as part of ApplicationMasterProtocol.finishApplicationMaster(), to aggregate the application logs.
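One way the "command on all task nodes" step might look is sketched below. It assumes Slurm's `srun` as the scheduler command and a plain `cp` into a shared directory as the aggregation step; the class name, the directory paths, and the per-host destination layout are all hypothetical. The sketch builds the argument list only and leaves execution to the caller.

```java
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

class LogAggregationCommand {
    // Builds a single srun invocation that runs one copy task on every
    // distinct node that hosted a task of the application.
    static List<String> build(List<String> taskNodes, String localLogDir, String sharedLogDir) {
        // Several tasks may have run on the same node; keep each node once,
        // in first-seen order.
        Set<String> nodes = new LinkedHashSet<>(taskNodes);
        // Each node copies its local logs into its own subdirectory.
        String copy = "cp -r " + localLogDir + " " + sharedLogDir + "/$(hostname)";
        return Arrays.asList(
            "srun",
            "--nodelist=" + String.join(",", nodes),
            "--nodes=" + nodes.size(),
            "--ntasks-per-node=1",
            "sh", "-c", copy);
    }

    public static void main(String[] args) {
        List<String> cmd = build(
            Arrays.asList("node01", "node02", "node01"),
            "/tmp/hadoop/logs/app_0001",      // hypothetical local log dir
            "/lustre/hadoop/logs/app_0001");  // hypothetical shared log dir
        System.out.println(String.join(" ", cmd));
    }
}
```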
23 Shuffle Handling (Hadoop)
[Diagram. Components: MRAppMaster, Map Task, Reduce Task (EventFetcher, Shuffle Consumer), Node Manager, Local Dirs, all shown on Node 1.]
1. Assign Task
2. Write Final Map O/P (to Local Dirs)
3. Task Completed
4. Get Map Completion Events
5. Map Completed
6. Read Map O/P
24 Shuffle Handling (HPC File Systems)
[Diagram. Components: MRAppMaster, Map Task, Reduce Task (EventFetcher, Shuffle Consumer), Parallel File System.]
1. Assign Task
2. Write Final Map O/P (to the Parallel File System)
3. Task Completed
4. Get Map Completion Events
5. Map Completed
6. Read Map O/P (from the Parallel File System)
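The difference in steps 2 and 6 can be illustrated with a small sketch: the map side writes all partitions into one file on a shared path, and each reducer reads only its own byte range directly from that file instead of fetching it from a Node Manager. The file layout, the offset array, and the `SharedFsShuffle` name are simplifications assumed for illustration; a temporary file stands in for a path on the parallel file system.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

class SharedFsShuffle {
    // Map side: concatenates the per-reducer partitions into one file and
    // returns the start offset of each partition, plus the end offset.
    static long[] writeMapOutput(Path file, String[] partitions) throws IOException {
        long[] offsets = new long[partitions.length + 1];
        try (RandomAccessFile out = new RandomAccessFile(file.toFile(), "rw")) {
            out.setLength(0);
            for (int i = 0; i < partitions.length; i++) {
                offsets[i] = out.getFilePointer();
                out.write(partitions[i].getBytes(StandardCharsets.UTF_8));
            }
            offsets[partitions.length] = out.getFilePointer();
        }
        return offsets;
    }

    // Reduce side: seeks to its partition and reads only that byte range.
    static String readPartition(Path file, long[] offsets, int reducer) throws IOException {
        byte[] buf = new byte[(int) (offsets[reducer + 1] - offsets[reducer])];
        try (RandomAccessFile in = new RandomAccessFile(file.toFile(), "r")) {
            in.seek(offsets[reducer]);
            in.readFully(buf);
        }
        return new String(buf, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        Path shared = Files.createTempFile("map_0_", ".out"); // stand-in for a shared path
        long[] index = writeMapOutput(shared, new String[] {"a=1\n", "b=2\nc=3\n"});
        System.out.print(readPartition(shared, index, 1));
        Files.delete(shared);
    }
}
```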
25 Shuffle Handling
Shuffle Handler:
    <property>
      <name>mapreduce.job.map.output.collector.class</name>
      <value>org.apache.hadoop.mapred.MapTask$MapOutputBuffer</value>
      <description>
        The MapOutputCollector implementation(s) to use. This may be a
        comma-separated list of class names, in which case the map task will
        try to initialize each of the collectors in turn. The first to
        successfully initialize will be used.
      </description>
    </property>
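The "try each class in turn, first success wins" rule in the description can be sketched with plain reflection. This is a simplified illustration rather than the MapTask code: it treats successful construction as successful initialization, and the class names in the example are stand-ins (one deliberately missing, one from the JDK).

```java
class FirstUsablePlugin {
    // Walks a comma-separated list of class names and returns an instance of
    // the first class that can be loaded and constructed.
    static Object instantiateFirst(String commaSeparatedClassNames) {
        for (String name : commaSeparatedClassNames.split(",")) {
            try {
                return Class.forName(name.trim()).getDeclaredConstructor().newInstance();
            } catch (ReflectiveOperationException e) {
                // This candidate is unavailable or cannot be constructed;
                // fall through and try the next one in the list.
            }
        }
        throw new IllegalStateException(
            "None of the configured classes could be initialized: " + commaSeparatedClassNames);
    }

    public static void main(String[] args) {
        // The first name does not exist, so the second one is used.
        Object plugin = instantiateFirst("com.example.MissingCollector, java.util.ArrayList");
        System.out.println(plugin.getClass().getName());
    }
}
```

Listing an HPC-specific collector first and the default buffer second would let the same configuration work on nodes where the HPC class is not installed.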
26 Shuffle Handling
Shuffle Consumer:
    <property>
      <name>mapreduce.job.reduce.shuffle.consumer.plugin.class</name>
      <value>org.apache.hadoop.mapreduce.task.reduce.Shuffle</value>
      <description>
        Name of the class whose instance will be used to send shuffle requests
        by reduce tasks of this job. The class must be an instance of
        org.apache.hadoop.mapred.ShuffleConsumerPlugin.
      </description>
    </property>
27 Summary
- HDFS configuration for new file system
- HPC schedulers
- YARN protocols
- M/R shuffle implementation
- YARN log aggregation
28 Q & A
29 Thank You
30 Notices and Disclaimers Copyright 2014 Intel Corporation. Intel, the Intel logo are trademarks of Intel Corporation in the U.S. and/or other countries. *Other names and brands may be claimed as the property of others. See Trademarks on intel.com for full list of Intel trademarks. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest Intel product specifications and roadmaps Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information about performance and benchmark results, visit Intel does not control or audit third-party benchmark data or the web sites referenced in this document. You should visit the referenced web site and confirm whether referenced data are accurate. Results have been estimated or simulated using internal Intel analysis or architecture simulation or modeling, and provided to you for informational purposes. Any differences in your system hardware, software or configuration may affect your actual performance. Intel technologies may require enabled hardware, specific software, or services activation. Check with your system manufacturer or retailer. No computer system can be absolutely secure. Intel does not assume any liability for lost or stolen data or systems or any damages resulting from such losses. You may not use or facilitate the use of this document in connection with any infringement or other legal analysis concerning Intel products described herein. 
You agree to grant Intel a non-exclusive, royalty-free license to any patent claim thereafter drafted which includes subject matter disclosed herein. No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. The products described may contain design defects or errors known as errata which may cause the product to deviate from publish. 30
Intel Internet of Things (IoT) Developer Kit IoT Cloud-Based Analytics User Guide September 2014 IoT Cloud-Based Analytics User Guide Introduction Table of Contents 1.0 Introduction... 4 1.1. Revision
More informationHadoop Security Analysis NOTE: This is a working draft. Notes are being collected and will be edited for readability.
Hadoop Security Analysis NOTE: This is a working draft. Notes are being collected and will be edited for readability. Introduction This document describes the state of security in a Hadoop YARN cluster.
More informationDeploying Hadoop with Manager
Deploying Hadoop with Manager SUSE Big Data Made Easier Peter Linnell / Sales Engineer plinnell@suse.com Alejandro Bonilla / Sales Engineer abonilla@suse.com 2 Hadoop Core Components 3 Typical Hadoop Distribution
More informationReal-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software
Real-Time Big Data Analytics with the Intel Distribution for Apache Hadoop software Executive Summary is already helping businesses extract value out of Big Data by enabling real-time analysis of diverse
More informationSoftware Evaluation Guide for Autodesk 3ds Max 2009* and Enemy Territory: Quake Wars* Render a 3D character while playing a game
Software Evaluation Guide for Autodesk 3ds Max 2009* and Enemy Territory: Quake Wars* Render a 3D character while playing a game http://www.intel.com/performance/resources Version 2008-09 Rev. 1.0 Information
More informationIntel Desktop Board D945GCPE Specification Update
Intel Desktop Board D945GCPE Specification Update Release Date: July 11, 2007 Order Number: E11670-001US The Intel Desktop Board D945GCPE may contain design defects or errors known as errata, which may
More informationLustre * Filesystem for Cloud and Hadoop *
OpenFabrics Software User Group Workshop Lustre * Filesystem for Cloud and Hadoop * Robert Read, Intel Lustre * for Cloud and Hadoop * Brief Lustre History and Overview Using Lustre with Hadoop Intel Cloud
More informationIntel Desktop Board D101GGC Specification Update
Intel Desktop Board D101GGC Specification Update Release Date: November 2006 Order Number: D38925-003US The Intel Desktop Board D101GGC may contain design defects or errors known as errata, which may cause
More informationIntel Desktop Board DP55WB
Intel Desktop Board DP55WB Specification Update July 2010 Order Number: E80453-004US The Intel Desktop Board DP55WB may contain design defects or errors known as errata, which may cause the product to
More informationBig Data Analytics on Object Storage -- Hadoop over Ceph* Object Storage with SSD Cache
Big Data Analytics on Object Storage -- Hadoop over Ceph* Object Storage with SSD Cache David Cohen (david.e.cohen@intel.com ) Yuan Zhou (yuan.zhou@intel.com) Jun Sun (jun.sun@intel.com) Weiting Chen (weiting.chen@intel.com)
More informationIntel Desktop Board DG31PR
Intel Desktop Board DG31PR Specification Update July 2010 Order Number: E30564-007US The Intel Desktop Board DG31PR may contain design defects or errors known as errata, which may cause the product to
More informationIntel Cloud Builder Guide to Cloud Design and Deployment on Intel Xeon Processor-based Platforms
Intel Cloud Builder Guide to Cloud Design and Deployment on Intel Xeon Processor-based Platforms Enomaly Elastic Computing Platform, * Service Provider Edition Executive Summary Intel Cloud Builder Guide
More informationCDH 5 Quick Start Guide
CDH 5 Quick Start Guide Important Notice (c) 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, and any other product or service names or slogans contained in this
More informationHow to Configure Intel X520 Ethernet Server Adapter Based Virtual Functions on Citrix* XenServer 6.0*
How to Configure Intel X520 Ethernet Server Adapter Based Virtual Functions on Citrix* XenServer 6.0* Technical Brief v1.0 December 2011 Legal Lines and Disclaimers INFORMATION IN THIS DOCUMENT IS PROVIDED
More informationThe Improved Job Scheduling Algorithm of Hadoop Platform
The Improved Job Scheduling Algorithm of Hadoop Platform Yingjie Guo a, Linzhi Wu b, Wei Yu c, Bin Wu d, Xiaotian Wang e a,b,c,d,e University of Chinese Academy of Sciences 100408, China b Email: wulinzhi1001@163.com
More informationIntel vpro Technology Module for Microsoft* Windows PowerShell*
Intel vpro Technology Module for Microsoft* Windows PowerShell* 1 Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL
More information