OpenPOWER Software Stack with Big Data Example March 2014
|
|
|
- Jonah Benson
- 10 years ago
- Views:
Transcription
1 OpenPOWER Software Stack with Big Data Example March 2014
2 Driving industry innovation The goal of the OpenPOWER Foundation is to create an open ecosystem, using the POWER Architecture to share expertise, investment, and server-class intellectual property to serve the evolving needs of customers. Opening the architecture to give the industry the ability to innovate across the full Hardware and Software stack Simplify system design with alternative architecture Includes SOC design, Bus Specifications, Reference Designs, FW OS and Hypervisor Open Source Little Endian Linux to ease the migration of software to POWER Driving an expansion of enterprise class Hardware and Software stack for the data center Building a complete ecosystem to provide customers with the flexibility to build servers best suited to the Power architecture
3 Giving ecosystem partners a license to innovate OpenPOWER will enable data centers to rethink their approach to technology. Member companies may use POWER for custom open servers and components for Linux based cloud data centers. OpenPOWER ecosystem partners can optimize the interactions of server building blocks microprocessors, networking, I/O & other components to tune performance. How will the OpenPOWER Foundation benefit clients? OpenPOWER technology creates greater choice for customers Open and collaborative development model on the Power platform will create more opportunity for innovation New innovators will broaden the capability and value of the Power platform What does this mean to the industry? Game changer on the competitive landscape of the server industry Will enable and drive innovation in the industry Provide more choice in the industry Platinum Members
4 PCIe Proposed Ecosystem Enablement Power Open Source Software Stack Components Software Cloud Software Standard Operating Environment (System Mgmt) Operating System / KVM Firmware XCAT OpenPOWER Firmware Existing Open Source Software Communitie s New OSS Community System Operating Environment Software Stack A modern development environment is emerging based on this type of tools and services Hardware OpenPOWER Technology Multiple Options to Design with POWER Technology Within OpenPOWER POWER8 Framework to Integrate System IP on Chip Industry IP License Model Hardware CAPP CAPI over PCIe Standard POWER Products 2014 Customizable Custom POWER SoC Future
5 Software Stack
6 OpenPOWER Enables Hybrid Cloud OpenPOWER SW Stack Commonality Linux to provide commonality for: Operating system management Operating system features Application programming model Little Endian to provide source code and data commonality KVM to provide virtualization management and feature commonality Firmware interfaces to provide platform management commonality Migrating Software to OpenPOWER Software written in interpreted languages (Javascript, PHP, Perl, Python, Ruby, Java, etc.) Generally, no work is required. Software written in compiled languages (C/C++, Fortran, etc.) Generally, this requires just a simple recompile for POWER. Rarely, dependencies on specific behaviors can require source code modification: Multi-threaded applications that don t use standard synchronization models and depend upon specific memory ordering behavior (unusual) Applications that depend upon specific memory page sizes (rare)
7 Development for PowerLinux 7 x86 Platform Architect & Develop Eclipse-based PowerLinux SDK Advanced toolchain Gnu tools, LLVM, Gold linker IBM JVM, OpenJDK Edit, Compile, & Debug SDK Source Code Advisor SDK integration of oprofile, Helgrind, Valgrind, FDPR, CPI breakdown, etc. Profile & Tune SDK support for dynamic languages SDK Migration Advisor Power-Linux support for SOE stacks Open source development infrastructure support Migrate / Publish SDK remote deployment control SDK Power emulator for x86 HW Access (Local/ Remote) Profile & Tune Dev/Test Cloud Eval Boards POWER System HW
8 LE Toolchain & Development Ecosystem Build Tools GCC (+binutils+glibc) toolchain (AT native and cross) config.guess, config.sub, M4, automake, autoconf, libtool, pkg-config, Gcov, rpm, Perf, SystemTap Profiling and Tracing Tools Oprofile (operf, ocount, perf-events), Valgrind (memcheck, massiff, cachegrind, hellgrind, itrace) PowerLinux Multi-core Runtime TBB, CBB, userrcu, SPHDE, tcmalloc, Boost,... IBM Tools (binary only) FDPR (bprober) Pthread_mon (sync_mon) Mambo (simulator) Eclipse SDK Tools PPC64LE JVM (prereq - binary only) PPC64LE Eclipse IES: Native code + CDT, PTP, LTP,... PPC64LE Eclipse PowerLinux plugins: MA, SCA, CPI, TA,...
9 CUDA / GPU / POWER Technology Kit Technology Kit Available in 3Q2014 CUDA toolkit and runtime for POWER Linux for POWER NVIDIA Tesla GPUs POWER platform AIX Kernel Contact Richard Talbot, IBM rdtalbot <at> us.ibm.com
10 Java GPU Acceleration
11 Java GPU Acceleration on OpenPOWER IBM and NVIDIA partnering to bring GPU acceleration to mainstream Java workloads and enterprise environments on OpenPOWER Evolve the Java support to enable both transparent Java acceleration and Java programmer enablement to program explicitly for GPUs from Java Key compute-intensive algorithms in Java standard libraries and domain-specific libraries to be accelerated with GPU First step was to build a GPU control and runtime framework in Java Allow Java programmers to do data transfers and launch GPU kernels Gain experience in APIs and GPU programming models applied to Java Demo! 11 OpenPOWER Foundation 2014
12 Java GPU Framework Overview Exploring Java equivalents of CUDA concepts Reuse concepts defined by CUDA in Java Improve usability with familiar Java idioms Key mappings from Java to CUDA CudaDevice a CUDA-capable device CudaBuffer a region of device memory CudaException for when something goes wrong CudaModule code loaded onto a device CudaFunction a kernel entry point CudaKernel for launching a device function CudaEvent for timing or inter-stream synchronization CudaStream a CUDA stream Key Benefits Java write once, run anywhere extended to include GPU code Using exceptions removes the tedium of error checking and avoids problems of omissions Provides a foundation for building more general class libraries and reusing existing GPU code 12 OpenPOWER Foundation 2014
13 Java GPU Framework Device Management // how many devices do we have? int devicecount = CudaDevice.getCount(); // create a handle for the first device CudaDevice device = new CudaDevice(0); // how many multiprocessors does the device have? int smcount = device.getattribute( CudaDevice.ATTRIBUTE_MULTIPROCESSOR_COUNT); // add a Runnable callback (to the default stream) device.addcallback(action); // wait for a device to complete queued work device.synchronize(); Prototype API shown here is subject to change or withdrawal without notice, and represents goals and objectives only. 13 OpenPOWER Foundation 2014
14 Java GPU Framework Memory Management // allocate memory on a device CudaBuffer buffer = new CudaBuffer(device, bytecount); // copy data from Java array to device memory buffer.copyfrom(array, startindex, endindex); // copy data to Java array from device memory buffer.copyto(array, startindex, endindex); // clear device memory buffer.fillbyte(0, bytecount); // release device memory buffer.close(); 14 OpenPOWER Foundation 2014
15 Java GPU Framework Launching Kernels // load a module onto a device CudaModule module = new CudaModule(device, content); // locate a kernel function CudaKernel kernel = new CudaKernel(module, kernelname); // define grid size CudaGrid grid = new CudaGrid(gridDim, blockdim); // build bundle of launch arguments CudaParameters args = new CudaParameters(2); args.set(0, buffer); args.set(1, n); // launch kernel kernel.launch(grid, args); 15 OpenPOWER Foundation 2014
16 Java GPU Framework Events // create a new event CudaEvent event = new CudaEvent(); // queue an event device.record(event); stream.record(event); // wait until an event has occurred event.synchronize(); // check whether an event has occurred int errorcode = event.query(); // query the time between events float millis = event.elapsedtimesince(priorevent); // destroy an event event.close(); 16 OpenPOWER Foundation 2014
17 Java GPU Framework Streams // create a new stream CudaStream stream = new CudaStream(device); // make stream wait for an event stream.waitfor(event); // wait until all items have been processed stream.synchronize(); // check whether all items have been processed int errorcode = stream.query(); // destroy a stream stream.close(); 17 OpenPOWER Foundation 2014
18 Automatic Resource Management Relevant classes implement AutoCloseable to enable automatic release via Java try-with-resources statement Manual memory management with fewer errors int[] data =... try (CudaBuffer buffer = new CudaBuffer(device, data.length * 4)) { // copy from Java to device buffer.copyfrom(data, 0, data.length); // launch kernels, etc. // copy from device to Java buffer.copyto(data, 0, data.length); } // buffer.close() called automatically 18 OpenPOWER Foundation 2014
19 GTC Demo Motivation Large global retailers collect petabytes of data from customer transactions They want to identify customers with similar behavior, so they can create customized products and more effective marketing programs K-Means is a well-known algorithm for performing clustering analysis that can identify these segments Businesses that react quickly to changes in market segments will have an advantage: we want a faster implementation Apache Mahout is an open-source Java implementation which is applicable to Big Data because it is built on top of Apache Hadoop Can we do better using GPUs? 19 OpenPOWER Foundation 2014
20 GTC Demo K-Means Algorithm Overview Input consists of: D the number dimensions in each data point K the number of clusters sought a set of N sample data points a set of K points which are approximate representatives of the clusters The output is an improved set of cluster representatives The process iterates until the set of representatives stabilizes 20 OpenPOWER Foundation 2014
21 GTC Demo K-Means Map-Reduce Structure Each point is logically mapped to a matrix with K rows and 2D+1 columns (the table below assumes D=2) The non-zero row corresponds to the closest input cluster center Count X Y X^2 Y^ x y x^2 y^ The reduction phase computes the simple sum of these matrices. From this, the output centers and radii can be derived 21 OpenPOWER Foundation 2014
22 GTC Demo K-Means Map-Reduce Structure (continued) The map computation can be parallelized (at a finer granularity than managed by Hadoop); a new mapper class was created Batches of points are transferred to a GPU for mapping A partial reduction is returned to the host A single Mahout class was modified only to allow a system property to select the (new) mapper class 22 OpenPOWER Foundation 2014
23 GTC Demo Performance Comparison NVIDIA K40 GPU Speed-up factor vs. modern CPU (higher is better) Baseline is standard Mahout implementation Comparison excludes Hadoop framework and I/O D=1 D=2 D=4 D=8 D=16 D=32 D=64 K= K= K= K= K= K= Measurements completed in a controlled environment. Results may vary by customer based on individual workload, configuration and software levels. 23 OpenPOWER Foundation 2014
24 GTC Demo Clustering of Big Data 24 OpenPOWER Foundation 2014
25 What s Next? Java 8 platform adding lambda support Concise syntax for writing small snippets of code used in bulk operations (kernels) Perfect for GPUs, parallel operation and high core count machines Standard class libraries will leverage GPUs where it makes sense Bulk operations (map, reduce) on Java collections New parallel operations defined as part of a standard JCP process Java platform needs to evolve for best performance optimal layout and control of memory needed for highest performance IBM PackedObjects, Oracle value types evolving for future Java platform definitions Demo in booth #921 in the Exhibit Hall Related talks: NVIDIA compiling Java kernels to PTX as a proof of concept see their talks S4830 CUDA 6 and Beyond S4939 Accelerating Java on GPUs 25 OpenPOWER Foundation 2014
26 Questions?
OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC
OpenPOWER Outlook AXEL KOEHLER SR. SOLUTION ARCHITECT HPC Driving industry innovation The goal of the OpenPOWER Foundation is to create an open ecosystem, using the POWER Architecture to share expertise,
Introducing the IBM Software Development Kit for PowerLinux
Introducing the IBM Software Development Kit for PowerLinux Wainer S. Moschetta IBM, PowerLinux SDK Team Leader [email protected] 1 2009 IBM Acknowledgments The information in this presentation was created
Applications to Computational Financial and GPU Computing. May 16th. Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61
F# Applications to Computational Financial and GPU Computing May 16th Dr. Daniel Egloff +41 44 520 01 17 +41 79 430 03 61 Today! Why care about F#? Just another fashion?! Three success stories! How Alea.cuBase
Programming models for heterogeneous computing. Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga
Programming models for heterogeneous computing Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga Talk outline [30 slides] 1. Introduction [5 slides] 2.
ANDROID DEVELOPER TOOLS TRAINING GTC 2014. Sébastien Dominé, NVIDIA
ANDROID DEVELOPER TOOLS TRAINING GTC 2014 Sébastien Dominé, NVIDIA AGENDA NVIDIA Developer Tools Introduction Multi-core CPU tools Graphics Developer Tools Compute Developer Tools NVIDIA Developer Tools
Data Center and Cloud Computing Market Landscape and Challenges
Data Center and Cloud Computing Market Landscape and Challenges Manoj Roge, Director Wired & Data Center Solutions Xilinx Inc. #OpenPOWERSummit 1 Outline Data Center Trends Technology Challenges Solution
Introducing PgOpenCL A New PostgreSQL Procedural Language Unlocking the Power of the GPU! By Tim Child
Introducing A New PostgreSQL Procedural Language Unlocking the Power of the GPU! By Tim Child Bio Tim Child 35 years experience of software development Formerly VP Oracle Corporation VP BEA Systems Inc.
Data Centric Systems (DCS)
Data Centric Systems (DCS) Architecture and Solutions for High Performance Computing, Big Data and High Performance Analytics High Performance Computing with Data Centric Systems 1 Data Centric Systems
Overview. Lecture 1: an introduction to CUDA. Hardware view. Hardware view. hardware view software view CUDA programming
Overview Lecture 1: an introduction to CUDA Mike Giles [email protected] hardware view software view Oxford University Mathematical Institute Oxford e-research Centre Lecture 1 p. 1 Lecture 1 p.
Infrastructure Matters: POWER8 vs. Xeon x86
Advisory Infrastructure Matters: POWER8 vs. Xeon x86 Executive Summary This report compares IBM s new POWER8-based scale-out Power System to Intel E5 v2 x86- based scale-out systems. A follow-on report
NVIDIA CUDA GETTING STARTED GUIDE FOR MAC OS X
NVIDIA CUDA GETTING STARTED GUIDE FOR MAC OS X DU-05348-001_v6.5 August 2014 Installation and Verification on Mac OS X TABLE OF CONTENTS Chapter 1. Introduction...1 1.1. System Requirements... 1 1.2. About
Multi-core Programming System Overview
Multi-core Programming System Overview Based on slides from Intel Software College and Multi-Core Programming increasing performance through software multi-threading by Shameem Akhter and Jason Roberts,
Implement Hadoop jobs to extract business value from large and varied data sets
Hadoop Development for Big Data Solutions: Hands-On You Will Learn How To: Implement Hadoop jobs to extract business value from large and varied data sets Write, customize and deploy MapReduce jobs to
Applied Micro development platform. ZT Systems (ST based) HP Redstone platform. Mitac Dell Copper platform. ARM in Servers
ZT Systems (ST based) Applied Micro development platform HP Redstone platform Mitac Dell Copper platform ARM in Servers 1 Server Ecosystem Momentum 2009: Internal ARM trials hosting part of website on
The Virtualization Practice
The Virtualization Practice White Paper: Managing Applications in Docker Containers Bernd Harzog Analyst Virtualization and Cloud Performance Management October 2014 Abstract Docker has captured the attention
HPC Wales Skills Academy Course Catalogue 2015
HPC Wales Skills Academy Course Catalogue 2015 Overview The HPC Wales Skills Academy provides a variety of courses and workshops aimed at building skills in High Performance Computing (HPC). Our courses
Development With ARM DS-5. Mervyn Liu FAE Aug. 2015
Development With ARM DS-5 Mervyn Liu FAE Aug. 2015 1 Support for all Stages of Product Development Single IDE, compiler, debug, trace and performance analysis for all stages in the product development
Linux Performance Optimizations for Big Data Environments
Linux Performance Optimizations for Big Data Environments Dominique A. Heger Ph.D. DHTechnologies (Performance, Capacity, Scalability) www.dhtusa.com Data Nubes (Big Data, Hadoop, ML) www.datanubes.com
High Performance or Cycle Accuracy?
CHIP DESIGN High Performance or Cycle Accuracy? You can have both! Bill Neifert, Carbon Design Systems Rob Kaye, ARM ATC-100 AGENDA Modelling 101 & Programmer s View (PV) Models Cycle Accurate Models Bringing
Accelerating Hadoop MapReduce Using an In-Memory Data Grid
Accelerating Hadoop MapReduce Using an In-Memory Data Grid By David L. Brinker and William L. Bain, ScaleOut Software, Inc. 2013 ScaleOut Software, Inc. 12/27/2012 H adoop has been widely embraced for
Parallel Computing: Strategies and Implications. Dori Exterman CTO IncrediBuild.
Parallel Computing: Strategies and Implications Dori Exterman CTO IncrediBuild. In this session we will discuss Multi-threaded vs. Multi-Process Choosing between Multi-Core or Multi- Threaded development
Le langage OCaml et la programmation des GPU
Le langage OCaml et la programmation des GPU GPU programming with OCaml Mathias Bourgoin - Emmanuel Chailloux - Jean-Luc Lamotte Le projet OpenGPU : un an plus tard Ecole Polytechnique - 8 juin 2011 Outline
E6895 Advanced Big Data Analytics Lecture 14:! NVIDIA GPU Examples and GPU on ios devices
E6895 Advanced Big Data Analytics Lecture 14: NVIDIA GPU Examples and GPU on ios devices Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist,
Introduction to GPU Programming Languages
CSC 391/691: GPU Programming Fall 2011 Introduction to GPU Programming Languages Copyright 2011 Samuel S. Cho http://www.umiacs.umd.edu/ research/gpu/facilities.html Maryland CPU/GPU Cluster Infrastructure
Android Development: a System Perspective. Javier Orensanz
Android Development: a System Perspective Javier Orensanz 1 ARM - Linux and Communities Linux kernel GNU Tools 2 Linaro Partner Initiative Mission: Make open source development easier by delivering a common
The red hat enterprise linux developer program
Program Guide The red hat enterprise linux developer program access essential resources and an ecosystem of experts to develop great applications Key benefits Collaborate with a passionate developer community
Take full advantage of IBM s IDEs for end- to- end mobile development
Take full advantage of IBM s IDEs for end- to- end mobile development ABSTRACT Mobile development with Rational Application Developer 8.5, Rational Software Architect 8.5, Rational Developer for zenterprise
Building Applications Using Micro Focus COBOL
Building Applications Using Micro Focus COBOL Abstract If you look through the Micro Focus COBOL documentation, you will see many different executable file types referenced: int, gnt, exe, dll and others.
Eddy Integrated Development Environment, LemonIDE for Embedded Software System Development
Introduction to -based solution for embedded software development Section 1 Eddy Real-Time, Lemonix Section 2 Eddy Integrated Development Environment, LemonIDE Section 3 Eddy Utility Programs Eddy Integrated
NVIDIA CUDA GETTING STARTED GUIDE FOR MICROSOFT WINDOWS
NVIDIA CUDA GETTING STARTED GUIDE FOR MICROSOFT WINDOWS DU-05349-001_v6.0 February 2014 Installation and Verification on TABLE OF CONTENTS Chapter 1. Introduction...1 1.1. System Requirements... 1 1.2.
The Top Six Advantages of CUDA-Ready Clusters. Ian Lumb Bright Evangelist
The Top Six Advantages of CUDA-Ready Clusters Ian Lumb Bright Evangelist GTC Express Webinar January 21, 2015 We scientists are time-constrained, said Dr. Yamanaka. Our priority is our research, not managing
Multi-/Many-core Modeling at Freescale
Multi-/Many-core Modeling at Freescale David Murrell, Jim Holt, Michele Reese Perspective: Virtual Platform ROI Schedules: SW and HW available on day 1 becoming an industry expectation FSL P4080 Linux
Chapter 2 System Structures
Chapter 2 System Structures Operating-System Structures Goals: Provide a way to understand an operating systems Services Interface System Components The type of system desired is the basis for choices
High Performance Cloud: a MapReduce and GPGPU Based Hybrid Approach
High Performance Cloud: a MapReduce and GPGPU Based Hybrid Approach Beniamino Di Martino, Antonio Esposito and Andrea Barbato Department of Industrial and Information Engineering Second University of Naples
Manjrasoft Market Oriented Cloud Computing Platform
Manjrasoft Market Oriented Cloud Computing Platform Aneka Aneka is a market oriented Cloud development and management platform with rapid application development and workload distribution capabilities.
Parallel Computing for Data Science
Parallel Computing for Data Science With Examples in R, C++ and CUDA Norman Matloff University of California, Davis USA (g) CRC Press Taylor & Francis Group Boca Raton London New York CRC Press is an imprint
NVIDIA CUDA Software and GPU Parallel Computing Architecture. David B. Kirk, Chief Scientist
NVIDIA CUDA Software and GPU Parallel Computing Architecture David B. Kirk, Chief Scientist Outline Applications of GPU Computing CUDA Programming Model Overview Programming in CUDA The Basics How to Get
Enhanced Project Management for Embedded C/C++ Programming using Software Components
Enhanced Project Management for Embedded C/C++ Programming using Software Components Evgueni Driouk Principal Software Engineer MCU Development Tools 1 Outline Introduction Challenges of embedded software
Bringing Big Data Modelling into the Hands of Domain Experts
Bringing Big Data Modelling into the Hands of Domain Experts David Willingham Senior Application Engineer MathWorks [email protected] 2015 The MathWorks, Inc. 1 Data is the sword of the
IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM
IBM Platform Computing : infrastructure management for HPC solutions on OpenPOWER Jing Li, Software Development Manager IBM #OpenPOWERSummit Join the conversation at #OpenPOWERSummit 1 Scale-out and Cloud
Architecting for the next generation of Big Data Hortonworks HDP 2.0 on Red Hat Enterprise Linux 6 with OpenJDK 7
Architecting for the next generation of Big Data Hortonworks HDP 2.0 on Red Hat Enterprise Linux 6 with OpenJDK 7 Yan Fisher Senior Principal Product Marketing Manager, Red Hat Rohit Bakhshi Product Manager,
NVIDIA CUDA GETTING STARTED GUIDE FOR MAC OS X
NVIDIA CUDA GETTING STARTED GUIDE FOR MAC OS X DU-05348-001_v5.5 July 2013 Installation and Verification on Mac OS X TABLE OF CONTENTS Chapter 1. Introduction...1 1.1. System Requirements... 1 1.2. About
An Easier Way for Cross-Platform Data Acquisition Application Development
An Easier Way for Cross-Platform Data Acquisition Application Development For industrial automation and measurement system developers, software technology continues making rapid progress. Software engineers
Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU
Benchmark Hadoop and Mars: MapReduce on cluster versus on GPU Heshan Li, Shaopeng Wang The Johns Hopkins University 3400 N. Charles Street Baltimore, Maryland 21218 {heshanli, shaopeng}@cs.jhu.edu 1 Overview
PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE
PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE Sudha M 1, Harish G M 2, Nandan A 3, Usha J 4 1 Department of MCA, R V College of Engineering, Bangalore : 560059, India [email protected] 2 Department
U N C L A S S I F I E D
CUDA and Java GPU Computing in a Cross Platform Application Scot Halverson [email protected] LA-UR-13-20719 Slide 1 What s the goal? Run GPU code alongside Java code Take advantage of high parallelization Utilize
Effective Java Programming. efficient software development
Effective Java Programming efficient software development Structure efficient software development what is efficiency? development process profiling during development what determines the performance of
Optimizing Application Performance with CUDA Profiling Tools
Optimizing Application Performance with CUDA Profiling Tools Why Profile? Application Code GPU Compute-Intensive Functions Rest of Sequential CPU Code CPU 100 s of cores 10,000 s of threads Great memory
ODROID Multithreading in Android
Multithreading in Android 1 Index Android Overview Android Stack Android Development Tools Main Building Blocks(Activity Life Cycle) Threading in Android Multithreading via AsyncTask Class Multithreading
Neptune. A Domain Specific Language for Deploying HPC Software on Cloud Platforms. Chris Bunch Navraj Chohan Chandra Krintz Khawaja Shams
Neptune A Domain Specific Language for Deploying HPC Software on Cloud Platforms Chris Bunch Navraj Chohan Chandra Krintz Khawaja Shams ScienceCloud 2011 @ San Jose, CA June 8, 2011 Cloud Computing Three
GPU Tools Sandra Wienke
Sandra Wienke Center for Computing and Communication, RWTH Aachen University MATSE HPC Battle 2012/13 Rechen- und Kommunikationszentrum (RZ) Agenda IDE Eclipse Debugging (CUDA) TotalView Profiling (CUDA
A general-purpose virtualization service for HPC on cloud computing: an application to GPUs
A general-purpose virtualization service for HPC on cloud computing: an application to GPUs R.Montella, G.Coviello, G.Giunta* G. Laccetti #, F. Isaila, J. Garcia Blas *Department of Applied Science University
OpenACC 2.0 and the PGI Accelerator Compilers
OpenACC 2.0 and the PGI Accelerator Compilers Michael Wolfe The Portland Group [email protected] This presentation discusses the additions made to the OpenACC API in Version 2.0. I will also present
Manjrasoft Market Oriented Cloud Computing Platform
Manjrasoft Market Oriented Cloud Computing Platform Innovative Solutions for 3D Rendering Aneka is a market oriented Cloud development and management platform with rapid application development and workload
COM 444 Cloud Computing
COM 444 Cloud Computing Lec 3: Virtual Machines and Virtualization of Clusters and Datacenters Prof. Dr. Halûk Gümüşkaya [email protected] [email protected] http://www.gumuskaya.com Virtual
Enhancing Cloud-based Servers by GPU/CPU Virtualization Management
Enhancing Cloud-based Servers by GPU/CPU Virtualiz Management Tin-Yu Wu 1, Wei-Tsong Lee 2, Chien-Yu Duan 2 Department of Computer Science and Inform Engineering, Nal Ilan University, Taiwan, ROC 1 Department
BSC vision on Big Data and extreme scale computing
BSC vision on Big Data and extreme scale computing Jesus Labarta, Eduard Ayguade,, Fabrizio Gagliardi, Rosa M. Badia, Toni Cortes, Jordi Torres, Adrian Cristal, Osman Unsal, David Carrera, Yolanda Becerra,
Open Source for Cloud Infrastructure
Open Source for Cloud Infrastructure June 29, 2012 Jackson He General Manager, Intel APAC R&D Ltd. Cloud is Here and Expanding More users, more devices, more data & traffic, expanding usages >3B 15B Connected
TEGRA X1 DEVELOPER TOOLS SEBASTIEN DOMINE, SR. DIRECTOR SW ENGINEERING
TEGRA X1 DEVELOPER TOOLS SEBASTIEN DOMINE, SR. DIRECTOR SW ENGINEERING NVIDIA DEVELOPER TOOLS BUILD. DEBUG. PROFILE. C/C++ IDE INTEGRATION STANDALONE TOOLS HARDWARE SUPPORT CPU AND GPU DEBUGGING & PROFILING
I/O Considerations in Big Data Analytics
Library of Congress I/O Considerations in Big Data Analytics 26 September 2011 Marshall Presser Federal Field CTO EMC, Data Computing Division 1 Paradigms in Big Data Structured (relational) data Very
PHP vs. Java. In this paper, I am not discussing following two issues since each is currently hotly debated in various communities:
PHP vs. Java *This document reflects my opinion about PHP and Java. I have written this without any references. Let me know if there is a technical error. --Hasari Tosun It isn't correct to compare Java
GETTING STARTED WITH ANDROID DEVELOPMENT FOR EMBEDDED SYSTEMS
Embedded Systems White Paper GETTING STARTED WITH ANDROID DEVELOPMENT FOR EMBEDDED SYSTEMS September 2009 ABSTRACT Android is an open source platform built by Google that includes an operating system,
Reminders. Lab opens from today. Many students want to use the extra I/O pins on
Reminders Lab opens from today Wednesday 4:00-5:30pm, Friday 1:00-2:30pm Location: MK228 Each student checks out one sensor mote for your Lab 1 The TA will be there to help your lab work Many students
The Fastest Way to Parallel Programming for Multicore, Clusters, Supercomputers and the Cloud.
White Paper 021313-3 Page 1 : A Software Framework for Parallel Programming* The Fastest Way to Parallel Programming for Multicore, Clusters, Supercomputers and the Cloud. ABSTRACT Programming for Multicore,
GPU System Architecture. Alan Gray EPCC The University of Edinburgh
GPU System Architecture EPCC The University of Edinburgh Outline Why do we want/need accelerators such as GPUs? GPU-CPU comparison Architectural reasons for GPU performance advantages GPU accelerated systems
HIGH PERFORMANCE BIG DATA ANALYTICS
HIGH PERFORMANCE BIG DATA ANALYTICS Kunle Olukotun Electrical Engineering and Computer Science Stanford University June 2, 2014 Explosion of Data Sources Sensors DoD is swimming in sensors and drowning
File S1: Supplementary Information of CloudDOE
File S1: Supplementary Information of CloudDOE Table of Contents 1. Prerequisites of CloudDOE... 2 2. An In-depth Discussion of Deploying a Hadoop Cloud... 2 Prerequisites of deployment... 2 Table S1.
Data Centers and Cloud Computing
Data Centers and Cloud Computing CS377 Guest Lecture Tian Guo 1 Data Centers and Cloud Computing Intro. to Data centers Virtualization Basics Intro. to Cloud Computing Case Study: Amazon EC2 2 Data Centers
ORACLE MOBILE APPLICATION FRAMEWORK DATA SHEET
ORACLE MOBILE APPLICATION FRAMEWORK DATA SHEET PRODUCTIVE ENTERPRISE MOBILE APPLICATIONS DEVELOPMENT KEY FEATURES Visual and declarative development Mobile optimized user experience Simplified access to
ITG Software Engineering
IBM WebSphere Administration 8.5 Course ID: Page 1 Last Updated 12/15/2014 WebSphere Administration 8.5 Course Overview: This 5 Day course will cover the administration and configuration of WebSphere 8.5.
FPGA Accelerator Virtualization in an OpenPOWER cloud. Fei Chen, Yonghua Lin IBM China Research Lab
FPGA Accelerator Virtualization in an OpenPOWER cloud Fei Chen, Yonghua Lin IBM China Research Lab Trend of Acceleration Technology Acceleration in Cloud is Taking Off Used FPGA to accelerate Bing search
Characteristics of Java (Optional) Y. Daniel Liang Supplement for Introduction to Java Programming
Characteristics of Java (Optional) Y. Daniel Liang Supplement for Introduction to Java Programming Java has become enormously popular. Java s rapid rise and wide acceptance can be traced to its design
Amazon EC2 Product Details Page 1 of 5
Amazon EC2 Product Details Page 1 of 5 Amazon EC2 Functionality Amazon EC2 presents a true virtual computing environment, allowing you to use web service interfaces to launch instances with a variety of
IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud
IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud February 25, 2014 1 Agenda v Mapping clients needs to cloud technologies v Addressing your pain
Virtual Machines. www.viplavkambli.com
1 Virtual Machines A virtual machine (VM) is a "completely isolated guest operating system installation within a normal host operating system". Modern virtual machines are implemented with either software
Integrating TAU With Eclipse: A Performance Analysis System in an Integrated Development Environment
Integrating TAU With Eclipse: A Performance Analysis System in an Integrated Development Environment Wyatt Spear, Allen Malony, Alan Morris, Sameer Shende {wspear, malony, amorris, sameer}@cs.uoregon.edu
IBM i25 Trends & Directions
Gl. Avernæs 20. November 2013 Erik Rex Cert. Consultant [email protected] Thanks to Steve Will IBM i Chief Architect 2013 IBM Corporation The Family Tree 1975 1988 2013 2013 IBM Corporation 3 2013 IBM Corporation
Parallel Computing with MATLAB
Parallel Computing with MATLAB Scott Benway Senior Account Manager Jiro Doke, Ph.D. Senior Application Engineer 2013 The MathWorks, Inc. 1 Acceleration Strategies Applied in MATLAB Approach Options Best
System Structures. Services Interface Structure
System Structures Services Interface Structure Operating system services (1) Operating system services (2) Functions that are helpful to the user User interface Command line interpreter Batch interface
Monitis Project Proposals for AUA. September 2014, Yerevan, Armenia
Monitis Project Proposals for AUA September 2014, Yerevan, Armenia Distributed Log Collecting and Analysing Platform Project Specifications Category: Big Data and NoSQL Software Requirements: Apache Hadoop
Towards Fast SQL Query Processing in DB2 BLU Using GPUs A Technology Demonstration. Sina Meraji [email protected]
Towards Fast SQL Query Processing in DB2 BLU Using GPUs A Technology Demonstration Sina Meraji [email protected] Please Note IBM s statements regarding its plans, directions, and intent are subject to
Big Data Analytics - Accelerated. stream-horizon.com
Big Data Analytics - Accelerated stream-horizon.com Legacy ETL platforms & conventional Data Integration approach Unable to meet latency & data throughput demands of Big Data integration challenges Based
Last Class: OS and Computer Architecture. Last Class: OS and Computer Architecture
Last Class: OS and Computer Architecture System bus Network card CPU, memory, I/O devices, network card, system bus Lecture 3, page 1 Last Class: OS and Computer Architecture OS Service Protection Interrupts
Packet-based Network Traffic Monitoring and Analysis with GPUs
Packet-based Network Traffic Monitoring and Analysis with GPUs Wenji Wu, Phil DeMar [email protected], [email protected] GPU Technology Conference 2014 March 24-27, 2014 SAN JOSE, CALIFORNIA Background Main
A survey on platforms for big data analytics
Singh and Reddy Journal of Big Data 2014, 1:8 SURVEY PAPER Open Access A survey on platforms for big data analytics Dilpreet Singh and Chandan K Reddy * * Correspondence: [email protected] Department
Introduction to Cloud Computing
Introduction to Cloud Computing Cloud Computing I (intro) 15 319, spring 2010 2 nd Lecture, Jan 14 th Majd F. Sakr Lecture Motivation General overview on cloud computing What is cloud computing Services
Enabling Technologies for Distributed and Cloud Computing
Enabling Technologies for Distributed and Cloud Computing Dr. Sanjay P. Ahuja, Ph.D. 2010-14 FIS Distinguished Professor of Computer Science School of Computing, UNF Multi-core CPUs and Multithreading
Example of Standard API
16 Example of Standard API System Call Implementation Typically, a number associated with each system call System call interface maintains a table indexed according to these numbers The system call interface
Chapter 3 Operating-System Structures
Contents 1. Introduction 2. Computer-System Structures 3. Operating-System Structures 4. Processes 5. Threads 6. CPU Scheduling 7. Process Synchronization 8. Deadlocks 9. Memory Management 10. Virtual
Lecture 1: an introduction to CUDA
Lecture 1: an introduction to CUDA Mike Giles [email protected] Oxford University Mathematical Institute Oxford e-research Centre Lecture 1 p. 1 Overview hardware view software view CUDA programming
<Insert Picture Here> Java, the language for the future
1 Java, the language for the future Adam Messinger Vice President of Development The following is intended to outline our general product direction. It is intended for information
GPU Parallel Computing Architecture and CUDA Programming Model
GPU Parallel Computing Architecture and CUDA Programming Model John Nickolls Outline Why GPU Computing? GPU Computing Architecture Multithreading and Arrays Data Parallel Problem Decomposition Parallel
Best of breed Multi Architecture Cloud
Best of breed Multi Architecture Cloud Fusszeile / Datum About Me Co-Owner Adfinis SyGroup AG Co-Owner p p Security AG Business Development and Strategy Cloud Solution Architect 15.06.15 15.06.15 About
Parallel Databases. Parallel Architectures. Parallelism Terminology 1/4/2015. Increase performance by performing operations in parallel
Parallel Databases Increase performance by performing operations in parallel Parallel Architectures Shared memory Shared disk Shared nothing closely coupled loosely coupled Parallelism Terminology Speedup:
Distributed Computing and Big Data: Hadoop and MapReduce
Distributed Computing and Big Data: Hadoop and MapReduce Bill Keenan, Director Terry Heinze, Architect Thomson Reuters Research & Development Agenda R&D Overview Hadoop and MapReduce Overview Use Case:
Hadoop 只 支 援 用 Java 開 發 嘛? Is Hadoop only support Java? 總 不 能 全 部 都 重 新 設 計 吧? 如 何 與 舊 系 統 相 容? Can Hadoop work with existing software?
Hadoop 只 支 援 用 Java 開 發 嘛? Is Hadoop only support Java? 總 不 能 全 部 都 重 新 設 計 吧? 如 何 與 舊 系 統 相 容? Can Hadoop work with existing software? 可 以 跟 資 料 庫 結 合 嘛? Can Hadoop work with Databases? 開 發 者 們 有 聽 到
Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module
Migration and Building of Data Centers in IBM SoftLayer with the RackWare Management Module June, 2015 WHITE PAPER Contents Advantages of IBM SoftLayer and RackWare Together... 4 Relationship between
