HPC-related R&D in 863 Program



Similar documents
Accelerating Simulation & Analysis with Hybrid GPU Parallelization and Cloud Computing

HPC and Big Data. EPCC The University of Edinburgh. Adrian Jackson Technical Architect

Sun Constellation System: The Open Petascale Computing Architecture

Agenda. HPC Software Stack. HPC Post-Processing Visualization. Case Study National Scientific Center. European HPC Benchmark Center Montpellier PSSC

COMP/CS 605: Intro to Parallel Computing Lecture 01: Parallel Computing Overview (Part 1)

A GPU COMPUTING PLATFORM (SAGA) AND A CFD CODE ON GPU FOR AEROSPACE APPLICATIONS

Barry Bolding, Ph.D. VP, Cray Product Division

GTC Presentation March 19, Copyright 2012 Penguin Computing, Inc. All rights reserved

Cluster, Grid, Cloud Concepts

<Insert Picture Here> Infrastructure as a Service (IaaS) Cloud Computing for Enterprises

HETEROGENEOUS HPC, ARCHITECTURE OPTIMIZATION, AND NVLINK

Appro Supercomputer Solutions Best Practices Appro 2012 Deployment Successes. Anthony Kenisky, VP of North America Sales

Introducing PgOpenCL A New PostgreSQL Procedural Language Unlocking the Power of the GPU! By Tim Child

GPU System Architecture. Alan Gray EPCC The University of Edinburgh

SURFsara HPC Cloud Workshop

Kriterien für ein PetaFlop System

Evoluzione dell Infrastruttura di Calcolo e Data Analytics per la ricerca

ALPS Supercomputing System A Scalable Supercomputer with Flexible Services

The Lattice Project: A Multi-Model Grid Computing System. Center for Bioinformatics and Computational Biology University of Maryland

SR-IOV In High Performance Computing

SURFsara HPC Cloud Workshop

PRIMERGY server-based High Performance Computing solutions

Cloud Computing. Alex Crawford Ben Johnstone

HPC Update: Engagement Model

STORAGE HIGH SPEED INTERCONNECTS HIGH PERFORMANCE COMPUTING VISUALISATION GPU COMPUTING

Supercomputing Resources in BSC, RES and PRACE

Unleashing the Performance Potential of GPUs for Atmospheric Dynamic Solvers

Automating Big Data Benchmarking for Different Architectures with ALOJA

High Performance Computing in CST STUDIO SUITE

Parallel Computing. Introduction

Altix Usage and Application Programming. Welcome and Introduction

The High Performance Internet of Things: using GVirtuS for gluing cloud computing and ubiquitous connected devices

Building a Top500-class Supercomputing Cluster at LNS-BUAP

Hadoop on the Gordon Data Intensive Cluster

BSC - Barcelona Supercomputer Center

Parallel Programming Survey

Turbomachinery CFD on many-core platforms experiences and strategies

Data Centric Systems (DCS)

Purchase of High Performance Computing (HPC) Central Compute Resources by Northwestern Researchers

RWTH GPU Cluster. Sandra Wienke November Rechen- und Kommunikationszentrum (RZ) Fotos: Christian Iwainsky

1 Bull, 2011 Bull Extreme Computing

High Performance Computing

Estonian Scientific Computing Infrastructure (ETAIS)

Visit to the National University for Defense Technology Changsha, China. Jack Dongarra. University of Tennessee. Oak Ridge National Laboratory

How To Compare Amazon Ec2 To A Supercomputer For Scientific Applications

Welcome to ASC Student Supercomputer Challenge 2015

THE SUN STORAGE AND ARCHIVE SOLUTION FOR HPC

Overview of HPC systems and software available within

Cosmological simulations on High Performance Computers

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

Toward a practical HPC Cloud : Performance tuning of a virtualized HPC cluster

Clusters: Mainstream Technology for CAE

LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance

PLGrid Infrastructure Solutions For Computational Chemistry

Linux Cluster Computing An Administrator s Perspective

Cloud Computing through Virtualization and HPC technologies

PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN

PACE Predictive Analytics Center of San Diego Supercomputer Center, UCSD. Natasha Balac, Ph.D.

Big Data and Cloud Computing for GHRSST

Parallel Large-Scale Visualization


PCIe Over Cable Provides Greater Performance for Less Cost for High Performance Computing (HPC) Clusters. from One Stop Systems (OSS)

HPC Wales Skills Academy Course Catalogue 2015

High-Performance Computing and Big Data Challenge

CRIBI. Calcolo Scientifico e Bioinformatica oggi Università di Padova 13 gennaio 2012

1 DCSC/AU: HUGE. DeIC Sekretariat /RB. Bilag 1. DeIC (DCSC) Scientific Computing Installations

Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory. The Nation s Premier Laboratory for Land Forces UNCLASSIFIED

HPC Programming Framework Research Team

HPC technology and future architecture

Interconnect Your Future Enabling the Best Datacenter Return on Investment. TOP500 Supercomputers, November 2015

Cloud Computing. Lecture 5 Grid Case Studies

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud

On Cloud Computing Technology in the Construction of Digital Campus

HP ProLiant SL270s Gen8 Server. Evaluation Report

Enabling Technologies for Distributed Computing

Trends in High-Performance Computing for Power Grid Applications

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/ CAE Associates

Mississippi State University High Performance Computing Collaboratory Brief Overview. Trey Breckenridge Director, HPC

Intel Cluster Ready Appro Xtreme-X Computers with Mellanox QDR Infiniband

A general-purpose virtualization service for HPC on cloud computing: an application to GPUs

GPU Computing. The GPU Advantage. To ExaScale and Beyond. The GPU is the Computer

HPC Cluster Decisions and ANSYS Configuration Best Practices. Diana Collier Lead Systems Support Specialist Houston UGM May 2014

Pedraforca: ARM + GPU prototype

REFERENCE. Microsoft in HPC. Tejas Karmarkar, Solution Sales Professional, Microsoft

Programming models for heterogeneous computing. Manuel Ujaldón Nvidia CUDA Fellow and A/Prof. Computer Architecture Department University of Malaga

SRNWP Workshop. HP Solutions and Activities in Climate & Weather Research. Michael Riedmann European Performance Center

Transcription:

HPC-related R&D in 863 Program Depei Qian Sino-German Joint Software Institute (JSI) Beihang University Aug. 27, 2010

Outline The 863 key project on HPC and Grid Status and Next 5 years

863 efforts on HPC and Grid High performance computer and core software 4-year effort, May 2002 to Dec. 2005 100 million Yuan funding from the MOST More than 2Χ associated funds from local government, application organizations, and industry Outcomes: China National Grid (CNGrid) High productivity Computer and Grid Service Environment Period: 2006-2010 940 million Yuan from the MOST and more matching investment from other sources

Major R&D activities HPC development Grid software development CNGrid development Grid and HPC applications

1. HPC development Two phase development First phase: two 100TFlops machines for Dawning 5000A for SSC Lenovo DeepComp 7000 for CASSC Second phase: three 1000Tflops machines for Tianjin Supercomputing Center (Binhai New Zoon) South China Supercomputing Center (Shenzhen) Shandong Supercomputing Center (Jinan)

Dawning 5000A Constellation based on AMD multicore processors Low power CPU and high density blade design High performance InfiniBand switch 233.472TFlops peak performance, 180.6TFlops Linpack performance The 10th in TOP500 in Nov. 2008, the fastest machine outside USA

Lenovo DeepComp 7000 Hybrid cluster architecture using Intel multicore processors Two sets of interconnect InInfiniBand Gb Ethernet SAN connection between I/O nodes and disk array 145.965TFlops peak performance 106.5 Tflops Linpack performance The 19th in TOP500 in Nov. 2008

Tianhe I Hybrid system General purpose unit--intel 4- core processors Acceleration unit--amd GPUs Service unit Intel 4-core processors 1206TFlops peak General purpose: 200+TF GPU: 900+TF 560 TFlops Linpack performance Reasonable efficiency for a hybrid system 1.6MW Announced on Oct. 29, 2009

Tianhe I High speed network chips developed 2x QDR InfiniBand Coordination of heterogeneous components of the system Programming support for the heterogeneous system The second phase of Tianhe (Tianhe II?) will be completed by the end of this year Adding more general purpose computing power (1PF) Adding small amount of domestic processor Replacing the AMD GPUs with nvidia s new Fermi GPUs (1.3PF)

Dawning 6000 Hybrid system Service unit (Nebula) 9600 Inter 6-core Westmere processor 4800 nvidia Fermi GPGPU 3PF peak performance 1.27 Linpack performance About 2.6 MW Computing unit Implemented with Loonson 3B processor Complete in 2011

Issues about hybrid architectures How to effectively use heterogeneous parts of the system? Applications which can effectively use the accelerators are still limited Difficulty in programming the hybrid system Need programming support for the hybrid systems

2. Grid software development Goal Developing system level software for supporting grid environment operation and grid applications Pursuing technological innovation Emphasizing maturity and robustness of the software

Security CNGrid GOS Architecture IDE GSML Workshop. Debugger GSML Composer Compiler GSML Browser Other Domain Specific Applications Cmd Line Tools Gsh & cmd tools VegaSSH HPCG App & Mgmt Portal System Mgmt Portal GOS Library (Batch, Message, File, etc) GOS System Call (Resource mgmt,agora mgmt, User mgmt, Grip mgmt, etc) Tool/App Grid Portal, Gsh+CLI, GSML Workshop and Grid Apps Core, System and App Level Services HPCG Backend metainfo mgmt File mgmt BatchJob mgmt Account mgmt MetaSchedule CA Service Message Service Dynamic DeployService DataGrid DB Service GridWorkflow Work Flow Engine System Axis Handlers for Message Level Security Tomcat(5.0.28) + Axis(1.2 rc2) Grip Grip Instance Mgmt Resource Space User Mgmt Agora Res AC & Sharing Agora Mgmt Res Mgmt Core J2SE(1.4.2_07, 1.5.0_07) Grip Runtime Naming OS (Linux/Unix/Windows) Other ServiceController RController Tomcat(Apache)+Axis, GT4, glite, OMII Java J2SE Other 3rd software & tools Hosting Environment PC Server (Grid Server)

CNGrid GOS deployment CNGrid GOS deployed on 10 sites and some application Grids Support heterogeneous HPCs: Galaxy, Dawning, DeepComp Support multiple platforms Unix, Linux, Windows Using public network connection, enable only HTTP port Flexible client Web browser Special client GSML client

3. CNGrid development Ten sites CNIC, CAS (Beijing, major site) Shanghai Supercomputer Center (Shanghai, major site ) Tsinghua University (Beijing) Institute of Applied Physics and Computational Mathematics (Beijing) University of Science and Technology of China (Hefei, Anhui) Xi an Jiaotong University (Xi an, Shaanxi) Shenzhen Institute of Advanced Technology (Shenzhen, Guangdong) University of Hong Kong (Hong Kong) Shandong University (Jinan, Shandong) Huazhong University of Science and Technology (Wuhan, Hubei) The CNGrid Operation Center (based on CNIC, CAS) Three PF sites will be integrated into CNGrid

Tsinghua University: 1.33TFlops, 158TB storage, 29 applications, 100+ users. IPV4/V6 access XJTU: 4TFlops, 25TB storage, 14 applications, 120+ users, IPv4/v6 access HUST: 1.7TFlops, 15TB storage, IPv4/v6 access IAPCM: 1TFlops, 4.9TB storage, 10 applications, 138 users, IPv4/v6 access CNIC: 150TFlops, 1.4PB storage,30 applications, 269 users all over the country, IPv4/v6 access Shandong University 1.3TFlops, 18TB storage, 7 applications, 60+ users, IPv4/v6 access SSC: 200TFlops, 600TB storage, 15 applications, 286 users, IPv4/v6 access SIAT: 1.44TFlops, 17.6TB storage, IPv4/v6 access HKU: 2.6TFlops, 80+ users, IPv4/v6 access USTC: 1TFlops, 15TB storage, 18 applications, 60+ users, IPv4/v6 access

CNGrid: resources 10 sites 380TFlops 2200TB storage

CNGrid:services and users 230 services >1400 users China commercial Aircraft Corp Bao Steel automobile institutes of CAS universities

CNGrid:applications Supporting >700 projects 973, 863, NSFC, CAS Innovative, and Engineering projects

CAS Supercomputing Center (SCCAS) Established at CNIC CAS in 1996 Currently installation Lenovo DeepComp 7000 Applications 200+ registered users Supporting more than 120 important projects A number of important achievement obtained

Shanghai Supercomputing Center (SSC) Established by Shanghai city government in 2000 Currently installation Dawning 5000A 300+ users

Utilization (SSC) 90 80 70 73.3 79.1 67.9 76.2 83.6 60 50 46.8 49.3 40 34.4 35.5 30 24.8 20 10 0

Application scale (SSC) 1-3 个 core, 0.00% 1024 个 core 及 以 上, 31.81% 512-1023 个 core, 3.61% 16-63 个 64-127 4-15 个 core, core, core, 15.11% 4.33% 0.11% 128-255 个 core, 7.28% 256-511 个 core, 37.76%

4: Grid and HPC applications Developing productive HPC and Grid applications Verification of the technologies Applications from some selected areas Resource and Environment Research Services Manufacturing

Grid applications Drug Discovery Weather forecasting Scientific data Grid and its application in research Water resource Information system Grid-enabled railway freight Information system Grid for Chinese medicine database applications HPC and Grid for Aerospace Industry (AviGrid) National forestry project planning, monitoring and evaluation

HPC applications Computational chemistry Computational Astronomy Parallel program for large fluid machinery design Fusion ignition simulation Parallel algorithms for bio- and pharmacy applications Parallel algorithms for weather forecasting based on GRAPES 10000+ core scale simulation for aircraft design Seismic imaging for oil exploration Parallel algorithm libraries for PetaFlops systems

Domain application Grids Domain application Grids for Simulation and optimization automobile industry aircraft design steel industry Scientific computing Bio-information application computational chemistry Introducing Cloud Computing concept CNGrid as IaaS and partially PaaS Domain application Grids as SaaS and partially PaaS

CNGrid (2006-2010) HPC Systems Two 100 Tflops 3 PFlops Grid Software: CNGrid GOS CNGrid Environment 12 sites One OP Centers Some domain application Grids Applications Research Resource & Environment Manufacturing Services

China s status in related fields Significant progress in certain areas Still far behind in many aspects kernel technologies applications multi-disciplinary research talent people Sustainable development is crucial

Next 5 years ChinaCloud A priority key project that will be launched this year Strategic study to identify the goal and the scope of the project Network OS, cloud based search engine, and cloud based language translation High-end computing infrastructure (Still pending) Goals? Demands? HPC in cloud?

HPC Consortium A HPC consortium based on CNGrid is under consideration Part of the country s cyber-infrastructure High-end computing facility for research Simulation and optimization capability needed by industry A consortium for taking national R&D projects A new model of operation need to be defined HKU site is extremely valuable to CNGrid and the future consortium Effective operation Supporting applications Promoting international cooperation

Thank you!