Network & HEP Computing in China. Gongxing SUN CJK Workshop & CFI

Similar documents
Volunteer Computing, Grid Computing and Cloud Computing: Opportunities for Synergy. Derrick Kondo INRIA, France

(Possible) HEP Use Case for NDN. Phil DeMar; Wenji Wu NDNComm (UCLA) Sept. 28, 2015

2. Research and Development on the Autonomic Operation. Control Infrastructure Technologies in the Cloud Computing Environment

SOFTWARE-DEFINED NETWORKING AND OPENFLOW

A Scalable Network Monitoring System as a Public Service on Cloud

Pronto Cloud Controller The Next Generation Control

The dcache Storage Element

Report from SARA/NIKHEF T1 and associated T2s

Increase Simplicity and Improve Reliability with VPLS on the MX Series Routers

Frequently Asked Questions

Oracle Desktop Virtualization

Solution for private cloud computing

Cloud Optimize Your IT

GB-OS Version 6.2. Configuring IPv6. Tel: Fax Web:

How To Speed Up A Flash Flash Storage System With The Hyperq Memory Router

Building a Volunteer Cloud

SDN/Virtualization and Cloud Computing

SDN, a New Definition of Next-Generation Campus Network

Network Virtualization for the Enterprise Data Center. Guido Appenzeller Open Networking Summit October 2011

Introducing FUJITSU Software Systemwalker Centric Manager V15.1.1

Lecture 02b Cloud Computing II

Cost-Benefit Analysis of Cloud Computing versus Desktop Grids

Volunteer Computing and Cloud Computing: Opportunities for Synergy

AppDirector Load balancing IBM Websphere and AppXcel

This document describes the new features of this release and important changes since the previous one.

Big Data Testbed for Research and Education Networks Analysis. SomkiatDontongdang, PanjaiTantatsanawong, andajchariyasaeung

AscenVision. Successful Story of F1. AscenVision Technology Inc. The Intelligent Network Provider

Using SDN-OpenFlow for High-level Services

Cloud Computing with Red Hat Solutions. Sivaram Shunmugam Red Hat Asia Pacific Pte Ltd.

White paper. Microsoft and Citrix VDI: Virtual desktop implementation scenarios

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT

Distributed applications monitoring at system and network level

STeP-IN SUMMIT June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case)

SVN5800 Secure Access Gateway

Experiences with MPTCP in an intercontinental multipathed OpenFlow network

About the VM-Series Firewall

Database Services for CERN

Network performance in virtual infrastructures

GRID computing at LHC Science without Borders

An Integrated CyberSecurity Approach for HEP Grids. Workshop Report.

AlcAtel-lucent enterprise AnD sdnsquare sdn² network solution enabling highly efficient, volumetric, time-critical data transfer over ip networks

DataCentred Cloud Services Pricing MediaCityUK, Manchester Flexible, Open Source, Cost Effective

System Models for Distributed and Cloud Computing

Agile VPN for Carrier/SP Network. ONOS- based SDN Controller for China Unicom MPLS L3VPN Service

Application-Centric WLAN. Rob Mellencamp

Software Defined Network (SDN)

SILVER PEAK ACCELERATION WITH EMC VSPEX PRIVATE CLOUD WITH RECOVERPOINT FOR VMWARE VSPHERE

SOFTWARE DEFINED NETWORKING: INDUSTRY INVOLVEMENT

District of Columbia Courts Attachment 1 Video Conference Bridge Infrastructure Equipment Performance Specification

Demonstrating the high performance and feature richness of the compact MX Series

Virtualization, SDN and NFV

Remote Application Server Version 14. Last updated:

GigaSpaces XAP 10.0 Administration Training ADMINISTRATION, MONITORING AND TROUBLESHOOTING GIGASPACES XAP DISTRIBUTED SYSTEMS

THE REVOLUTION TOWARDS SOFTWARE- DEFINED NETWORKING

The All-in-One, Intelligent NXC Controller

Tier3 Network Issues. Richard Carlson May 19, 2009

IP Telephony Management

Remote Application Server Version 14. Last updated:

Scala Storage Scale-Out Clustered Storage White Paper

SOFTWARE-DEFINED NETWORKING AND OPENFLOW

High-performance vswitch of the user, by the user, for the user

Citrix Lab Manager 3.6 SP 2 Quick Start Guide

This document describes how the Meraki Cloud Controller system enables the construction of large-scale, cost-effective wireless networks.

Microsoft Compute Clusters in High Performance Technical Computing. Björn Tromsdorf, HPC Product Manager, Microsoft Corporation

Load Manager Administrator s Guide For other guides in this document set, go to the Document Center

COMLINK Cloud Technical Specification Guide DEDICATED SERVER

Cisco Application Networking for Citrix Presentation Server

Gigabit Multi-Homing VPN Security Router

CERN Cloud Infrastructure. Cloud Networking

Install Guide for JunosV Wireless LAN Controller

Gigabit SSL VPN Security Router

How To Use Openstack At Cern

Software Defined Networking and Network Virtualization

Testing Intelligent Device Communications in a Distributed System

MPLS L2VPN (VLL) Technology White Paper

Accelerating Network Virtualization Overlays with QLogic Intelligent Ethernet Adapters

SDN in the Public Cloud: Windows Azure. Albert Greenberg Partner Development Manager Windows Azure Networking

DREAMER and GN4-JRA2 on GTS

Cisco Application Networking for BEA WebLogic

Cisco Application Networking for IBM WebSphere

Who s Endian?

High Availability Essentials

Programmable Networking with Open vswitch

LinuxWorld Conference & Expo Server Farms and XML Web Services

Chapter 5. Data Communication And Internet Technology

Basic IPv6 WAN and LAN Configuration

Transcription:

Network & HEP Computing in China Gongxing SUN CJK Workshop & CFI

Outlines IPV6 deployment SDN for HEP data transfer Dirac Computing Model on IPV6 Volunteer Computing Future Work

IPv6@IHEP-Deployment Internet bandwidth 10Gbps to CSTnet IPv6 network devices ~150 switches ~300 AP IP address allocation SLAAC Network monitoring Cacti / Nagios with IPv6 patch Network security IPv6 ACL(Linux) Anything out / Nothing in

IPv6@IHEP-IP address assignment Dibbler A portable DHCPv6 Open source software, current version 1.0.0 DHCPv6 solution Server, Client (Support XP), Relay Pseudo dynamic ipv6 address allocation Address track IHEP contribution bug fix: Interface-id string size>4 IPv6 distribution in VLAN IPv4 production public network: no IPv6 IPv4 private network : public IPv6 Feature OS supported Linux 2.4/2.6; Windows NT4.0,XP,WIN7/8; Mac OS Multi-server supported Auto-configuration proctocol supported Stateful /Stateless IA,TA,PD client IP configuration control Dhcpv6 relay request supported Client is configured by MAC or UUID Server caching

IPv6@IHEP-User Access Control Procedure Online Register MAC/User Name/Email/Tel/Building/Room number/plugin number/ Submit no Approved by Admin ok Dibbler/DHCP configuration updated Assign IP address IPDB save Switch configuration updated Switch information: IP/Port/Vlan/ Switch-Room/Plugin Number relationship Vlan/IP subnet/switch-port relationship IP/MAC relationship

IPv6@IHEP-HEPiX IPv6 Working Group Grid Computing Environment The gridftp(ipv6) test bed was set up IP Name: ui01-hepix.ihep.ac.cn ui01-hepix-v6.ihep.ac.cn (2401:de00::9998) ui01-hepix-v4.ihep.ac.cn (202.122.32.172)

FTS@IHEP-Data Flow & features Features Performance: Higher throughput Stability: services with cluster structure Efficiency: Low latency for data relay Function: Multi-relay sites supported Automation: file source auto-scan, Fault self-recovery

FTS@IHEP-Transfer Monitoring(real-time)

FTS@IHEP-Transfer Monitoring(Historical Statistics)

SDN@IHEP-Network for HEP MASS HEP experiment data share and exchange between cooperation members HEP Experiments Exchange for Computing and Storage in Data Center The Flexible and extensible network for the Computing and Storage Virtualization

SDN@IHEP - What we knew It is much easier for the SDN@Data Center, We can Control the network facilities & infrastructure Design the network structure and deploy the network devices The big problem we are facing is the internet performance among the HEP experiment members The applications(file transfer system included) just support IPv4 The IPv4 network performance problem among the HEP experiment members

SDN@IHEP-Goals: improve the performance A Private Virtual Network across Chinese HEP Experiment Members Based on SDN architecture An intelligent data transmission network path selection algorithm Using the IPv6 network link around China (CNGI) Do not change the applications(just IPv4 applications)

SDN@IHEP -Current Status Consist of End user network HEP Group Computing Resources HEP Group Computing Resources Backbone network(ipv6) L2VPN gateway Openflow switch SDU Network Switch L2VPN IPv6 Tunnel L2VPN Switch SJTU Network Control center Members IPv6 Tunnel IPv6 Tunnel IHEP/SJU/SDU/ Network manufacturer:ruijie L2VPN Switch Networks, A high performance network joint lab (IHEP-Ruijie) IHEP Network Controller HEP Group Computing Resources

SDN@IHEP -Controller Based on NOX-the original OpenFlow controller Design and programming the network route algorithm Developed the SDN network roughly monitoring system

SDN@IHEP -Performance Network performance Network latency But The performance is unstable Caused by the IPv6 network load status 链 路 描 述 IPV4 带 宽 (Mbps) IPV6 带 宽 (Mbps) IHEP SJTU 31.22 665.00 448.00 SJTU IHEP 28.60 380.00 127.00 IHEP SDU 5.90 117.31 68.90 SDU IHEP 3.15 12.60 6.23 SDU SJTU 3.51 66.40 39.60 SJTU SDU 3.06 75.23 40.20 隧 道 +SDN 带 宽 (Mbps) 链 路 描 述 IPV4 延 时 IPV6 延 隧 道 +SDN 延 时 (ms) 时 (ms) (ms) IHEP SJTU 31.20 36.78 38.17 SJTU IHEP 28.67 36.86 38.31 IHEP SDU 83.00 49.49 56.33 SDU IHEP 89.32 49.54 57.02 SDU SJTU 187.52 101.69 109.40 SJTU SDU 189.37 99.74 112.21

Dirac Computing Model for BESIII Exp.

Dirac Computing for HEP Enable simulation + reconstruction jobs on distributed environment distributed storage solution by changing computing model from remote central storage to distributed local storage easy connection to local analysis jobs distribute dst data from IHEP to collaboration members. site CE site SE site CE Lustre central storage solution site SE WAN dcache SE site CE IHEP SE site CE site CE site CE site CE download randomtrg data site CE upload output dst network jam high load of SE site SE write output dst site CE read randomtrg data SE replicate/transfer 4th June 2014 BESIII Collaboration Meeting, IHEP 17

Data Transfer Statistics 24.5 TB XYZ DST data ( IHEP USTC @ 3.20 TB/day ) 4.4 TB randomtrg data ( IHEP USTC, JINR, WHU, UMN @ 1.95 TB/day ) 4th June 2014 BESIII Collaboration Meeting, IHEP 18

Site monitoring Currently 4 monitorings: CE Availablity Host (worker nodes) Network SE latency More tests will be added: SE transfer speed SE usage information dataset status pilot monitoring Author: Igor Pelevanyuk @ JINR Details in Alexey s report 4th June 2014 BESIII Collaboration Meeting, IHEP 19

BOINC: Volunteer Computing

Have a large amount of data to process/a big computing task Scientists BOINC project Continuous Jobs sent to BOINC server

Volunteers CAS@Home SETI@Home LHC@home Einstein@Home

The scale of Volunteer Computing Around 50 VC projects :HEP, biology, cosmology, chemistry, physics, environment. 260K active volunteers 490K active volunteer computers Real time computing power: 7.2 PetaFLOPS

The scale of Volunteer Computing EC2 price: 1.16USD/hour for a CPU intensive Windows/Linux instance (~1.5GFLOPS) (7.2PetaFLOPS/1.5GFLOPS)*1.16USD=5.56M USD/hour Successful Individual projects: Einstein@home (280 TeraFLOPS), SETI@home (581 TeraFLOPS) LHC@home (2TeraFLOPS)

Basic Model for BOINC based VC 1 The scientist deploys and manages the BOINC server and develops the application 2 The BOINC client is installed on volunteer PCs/laptops/smart phones/game consoles by volunteers. 3 BOINC client gets the application and input files from server, runs the application and sends the results back to BOINC server. BOINC Server Application/ Input data Job Results BOINC Client

BOINC Architecture Application/Input data Datab ase Scheduler File Uploader BOINC Manager Client Tools BOINC Client Application1 Validator Application1 Assimilator BOINC Server Application2 Validator Application2 Assimilator BOINC Manager Client Tools BOINC Client

Volunteer Computing In HEP (1) LHC@home A project based at CERN, aiming at utilizing public computing resources for LHC (Large Hadron Collider ) related simulation The project has been running since 2010. Current scale: Over 13K active users About 18K active hosts Real time computing power: 14TeraFLOPS

Volunteer Computing In HEP (2) ATLAS@home Another VC project based at CERN, aiming at utilizing public desktop resources for ATLAS similuation ATLAS is one of the four HEP experiments at CERN, it co discovered the Higgs bosson with CMS experiment, hence completed the standard model in physics. All available public computing resources are harnessed by BOINC and integrated with its Grid Computing resources which provides high resource transparency to the end users. The project will be in official running in late June 2014.

Volunteer Computing on Mobile Phones Smart phones have serious computing power - as much as 25% of an average desktop computer There are 900 million Android phones as of May 2013, and it is growing rapidly Mobile devices can therefore supply a huge amount of energy-efficient computing power to science A serial of BOINC based volunteer computing projects have already supported their application running on smart phone platforms such as Android. These projects include: Einstein@home SETI@home World Community Grid Quake Catcher Network SubsetSum@Home theskynet POGS Asteroids@home Collatz Conjecture GPUGrid.net

Future Work

SDN@IHEP-what we are doing Make the applications to control the data flow route Network available bandwidth measurement feeds back to the controller FTS calls the API of controller Keep the network performance stable and improved Use the ipv6 and ipv6 link at the same time (link bonding) Two tunnels between any two sites Network link load balance

Work Plan and To Do List Data Management System improve the performance and UI of data transfer system promote the deployment of SE on site continue developing on dcache + Lustre Workload Management System Study and improve performance and bottleneck of simulation + reconstruction upgrade to DIRAC v6r11 Cloud Study, integrate and evaluate Cloud resources, including private cloud and commercial cloud Dynamically use cloud resources according to job requirements 4th June 2014 BESIII Collaboration Meeting, IHEP 32

Thanks! Question?