Swedish National Infrastructure for Computing
SNIC & Grid & Data
Sverker Holmgren
SNIC 2007, - 1
SNIC-Mission
The Swedish National Infrastructure for Computing (SNIC), under the jurisdiction of the Swedish Research Council, is a national resource intended to create integrated quality access to computational resources for Swedish research purposes, where networks, data storage, computers, visualisation and various Grid techniques can be used to produce a transparent resource.
Stated in the instruction for SNIC issued by the Swedish Research Council
SNIC-Strategy
- Provide long term funding for HPC-resources in Sweden
- Coordinate investments in HPC-systems
- Coordinate competence at participating centers to optimize user support and quality of operations
- HPC-related development projects in
  - Computer systems
  - Storage
  - Networks
  - Computational science
  - Visualization
  - GRID-technology
- Disseminate information and knowledge about SNIC resources and their use
- Host the Swedish National Graduate School in Scientific Computing (NGSSC)
Why a Metacenter?
- Limited number of HPC experts in Sweden
- Proximity to users by having regional centers
  - In-depth user support
  - Collaborations
  - Induction of new HPC usage
  - Points of entry to national infrastructure
- Load balancing national leading edge systems based on technical assessments and resource availability
- Grid technology
  - A metacenter can contribute to development
  - Grid technology enables metacenter co-ordination
- International collaboration as a unified structure
  - NorduGrid/ARC
  - EGEE/LCG
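The load balancing point can be illustrated with a toy selection rule. This is only a sketch under stated assumptions: the `Center` record, the `pick_center` function, and the center names and CPU counts are all hypothetical, and it considers resource availability alone, not the technical assessments the slide also mentions.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Center:
    """Hypothetical record of one HPC center's CPU usage."""
    name: str
    total_cpus: int
    busy_cpus: int

    @property
    def free_cpus(self) -> int:
        return self.total_cpus - self.busy_cpus


def pick_center(centers: Sequence[Center], cpus_needed: int) -> Optional[Center]:
    """Return the center with the most free CPUs that can fit the job.

    Returns None when no center currently has enough free CPUs.
    """
    candidates = [c for c in centers if c.free_cpus >= cpus_needed]
    if not candidates:
        return None
    return max(candidates, key=lambda c: c.free_cpus)


if __name__ == "__main__":
    # Illustrative numbers only; they do not describe real SNIC centers.
    centers = [
        Center("center-a", total_cpus=100, busy_cpus=90),
        Center("center-b", total_cpus=100, busy_cpus=40),
        Center("center-c", total_cpus=100, busy_cpus=70),
    ]
    chosen = pick_center(centers, cpus_needed=20)
    print(chosen.name if chosen else "no center has capacity")
```

A real metacenter would weigh more than free CPUs (system architecture, storage proximity, allocation policy), so the rule above should be read as the simplest possible starting point.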
GRID-Vision
- Hardware, networks and middleware are used to put together a virtual computer resource
- Users should not have to know where computation is taking place or where data is stored
- Users will work together over disciplinary and geographical borders and form virtual organizations
Flat GRID
[Diagram labels: GRID; Resource (x5)]
Hierarchical GRID
[Diagram labels: Management; Regional center (x2); GRID; Local resource (x4)]
Collaborative GRID
[Diagram labels: GRID; Resources (x2)]
Power plant GRID
[Diagram labels: GRID; HPC-center (x5)]
Some important Grid projects
- Globus: Middleware project, provides the foundation for many other projects
- GGF (Global Grid Forum): World wide meetings and standardization efforts
- LCG (Large Hadron Collider Computing Grid): CERN's Grid project to do data analysis for LHC
- NorduGrid/ARC (Advanced Resource Connector): Middleware driving SweGrid
- NDGF (Nordic Data Grid Facility): Nordic organisation for national Grids, T1 facility
- EGEE (Enabling Grids for E-science in Europe): EU funded, CERN driven project involving 74 partners
- BalticGrid: EGEE outreach project to the Baltic states, coordinated by KTH
- DEISA: EU funded project connecting large HPC centers in Europe
- e-IRG: Advisory body to EU on e-infrastructures
- ESFRI expert panel on HPC: European advisory panel on HPC related issues
SweGrid production testbed
- The first step towards HPC center Gridification
- Initiative from
  - All HPC-centers in Sweden
  - IT-researchers wanting to research Grid technology
  - Users
    - Life Science
    - Earth Sciences
    - Space & Astro Physics
    - High energy physics
- PC-clusters with large storage capacity
- Built for GRID production
- Participation in international collaborations
  - LCG
  - EGEE
  - NorduGrid
SweGrid production test bed
- Total budget 3.6 MEuro
- 6 GRID nodes
- 600 CPUs
  - IA-32, 1 processor/server
  - 875P with 800 MHz FSB and dual memory busses
  - 2.8 GHz Intel P4
  - 2 GByte
  - Gigabit Ethernet
- 12 TByte temporary storage
  - FibreChannel for bandwidth
  - 14 x 146 GByte 10000 rpm
- 410 TByte nearline storage
  - 140 TByte disk
  - 270 TByte tape
- 1 Gigabit direct connection to SUNET (10 Gbps)
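The figures on this slide can be cross-checked with a little arithmetic. The sketch below assumes that the CPUs are spread evenly over the 6 GRID nodes and that "14 x 146 GByte" describes the disks in each node; neither assumption is stated on the slide. The function names are hypothetical, and decimal units (1 TByte = 1000 GByte) are used.

```python
# Figures taken from the slide.
GRID_NODES = 6
TOTAL_CPUS = 600
DISK_SIZE_GBYTE = 146
NEARLINE_DISK_TBYTE = 140
NEARLINE_TAPE_TBYTE = 270

# Assumption (not stated on the slide): 14 disks in each GRID node.
DISKS_PER_NODE = 14


def cpus_per_node(total_cpus: int, nodes: int) -> int:
    """CPUs in each GRID node, assuming an even split."""
    return total_cpus // nodes


def temporary_storage_tbyte(nodes: int, disks_per_node: int, disk_gbyte: int) -> float:
    """Total temporary disk space in TByte (decimal units)."""
    return nodes * disks_per_node * disk_gbyte / 1000


def nearline_storage_tbyte(disk_tbyte: int, tape_tbyte: int) -> int:
    """Nearline storage is the sum of the disk and tape tiers."""
    return disk_tbyte + tape_tbyte


if __name__ == "__main__":
    print("CPUs per node:", cpus_per_node(TOTAL_CPUS, GRID_NODES))
    print("Temporary storage (TByte):",
          temporary_storage_tbyte(GRID_NODES, DISKS_PER_NODE, DISK_SIZE_GBYTE))
    print("Nearline storage (TByte):",
          nearline_storage_tbyte(NEARLINE_DISK_TBYTE, NEARLINE_TAPE_TBYTE))
```

Under these assumptions each node holds 100 CPUs, the temporary storage comes to about 12.3 TByte (consistent with the quoted 12 TByte), and the disk and tape tiers add up to the quoted 410 TByte.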
SUNET connectivity
[Diagram labels: GigaSunet 10 Gbit/s; Typical POP at Univ.; 2.5 Gbit/s; 10 Gbit/s; Univ. LAN; SweGrid 1 Gbps Dedicated]
Persistent storage on SweGrid
[Figure labels: 1, 2, 3; Size; Administration; Bandwidth; Availability]
SweGrid status
- Nodes installed January 2004, now becoming old
- Extensive use of the resources
  - Local batch queues
  - GRID queues through the NorduGrid middleware ARC
  - Some nodes also available on gLite. Part of North Federation resources
- 60 national users
- 1/3 of SweGrid is dedicated to HEP (200 CPUs)
- Significant contribution to LCG challenges
  - As a partner in NorduGrid
  - Also supporting LCG (gLite)
  - Working on compatibility between ARC and LCG
- Forms the core of the Northern EGEE ROC
- Accounting has been introduced (SGAS)
- Development of general and application specific grid portals
- Development of grid-enabled database technology
- Development of database technology for streaming data
SweGrid Observations
- Global user identity
  - Each SweGrid user must receive a unique X.509 certificate
  - All centers must agree on a common lowest level of security. This will affect general security policy for HPC centers.
- Unified support organization
  - All helpdesk activities and other support need to be coordinated between centers. Users cannot (and should not) decide where their jobs will be run, and expect the same level of service at all sites.
- More bandwidth is needed
  - To be able to move data between the nodes in SweGrid before and after execution of jobs, continuously increasing bandwidth will be needed
- More storage is needed
  - Users cannot, despite increasing bandwidth, fetch all data back home. Storage for both temporary and permanent data will be needed in close proximity to processor capacity
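A rough transfer-time estimate shows why bandwidth and storage location matter. The sketch below is an idealized calculation, not a measurement: `transfer_time_hours` is a hypothetical helper, it uses decimal units, and the `efficiency` parameter is an assumed knob for protocol overhead and link sharing. The data volumes and link rates in the usage lines are the ones quoted elsewhere in this presentation (12 TByte temporary storage, 140 TByte disk, 1 and 10 Gbit/s links).

```python
def transfer_time_hours(data_tbyte: float, link_gbit_per_s: float,
                        efficiency: float = 1.0) -> float:
    """Idealized time in hours to move data_tbyte over a link.

    efficiency is the assumed fraction of the nominal link rate that is
    actually achieved (1.0 means a perfect, fully dedicated link).
    """
    if data_tbyte < 0:
        raise ValueError("data_tbyte must be non-negative")
    if link_gbit_per_s <= 0:
        raise ValueError("link_gbit_per_s must be positive")
    if not 0 < efficiency <= 1:
        raise ValueError("efficiency must be in (0, 1]")
    bits = data_tbyte * 1e12 * 8                      # TByte -> bits
    seconds = bits / (link_gbit_per_s * 1e9 * efficiency)
    return seconds / 3600


if __name__ == "__main__":
    for volume in (1, 12, 140):                       # TByte
        for rate in (1, 10):                          # Gbit/s
            hours = transfer_time_hours(volume, rate)
            print(f"{volume:>4} TByte at {rate:>2} Gbit/s: {hours:8.1f} h")
```

Even with a perfect 1 Gbit/s link, moving 140 TByte would take roughly 13 days under these assumptions, which supports keeping storage close to the processors.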
Large data at HPC centers
[Diagram labels: Database queries; Service; Long term file storage; Near line storage; Temporary storage]
A Proposal
- A Swedish infrastructure (Database SNIC) for data curation and services
  - 5 application experts (curation and user support)
  - 5 technicians (services and tool development)
- Driven by owners of data
- Hardware infrastructure provided by SNIC
- Software infrastructure
  - Licenses
  - Tools developed by SNIC centers and users in collaboration
- SUNET access
- Grid based access
SNIC Storage Landscape
- A few centers nominated as hosts for data intensive applications
- These centers will have up to 10 petabyte storage capacity
  - User driven build-up
- Storage is nationally available (on all SNIC systems)
- Swedish Infrastructure for Data Curation and Services
  - Hosting e-science databases
Database GRID
[Diagram labels: Resources; Middleware; AAA; Curators; Data; Portal; Meta; GRID; Processors; Users; Technicians; Data; Portal]
The Big Questions?
- Is there a need to coordinate a Swedish infrastructure for databases in the same way as SNIC coordinates the HPC infrastructure?
- Are there synergies to be found between the HPC Grid infrastructure and Swedish databases?
- If so, which forms of collaboration should be established?