Italian Scientific Big Data Initiative
|
|
|
- Michael Flynn
- 10 years ago
- Views:
Transcription
1 Italian Scientific Big Data Initiative Sanzio Bassini Director of Supercomputing Application & Innovation Department Casalecchio di Reno (BO) Via Magnanelli 6/3, Casalecchio di Reno Cineca Big Data 1
2 CINECA Interuniversity Consortium 69 Italian Universities CNR, OGS, SISSA/ISAS Trieste Ministry of University and Research Cineca Big Data 2
3 Mission Private non for Profit Organization Founded in 1969 by Ministry of Public Education now under the control of Ministry Education, University and Research Main activities: Promote the use of the most advanced HPC systems to support public and private scientific and technological research o Services both for Universities, National Research Institute, private enterprises and Ministry Three operational site o Bologna (consortium headquarter), Milan, Rome o 711 employees Supercomputing, Application & Innovation Dpt. Tier-0 and Tier-1 Supercomputing service Data processing and data management for scientific big data Enabling specialistic support for application development and exploitation Cineca Big Data 3
4 The focus Cineca Big Data 4
5 Data as an asset One of the most significant changes of the past decade has been the widespread recognition of data as an asset rather than the refuse of research. Cineca Big Data 5
6 G8 + O5 White Paper 5 Principles for an Open Data Infrastructure 1. Discoverability Implementation of appropriate persistent identifier frameworks, adoption of descriptive metadata standards, and the use of appropriate data formats and taxonomies. 2. Accessibility Publicly funded scientific research data should be made openly available with as few restrictions as possible. Ethical, legal or commercial constraints may be imposed on the use of research data to ensure that the research process is not damaged by inappropriate release of data. 3. Understandability Scientific data sets must be understandable in order to be effectively used. A set of numbers, texts, pictures or even videos alone cannot be understandable without additional context, semantics, data analysis tools, and algorithms. 4. Manageability Data management policies and plans must make it clear who is responsible for maintaining the availability of data and how the associated costs are to be met including issues associated with curation, storage and services. 5. People A global approach to research data infrastructure requires a highly skilled and adaptable workforce and culture that is able to capture the available data and make it available to those that are able to use it appropriately. Cineca Big Data 6
7 G8+O5 Data working group and GSO Report Recommendations 9 & 10 Recommendation 9 - e-infrastructure Global research infrastructure initiatives should recognize the utility of the integrated use of advanced e-infrastructures, services for accessing and processing, and curating data, as well as remote participation (interaction) and access to scientific experiments. Recommendation 10 - Data exchange Global scientific data infrastructure providers and users should recognise the utility of data exchange and interoperability of data across disciplines and national boundaries as a means to broadening the scientific reach of individual data sets. Cineca Big Data 7
8 Research model data lifecycle Follow-up research New research Undertake research reviews Scrutinize findings Teach and learn Enter data, digitize, transcribe, translate Check, validate, clean data Anonymise data where necessary Describe data Manage and store data Interpret data Derive data Produce research outputs Author publications Prepare data for preservation Distribute data Share data Control access Establish copyright Promote data Migrate data to best format Migrate data to suitable medium Backup and store data Create metadata and documentation Archive data Cineca Big Data 8
9 Data Domain Scientists personal data Homeless scientists Citizen scientists Community repositories Institute repositories Cineca Big Data 9
10 EUDAT available services Safe replication and archiving Data stage in & out Metadata indexing and search User friendly service for data upload & download Cineca Big Data 10
11 National Scientific Data Repository Aim of the project the creation of the national scientific data repository Objectives: Service provision for national and EU funded project: - EUDAT (European Data Infrastructure) - V-MUST (Virtual Museum Transnational Network) - Connettomics ICON Foundation, - LENS Human Brain Project - Fluid dynamic COST Action MP0806 Particles in turbulence - EuHIT - facilities for turbulence research - Astrophysics Mission PLANK, Mission GAIA - Geophysics INGV RI EPOS Project - NFFA / Sincrotrone - Nanoscience Foundries and Fine Analysis - Elixir Italian National Infrastructure - Epigen National Epigenetic infrastructure - Nextdata National Environment data infrastructure New business marketplace Improve the processing of data (include the data scarcely structured) Cineca Big Data 11
12 CINECA Current infrastructure External Data sources Internal Data sources PRACE EUDAT Lab A Prj A Lab B Prj B Lab Prj FERMI Tier- 0 > 2PF Next PLX Tier-1 > 1PF Data store Data Processing Workspace > 5 PByte Repository > 5PByte High IOPS > 50 Tbyte SSD Tape >12 Pbyte FERMI High througput NUBES / PLX Cloud serv. HPDA Front end cluster 64 thin node, 8 Fat node viz Big mem Data mover DB Web serv. processing Web Archive FTP Cineca Big Data 12
13 EUDAT-EPOS success story The whole archive of the INGV s data center in Rome is replicated to CINECA, the EUDAT s data center in Bologna ( years: ) Granularity is at single file level (the collection of sesimic waveforms related to 1 sensor for 1 day) Metadata are also replicated, as files (dataless). The replicas are stored, synchronized and accessed by a single user, the INGV community manager. Cineca Big Data 13
14 SmartDATA project CINECA s project to deal with BigData and HPC: a Big Data Analytics engine for OpenDATA catalogues EUDAT choerent Initial pre-conditions: Data are archived, catalogued, searchable and accessible: - Robust and reliable infrastructure - Interoperability and adoption of standards SmartData satisfies these requirements and besides: make available an HPC environment with analysis tools which foster the re-use of such data. ease the information exchange among experts belonging to different disciplines, from both the point of view of the methods and of the data.! Hence the data sharing creates new knowledge Cineca Big Data 14
15 SmartDATA project SmartDATA Portal and apps SmartDATA users orchestrator HPC Front-End Job Scheduler workflows ingestion Data Registry (irods) for OpenDATA sources repository Scratch staging Supercomputer analytic engine Cineca Big Data 15
16 EuHIT project EuHIT is a consortium that aims at integrating cutting-edge European facilities for turbulence research across national boundaries ( ). It includes 25 research institutes and 2 industrial partners from 10 European countries. A total of 14 cutting-edge turbulence research infrastructures CINECA will provide the technical and hardware infrastructure for the turbulence data digital library service (DLTD), offering: A service to manage data coming from numerical simulations and experiments in the field of fluid dynamics. A storage space of 150 TB online. A helpdesk and specialized support. Cineca Big Data 16
17 EUHIT project TurBase, a freely accessible living knowledgebase for high quality turbulence data, will rely on the CINECA s DLTD service, which will be implemented using the same technology adopted by EUDAT to foster interoperability and accessibility Cineca Big Data 17
18 Human Brain project The Human Brain Project (HBP) is a large scale European research project which aims to simulate the world's first and most exact human brain in a supercomputer. Working together with 86 different European institutions the project is planned to last ten years ( ). To image an entire brain many parallel stacks of images are acquired. They are afterwards merged with a custom-made algorithm suited to work with very large data sets (~ 1 TB) Cineca Big Data 18
19 Human Brain project WP7.5 High Performance Computing Platform: integration and operations CINECA is responsible for Task The HBP supercomputer for massive data analytics: Implement and operate a data-centric HPC facility providing efficient storage, processing and management of large volumes of data generated by the HBP. The HBP Massive Data Analytics Supercomputer in CINECA will provide Tier-0 and Tier-1 service, integrated with a mass storage facility of more than 5 Petabytes of working space with an aggregated file system bandwidth of about 70 GB/sec in a full production environment. The system will be integrated with a Data Facility of more than 5 Petabytes for online disk storage repository and more 10 Petabytes for long term data archiving, providing efficient data life-cycle management of structured and un-structured data generated by the HBP. Cineca Big Data 19
20 Sample preparation 1. Transcardiac mouse perfusion with PBS 1X followed by Paraformaldehyde 2. Extraction of the brain from the skull & post-fix in PFA 4% 4 C. 3. (optional) excision of the desired part of the brain. 4. Embedding of the sample in a 0.5 % w/w agarose gel. 5. Dehydration in graded ethanol series. 6. (for whole brains) additional dehydration in hexane 100% (1h). 7. Incubation in clearing solution (1:2 Benzyl Alcohol / Benzyl Benzoate). Uncleared mouse brain Cleared mouse brain Cineca Big Data 20
21 Whole Brain Imaging Scale bars: 1 mm TV SV CV Cerebellum from a P10 L7-GFP mouse Total volume 73 mm 3, voxel size µm 3, acquisition time 24 h (1.3 MegaVoxels/s) Cineca Big Data 21
22 Whole Brain Imaging Scale bars: 200 µm Whole brain from P15 thy1-gfp-m mouse Portions of hippocampus and superior colliculus are magnified Total volume 223 mm 3, voxel size µm 3, acquisition time 3 days (1.3 MegaVoxels/s) Cineca Big Data 22
23 Cineca Big Data 23
24 Initiatives (HPC HPDA) HPC Tier-0 (FERMI) >2PF; Tier-1 > 1PF; Open Access Peer Review PRACE, ISCRA, LISA HPDA Nubes, cloud, remote visualization, structured and unstructured data processing 80 servers node Linux cluster Repository Access Reference Curation Replica Preservation 10PB 5PB 5PB 12 PB WS Repository LT archiving Cineca Big Data 24
25 Business model for (Cineca) RI Research infrastructure supported by Minister, National and European (structured funds) Scientific community contribution for marginal and operational costs Standard service provision best practices are commonly in use by Cineca which ISO fully certified Partnership and scientific collaboration Vertical disciplinary project should (have to) define in its budget the contribution for the persistency of the (Cineca) RI on base of its programmatic access Pay for service model (applicable for Cineca being a private organization) does imply costs for VAT and couldn t be widely applicable Open tender for service procurement may be very difficult to apply because of the difficult to define a suitable terms of reference documentation for in progress scientific activities The skills and competence profiles need a continuously training programme part also on the job Technology transfer process also in collaboration with scientific communities for added value service towards private organizations and industries. Cineca Big Data 25
26 Many thanks for your Bologna attention! Rome Milan Cineca Big Data 26
Evoluzione dell Infrastruttura di Calcolo e Data Analytics per la ricerca
Evoluzione dell Infrastruttura di Calcolo e Data Analytics per la ricerca Carlo Cavazzoni CINECA Supercomputing Application & Innovation www.cineca.it 21 Aprile 2015 FERMI Name: Fermi Architecture: BlueGene/Q
EUDAT. Towards a pan-european Collaborative Data Infrastructure. Willem Elbers
EUDAT Towards a pan-european Collaborative Data Infrastructure Willem Elbers EUDAT / MPI-TLA Focus meeting: Data repositories SURF, Utrecht March 3, 2014 Outline EUDAT project EUDAT services Summary and
SURFsara Data Services
SURFsara Data Services SUPPORTING DATA-INTENSIVE SCIENCES Mark van de Sanden The world of the many Many different users (well organised (international) user communities, research groups, universities,
European Data Infrastructure - EUDAT Data Services & Tools
European Data Infrastructure - EUDAT Data Services & Tools Dr. Ing. Morris Riedel Research Group Leader, Juelich Supercomputing Centre Adjunct Associated Professor, University of iceland BDEC2015, 2015-01-28
How To Build An Open Source Data Infrastructure
EUDAT Collaborative Data Infrastructure Towards the convergence of Compute, Data, Knowledge and Scientific Instruments Giuseppe Fiameni CINECA www.eudat.eu EUDAT receives funding from the European Union's
OpenAIRE Research Data Management Briefing paper
OpenAIRE Research Data Management Briefing paper Understanding Research Data Management February 2016 H2020-EINFRA-2014-1 Topic: e-infrastructure for Open Access Research & Innovation action Grant Agreement
E-learning and Student Management System: toward an integrated and consistent learning process
E-learning and Student Management System: toward an integrated and consistent learning process Matteo Bertazzo 1, Franca Fiumana 2 1 CINECA, Information and Knowledge Management Services Department, via
Optimizing IT Deployment Issues
Optimizing IT Deployment Issues Trends and Challenges for Engineering Simulation Barbara Hutchings [email protected] 1 Outline Deployment Challenges and Trends Extreme scale up and scale out
irods at CC-IN2P3: managing petabytes of data
Centre de Calcul de l Institut National de Physique Nucléaire et de Physique des Particules irods at CC-IN2P3: managing petabytes of data Jean-Yves Nief Pascal Calvat Yonny Cardenas Quentin Le Boulc h
EUDAT - Open Data Services for Research
EUDAT - Open Data Services for Research Per Öster 05.03.2015 CSC at a Glance Founded in 1971 as a technical support unit for Univac 1108 Connected Finland to the Internet in 1988 Reorganized as a company,
Horizon 2020. Research e-infrastructures Excellence in Science Work Programme 2016-17. Wim Jansen. DG CONNECT European Commission
Horizon 2020 Research e-infrastructures Excellence in Science Work Programme 2016-17 Wim Jansen DG CONNECT European Commission 1 Before we start The material here presented has been compiled with great
BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON
BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON Overview * Introduction * Multiple faces of Big Data * Challenges of Big Data * Cloud Computing
Clodoaldo Barrera Chief Technical Strategist IBM System Storage. Making a successful transition to Software Defined Storage
Clodoaldo Barrera Chief Technical Strategist IBM System Storage Making a successful transition to Software Defined Storage Open Server Summit Santa Clara Nov 2014 Data at the core of everything Data is
Local Loading. The OCUL, Scholars Portal, and Publisher Relationship
Local Loading Scholars)Portal)has)successfully)maintained)relationships)with)publishers)for)over)a)decade)and)continues) to)attract)new)publishers)that)recognize)both)the)competitive)advantage)of)perpetual)access)through)
Digital preservation a European perspective
Digital preservation a European perspective Pat Manson Head of Unit European Commission DG Information Society and Media Cultural Heritage and Technology Enhanced Learning Outline The digital preservation
SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS
Sean Lee Solution Architect, SDI, IBM Systems SCALABLE FILE SHARING AND DATA MANAGEMENT FOR INTERNET OF THINGS Agenda Converging Technology Forces New Generation Applications Data Management Challenges
CINECA DSpace-CRIS : An Open Source Solution Use Science 2013 w.c ineca.it
CINECA DSpace-CRIS : An Open Source Solution ~ Use Science 2013 Open www.cineca.it Infrastructure to Foster Collaboration between Industry and Academia Topics CINECA: a brief overview Solutions for Higher
Big Data in the Digital Cultural Heritage
Big Data in the Digital Cultural Heritage Antonella Fresa, Promoter Srl DCH-RP Technical Coordinator 1 Table of Content Digitisation of Cultural Heritage Toward an e-infrastructure for Digital Cultural
Enabling the re-use of research data: organising stakeholders and infrastructure in the Netherlands
Enabling the re-use of research data: organising stakeholders and infrastructure in the Netherlands Ingrid Dillo, deputy director DANS RECODE conference, Athens 15-01-2015 Challenges in sharing data Technical
Exploitation of ISS scientific data
Cooperative ISS Research data Conservation and Exploitation Exploitation of ISS scientific data Luigi Carotenuto Telespazio s.p.a. Copernicus Big Data Workshop March 13-14 2014 European Commission Brussels
Why long time storage does not equate to archive
Why long time storage does not equate to archive Jos van Wezel HUF Toronto 2015 STEINBUCH CENTRE FOR COMPUTING - SCC KIT University of the State of Baden-Württemberg and National Laboratory of the Helmholtz
e-irg workshop Dublin 22-23 May 2013 Track 1: Coordination of e-infrastructures
e-irg workshop Dublin 22-23 May 2013 Track 1: Coordination of e-infrastructures Rossend Llurba e-irgsp3 Track 1 2 sessions Session 1 (Chair: Lajos Balint) 4 presentations Bob Jones Stephen Moffat Sandra
BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything
BlueArc unified network storage systems 7th TF-Storage Meeting Scale Bigger, Store Smarter, Accelerate Everything BlueArc s Heritage Private Company, founded in 1998 Headquarters in San Jose, CA Highest
Managing Complexity in Distributed Data Life Cycles Enhancing Scientific Discovery
Center for Information Services and High Performance Computing (ZIH) Managing Complexity in Distributed Data Life Cycles Enhancing Scientific Discovery Richard Grunzke*, Jens Krüger, Sandra Gesing, Sonja
CEDA Storage. Dr Matt Pritchard. Centre for Environmental Data Archival (CEDA) www.ceda.ac.uk
CEDA Storage Dr Matt Pritchard Centre for Environmental Data Archival (CEDA) www.ceda.ac.uk How we store our data NAS Technology Backup JASMIN/CEMS CEDA Storage Data stored as files on disk. Data is migrated
H2020 Guidelines on Open Data and Data Management Plan
H2020 Guidelines on Open Data and Data Management Plan CRR Centro Risorse per la Ricerca Multimediale Why? Open scientific research data should be easily discoverable, accessible, assessable, intelligible,
Report of the DTL focus meeting on Life Science Data Repositories
Report of the DTL focus meeting on Life Science Data Repositories Goal The goal of the meeting was to inform and discuss research data repositories for life sciences. The big data era adds to the complexity
Digital Preservation Strategy, 2012-2015
Digital Preservation Strategy, 2012-2015 Preface This digital preservation strategy sets out what the National Library of Wales (NLW) intends to do to preserve digital materials over the next three years.
Workprogramme 2014-15
Workprogramme 2014-15 e-infrastructures DCH-RP final conference 22 September 2014 Wim Jansen einfrastructure DG CONNECT European Commission DEVELOPMENT AND DEPLOYMENT OF E-INFRASTRUCTURES AND SERVICES
The Key Elements of Digital Asset Management
The Key Elements of Digital Asset Management The last decade has seen an enormous growth in the amount of digital content, stored on both public and private computer systems. This content ranges from professionally
EUROPEAN COMMISSION Directorate-General for Research & Innovation. Guidelines on Data Management in Horizon 2020
EUROPEAN COMMISSION Directorate-General for Research & Innovation Guidelines on Data Management in Horizon 2020 Version 2.0 30 October 2015 1 Introduction In Horizon 2020 a limited and flexible pilot action
Data management challenges in todays Healthcare and Life Sciences ecosystems
Data management challenges in todays Healthcare and Life Sciences ecosystems Jose L. Alvarez Principal Engineer, WW Director Life Sciences [email protected] Evolution of Data Sets in Healthcare
A Service for Data-Intensive Computations on Virtual Clusters
A Service for Data-Intensive Computations on Virtual Clusters Executing Preservation Strategies at Scale Rainer Schmidt, Christian Sadilek, and Ross King [email protected] Planets Project Permanent
The challenges of becoming a Trusted Digital Repository
The challenges of becoming a Trusted Digital Repository Annemieke de Jong is Preservation Officer at the Netherlands Institute for Sound and Vision (NISV) in Hilversum. She is responsible for setting out
LJMU Research Data Policy: information and guidance
LJMU Research Data Policy: information and guidance Prof. Director of Research April 2013 Aims This document outlines the University policy and provides advice on the treatment, storage and sharing of
Reference Architectures for Repositories and Preservation Archiving
Reference Architectures for Repositories and Preservation Archiving Keith Rajecki Education Solutions Architect Sun Microsystems, Inc. 1 Agenda Challenges Solution Architectures > Open Storage/Open Archive
THE EMC ISILON STORY. Big Data In The Enterprise. Copyright 2012 EMC Corporation. All rights reserved.
THE EMC ISILON STORY Big Data In The Enterprise 2012 1 Big Data In The Enterprise Isilon Overview Isilon Technology Summary 2 What is Big Data? 3 The Big Data Challenge File Shares 90 and Archives 80 Bioinformatics
Designing a Cloud Storage System
Designing a Cloud Storage System End to End Cloud Storage When designing a cloud storage system, there is value in decoupling the system s archival capacity (its ability to persistently store large volumes
NITRD and Big Data. George O. Strawn NITRD
NITRD and Big Data George O. Strawn NITRD Caveat auditor The opinions expressed in this talk are those of the speaker, not the U.S. government Outline What is Big Data? Who is NITRD? NITRD's Big Data Research
The UK INCF Node and the CARMEN project. Presenter: Leslie Smith [email protected]
The UK INCF Node and the CARMEN project Presenter: Leslie Smith [email protected] Content Events and activities at UK INCF Node Recent Forthcoming The CARMEN project History, current status, ways
Big Data. George O. Strawn NITRD
Big Data George O. Strawn NITRD Caveat auditor The opinions expressed in this talk are those of the speaker, not the U.S. government Outline What is Big Data? NITRD's Big Data Research Initiative Big Data
Open Access and Open Research Data in Horizon 2020
Open Access and Open Research Data in Horizon 2020 Celina Ramjoué Head of Sector Open Access to Scientific Publications and Data Digital Science Unit CONNECT.C3 22 November 2013 Train the Trainer for H2020
CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21)
CYBERINFRASTRUCTURE FRAMEWORK FOR 21 st CENTURY SCIENCE AND ENGINEERING (CIF21) Goal Develop and deploy comprehensive, integrated, sustainable, and secure cyberinfrastructure (CI) to accelerate research
Big Data Analytics Service Definition G-Cloud 7
Big Data Analytics Service Definition G-Cloud 7 Big Data Analytics Service Service Overview ThinkingSafe s Big Data Analytics Service allows information to be collected from multiple locations, consolidated
Action full title: Universal, mobile-centric and opportunistic communications architecture. Action acronym: UMOBILE
Action full title: Universal, mobile-centric and opportunistic communications architecture Action acronym: UMOBILE Deliverable: D.6.10 - Data Management Plan Project Information: Project Full Title Project
Big Data - Infrastructure Considerations
April 2014, HAPPIEST MINDS TECHNOLOGIES Big Data - Infrastructure Considerations Author Anand Veeramani / Deepak Shivamurthy SHARING. MINDFUL. INTEGRITY. LEARNING. EXCELLENCE. SOCIAL RESPONSIBILITY. Copyright
Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000
Business-centric Storage FUJITSU Hyperscale Storage System ETERNUS CD10000 Clear the way for new business opportunities. Unlock the power of data. Overcoming storage limitations Unpredictable data growth
Big Data Analytics Platform @ Nokia
Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform
Data Centric Systems (DCS)
Data Centric Systems (DCS) Architecture and Solutions for High Performance Computing, Big Data and High Performance Analytics High Performance Computing with Data Centric Systems 1 Data Centric Systems
Big data management with IBM General Parallel File System
Big data management with IBM General Parallel File System Optimize storage management and boost your return on investment Highlights Handles the explosive growth of structured and unstructured data Offers
Big Data and Cloud Computing for GHRSST
Big Data and Cloud Computing for GHRSST Jean-Francois Piollé ([email protected]) Frédéric Paul, Olivier Archer CERSAT / Institut Français de Recherche pour l Exploitation de la Mer Facing data deluge
Regional Vision and Strategy The European HPC Strategy
Regional Vision and Strategy The European HPC Strategy BDEC 29 January 2015 Augusto BURGUEÑO ARJONA Head of Unit einfrastructure - DG CONNECT European Commission @ABurguenoEU aburguenoeu.wordpress.com
Project Number: 284941 Project Title: Human Brain Project. HBP_SP13_EPFL_14-0205_D13.3.2_Final.docx
Project Number: 284941 Project Title: Human Brain Project Document Title: Document Filename (1) : Deliverable Number: Deliverable Type: HBP Data Management Plan HBP_SP13_EPFL_14-0205_D13.3.2_Final.docx
Software services competence in research and development activities at PSNC. Cezary Mazurek PSNC, Poland
Software services competence in research and development activities at PSNC Cezary Mazurek PSNC, Poland Workshop on Actions for Better Participation of New Member States to FP7-ICT Timişoara, 18/19-03-2010
Inside Track Research Note. In association with. Enterprise Storage Architectures. Is it only about scale up or scale out?
Research Note In association with Enterprise Storage Architectures Is it only about scale up or scale out? August 2015 About this The insights presented in this document are derived from independent research
ENHANCED PUBLICATIONS IN THE CZECH REPUBLIC
ENHANCED PUBLICATIONS IN THE CZECH REPUBLIC PETRA PEJŠOVÁ, HANA VYČÍTALOVÁ [email protected], [email protected] The National Library of Technology, Czech Republic Abstract The aim of this
Cloud and Big Data Standardisation
Cloud and Big Data Standardisation EuroCloud Symposium ICS Track: Standards for Big Data in the Cloud 15 October 2013, Luxembourg Yuri Demchenko System and Network Engineering Group, University of Amsterdam
HPC technology and future architecture
HPC technology and future architecture Visual Analysis for Extremely Large-Scale Scientific Computing KGT2 Internal Meeting INRIA France Benoit Lange [email protected] Toàn Nguyên [email protected]
Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
How To Manage Your Digital Assets On A Computer Or Tablet Device
In This Presentation: What are DAMS? Terms Why use DAMS? DAMS vs. CMS How do DAMS work? Key functions of DAMS DAMS and records management DAMS and DIRKS Examples of DAMS Questions Resources What are DAMS?
Autonomy Consolidated Archive
Autonomy Consolidated Archive Dennis Wild Director SME, Information Governance and Archiving POWER PROTECT PROMOTE Meaning-Based Governance Files IM Audio Email Social Video SharePoint Archiving = Gain
Diagram 1: Islands of storage across a digital broadcast workflow
XOR MEDIA CLOUD AQUA Big Data and Traditional Storage The era of big data imposes new challenges on the storage technology industry. As companies accumulate massive amounts of data from video, sound, database,
Real Time Big Data Processing
Real Time Big Data Processing Cloud Expo 2014 Ian Meyers Amazon Web Services Global Infrastructure Deployment & Administration App Services Analytics Compute Storage Database Networking AWS Global Infrastructure
Research Data Management
Research Data Management 1 Why to we need to Manage Data? 2 Data Management Planning Typically covers: - What data will be created (format, types) and how? - How will the data be documented and described?
CROSS PLATFORM AUTOMATIC FILE REPLICATION AND SERVER TO SERVER FILE SYNCHRONIZATION
1 E N D U R A D A T A EDpCloud: A File Synchronization, Data Replication and Wide Area Data Distribution Solution CROSS PLATFORM AUTOMATIC FILE REPLICATION AND SERVER TO SERVER FILE SYNCHRONIZATION 2 Resilient
Distributed File Systems An Overview. Nürnberg, 30.04.2014 Dr. Christian Boehme, GWDG
Distributed File Systems An Overview Nürnberg, 30.04.2014 Dr. Christian Boehme, GWDG Introduction A distributed file system allows shared, file based access without sharing disks History starts in 1960s
Kriterien für ein PetaFlop System
Kriterien für ein PetaFlop System Rainer Keller, HLRS :: :: :: Context: Organizational HLRS is one of the three national supercomputing centers in Germany. The national supercomputing centers are working
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences
Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences WP11 Data Storage and Analysis Task 11.1 Coordination Deliverable 11.3 Selected Standards
