Life Sciences and Large Data Challenges
|
|
|
- Dora Ferguson
- 10 years ago
- Views:
Transcription
1 Life Sciences and Large Data Challenges David Fergusson Head of Scientific Computing The Francis Crick Institute
2 WHAT IS THE CRICK?
3 The Francis Crick Institute
4 Sir Paul Nurse Nobel Prize with Hartwell and Hunt for discovery of cyclins and CDK which control the cell cycle. President of the Royal Society and Chief Executive and Director of the Francis Crick Institute.
5 Synthesis of two institutes National Institute for Medical Research (NIMR) MRC Nobel Laureates - Sir Peter Medawar, Sir Frank Macfarlane Burnett, Sir Henry Hallett Dale, Archer John Porter Martin Dame Margaret Thornton London Research Institute (LRI) - CRUK Nobel Laureates - Renato Delbecco, Paul Nurse, Tim Hunt
6 Partners Wellcome Trust Medical Research Council Cancer research UK University College London King s College London Imperial College London
7 Crick Vision 1) Pursue discovery without boundaries 2) Create future science leaders 3) Collaborate creatively to advance UK science and innovation 4) Accelerate translation for health and wealth 5) Engage and inspire the public
8 Scientific Computing Vision Support Scientists Platforms to accelerate transformations Improve analysis Data Security Cooperate to develop novel methods Collaboration Secure shared data Shared best Practise Platforms for national & International biomedical collaboration Engage & Inspire Public Focussed examples Support science curricula Support Crick comms activity Promote Training Safe and sophisticated teaching environments Best Practise exemplars Expand experimental horizons Streamline workflows
9 Data Centre On-site/Off-site Strategy On-site/Off-site strategy to be implemented from day one Purposely designed so that the physical building does not limit the computational needs of the science On-site Data Centre - modest size, designed to hold just 40 high density racks Location: rooftop to take free-air cooling, 750kW of power. Average 18kW per rack. Immediate data collection and processing, data staging before transfer or replication to offsite data centre It will also host key services for our users and for the building itself. DATA CTRE
10 CREATING A NEW BIOMEDICAL INSTITUTE: THE CONTEXT
11 CHALLENGES
12 $1,000 Genome?? Not yet Data from NHGRI Sequencing Program April 11 th
13 Big Data High Energy Physics - CERN Hadron Collider generates big data, > 1Pb per month Astronomy will generate extremely big data (SKA) potentially many Petabytes per day..exascale computing Life/Biomedical Sciences are generating a lot of data But the potential to generate ever growing volumes of data exists and is set to increase rapidly.
14 Trust networks Trust networks to support big computation have been created and shown to work. Big Data is a new opportunity to base these around shared data resources. Just as big computation was (and is) out of reach for many organisations so is big data for many.
15 Complex Data Complex data / Complex analytics Distributed data in numerous data stores Clinical Data presents new challenges Legal, ethical, transmission security etc. Managing and tracking the data Securing and auditing access to clinical data Scale of the data involved Challenge: To develop the tools/infrastructure/middleware in a common way as opposed to the many groups developing strategies independently and across the globe.
16 Changing the dynamic Data centric not compute centric. Data problems are harder to deal with than compute problems. Data is hard (expensive) to move. Data requires curation (provenance). Big data silos trusted data suppliers Move the compute to the data Provide services around data (SaaS) Improve speed Streamline worksflows Support better data practice (no opportunity to leave CDs on trains)
17 CREATING A DATA CENTRE FOR THE CRICK
18 Offsite Data Centre/ Collaborative Data Centre We will also have the ability to offer collaborative space for stakeholders and others In the future we will want to analyse distributed data sets but this needs work and is a way off A joint data centre model provides a platform to not only share data but it acts as a catalyst for collaboration particularly at the infrastructure level I believe this is the biggest win initially and that the science will inevitably benefit from this collaborative model Examples of this happening in the U.S include:- CGHub David Haussler - Santa Cruz have installed a cluster local to the hub to provide an analysis engine close to the data New York Genome Centre - Identical IT strategy onsite/offsite and providing central computation for 10+ stakeholders
19 Collaborative Data Centre Collaborative Data Centre provides Private space Secure Collaborative Private Space Private space Private space Private space Private colocation (traditional) Logical Extension to local LAN Collaborative/Shared space, Secure space for sensitive data (patient data) Unique, powerful centre to build, test, deploy new infrastructure tools between Organisations. HPC where the data resides!!!!
20 In the Cloud
21 Offsite Data Centre Community Cloud Model LRI LRI UCL NIMR Clinical data sharing private networks through lightpaths? Others The Crick King s King s College College SANGER IMPERIAL Others
22 IN CONCLUSION
23 Collaborative Space Life Science Hub emedlab (?) Promote Skills Development (Systems, Informatics) Prototyping and deploying standards across multiple entities (Global Alliance) Promotes collaboration (both at IT and Informatics levels faster development, less duplication of effort de-facto standards) Produce real world infrastructure tools (production use across collaborating partners) Provide Sandboxes (testing development) Attractive to Industry partners (hardware evaluations, new technology deployment) Prototype public cloud techniques in private setting (safe environment) Safe Haven for sensitive data that should not move to public cloud Provide easier access to larger data sets. Pooled resources maximise Capital investment benefits for small and large user
24 Thank You
Keystones for supporting collaborative research using multiple data sets in the medical and bio-sciences
Keystones for supporting collaborative research using multiple data sets in the medical and bio-sciences David Fergusson Head of Scientific Computing The Francis Crick Institute The Francis Crick Institute
Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing.
Exploring the roles and responsibilities of data centres and institutions in curating research data a preliminary briefing. Dr Liz Lyon, UKOLN, University of Bath Introduction and Objectives UKOLN is undertaking
NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons
The NIH Commons Summary The Commons is a shared virtual space where scientists can work with the digital objects of biomedical research, i.e. it is a system that will allow investigators to find, manage,
High Performance Computing Initiatives
High Performance Computing Initiatives Eric Stahlberg September 1, 2015 DEPARTMENT OF HEALTH AND HUMAN SERVICES National Institutes of Health National Cancer Institute Frederick National Laboratory is
Report to UCLH Board Bryan Williams MD FRCP FESC FAHA
NIHR University College London Hospitals Report to UCLH Board Bryan Williams MD FRCP FESC FAHA Professor of Medicine and BRC Director 18/03/2013 The NIHR UCLH BRC Renewed funding circa 100M for 5 years
The National Consortium for Data Science (NCDS)
The National Consortium for Data Science (NCDS) A Public-Private Partnership to Advance Data Science Ashok Krishnamurthy PhD Deputy Director, RENCI University of North Carolina, Chapel Hill What is NCDS?
Workprogramme 2014-15
Workprogramme 2014-15 e-infrastructures DCH-RP final conference 22 September 2014 Wim Jansen einfrastructure DG CONNECT European Commission DEVELOPMENT AND DEPLOYMENT OF E-INFRASTRUCTURES AND SERVICES
The 100,000 genomes project
The 100,000 genomes project Tim Hubbard @timjph Genomics England King s College London, King s Health Partners Wellcome Trust Sanger Institute ClinGen / Decipher Washington DC, 26 th May 2015 The 100,000
Steven Newhouse, Head of Technical Services
Challenges at EMBL-EBI Steven Newhouse, Head of Technical Services European Bioinformatics Institute Outstation of the European Molecular Biology Laboratory International organisation created by treaty
4net Technologies. Managed Services and Cloud Solutions
4net Technologies Managed Services and Cloud Solutions Managed Services and Cloud Solutions Managed Services and Cloud Solutions are an opportunity for organisations to bring control to complexity by managing
To find out more about the role, please visit our website www.royalacademy.org.uk/careers
Director of Information Technology Operations Department Competitive The Royal Academy is in a period of significant transformation. As we approach our 250 th anniversary in 2018, major investments are
Get more value from virtualisation
Get more value from virtualisation Computacenter enables organisations to realise the full benefits of a virtual enterprise with integrated management tools and automated processes GET MORE VALUE FROM
Big Data Challenges in Bioinformatics
Big Data Challenges in Bioinformatics BARCELONA SUPERCOMPUTING CENTER COMPUTER SCIENCE DEPARTMENT Autonomic Systems and ebusiness Pla?orms Jordi Torres [email protected] Talk outline! We talk about Petabyte?
Welcome Address by the. State Secretary at the Federal Ministry of Education and Research. Dr Georg Schütte
Welcome Address by the State Secretary at the Federal Ministry of Education and Research Dr Georg Schütte at the "Systems Biology and Systems Medicine" session of the World Health Summit at the Federal
Integrated Rule-based Data Management System for Genome Sequencing Data
Integrated Rule-based Data Management System for Genome Sequencing Data A Research Data Management (RDM) Green Shoots Pilots Project Report by Michael Mueller, Simon Burbidge, Steven Lawlor and Jorge Ferrer
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE. October 2013
ENABLING DATA TRANSFER MANAGEMENT AND SHARING IN THE ERA OF GENOMIC MEDICINE October 2013 Introduction As sequencing technologies continue to evolve and genomic data makes its way into clinical use and
Big Data for health. Farr Institute, Administrative Data Research Centres, Medical Bioinformatics. 9 July 2015. Jacky Pallas, UCL
Big Data for health Farr Institute, Administrative Data Research Centres, Medical Bioinformatics 9 July 2015 Jacky Pallas, UCL Overview UK Research Council funding for big data in health, medical and administrative
Data Centric Computing Revisited
Piyush Chaudhary Technical Computing Solutions Data Centric Computing Revisited SPXXL/SCICOMP Summer 2013 Bottom line: It is a time of Powerful Information Data volume is on the rise Dimensions of data
Putting Genomes in the Cloud with WOS TM. ddn.com. DDN Whitepaper. Making data sharing faster, easier and more scalable
DDN Whitepaper Putting Genomes in the Cloud with WOS TM Making data sharing faster, easier and more scalable Table of Contents Cloud Computing 3 Build vs. Rent 4 Why WOS Fits the Cloud 4 Storing Sequences
Information and Communications Technology Strategy 2014-2017
Contents 1 Background ICT in Geoscience Australia... 2 1.1 Introduction... 2 1.2 Purpose... 2 1.3 Geoscience Australia and the Role of ICT... 2 1.4 Stakeholders... 4 2 Strategic drivers, vision and principles...
Big Data in BioMedical Sciences. Steven Newhouse, Head of Technical Services, EMBL-EBI
Big Data in BioMedical Sciences Steven Newhouse, Head of Technical Services, EMBL-EBI Big Data for BioMedical Sciences EMBL-EBI: What we do and why? Challenges & Opportunities Infrastructure Requirements
Biochemistry Major Talk 2014-15. Welcome!!!!!!!!!!!!!!
Biochemistry Major Talk 2014-15 August 14, 2015 Department of Biochemistry The University of Hong Kong Welcome!!!!!!!!!!!!!! Introduction to Biochemistry A four-minute video: http://www.youtube.com/watch?v=tpbamzq_pue&l
Request for Applications. Sharing Big Data for Health Care Innovation: Advancing the Objectives of the Global Alliance for Genomics and Health
1. Overview Request for Applications Sharing Big Data for Health Care Innovation: Advancing the Objectives of the Global Alliance for Genomics and Health In order for Canada to take full advantage of the
Head of Engineering Job Description
Head of Engineering Job Description (Job Code and Level: E006) Definition: Overall responsibility and accountability for the Engineering function across the UK which will include people and budgetary management.
Donna J. Dean, Ph.D. October 27, 2009 Brown University
Building Connections with NIH Program Officers: Myths and Realities Donna J. Dean, Ph.D. October 27, 2009 Brown University Funding Agencies Federal Agencies Focused on Biomedical Research s of Health (NIH)
IBM Global Services Foundation Human Capital Management
IBM Global Services Foundation Human Capital Management The Human Capital Touch 2 Foundation Human Capital Management The Human Capital touch Efficiency. Flexibility. Responsiveness. Three qualities any
Patient Centricity and the Changing Landscape of Healthcare
Patient Centricity and the Changing Landscape of Healthcare Andrea Cotter Director Healthcare Marketing IBM Corporation IBM Healthcare and Life Sciences Patient Centricity and the Changing Landscape of
Leveraging OpenStack Private Clouds
Leveraging OpenStack Private Clouds Robert Ronan Sr. Cloud Solutions Architect! [email protected]! LEVERAGING OPENSTACK - AGENDA OpenStack What is it? Benefits Leveraging OpenStack University
Science and Engineering Professional Framework
Contents: Introduction... 2 Who is the professional framework for?... 2 Using the science and engineering professional framework... 2 Summary of the Science and Engineering Professional Framework... 3
Validating Methods using Waters Empower TM 2 Method. Validation. Manager
Validating Methods using Waters Empower TM 2 Method Validation Manager (MVM) Anders Janesten Nordic Informatics Sales Specialist 2008 Waters Corporation Decision-centric centric method development information
WHAT IS (AND ISN T) INFORMATICS. Dr. Matthew Barnes, MD
WHAT IS (AND ISN T) INFORMATICS Dr. Matthew Barnes, MD The 19th century culture was defined by the novel, the 20th century culture was defined by the cinema, and the 21st century culture will be defined
Identity and Access Management Services. G-Cloud 7
Identity and Access Management Services G-Cloud 7 Who We Are Kainos is one of the longest standing independent digital technology companies in UK. We provide digital technology solutions that enable companies
Implementing SaaS CRM
White Paper IT Service Implementing SaaS CRM Just like water from the tap in your kitchen, cloud computing services can be turned on or off quickly as needed. Like at the water company, there is a team
ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V
ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V WHITE PAPER Create the Data Center of the Future Accelerate
Big Data for Population Health
Big Data for Population Health Prof Martin Landray Nuffield Department of Population Health Deputy Director, Big Data Institute, Li Ka Shing Centre for Health Information and Discovery University of Oxford
Business Development Manager. Grade: UCL Grade 8 Salary Range: 37,382-44,607 (excluding London Allowance of 2,834) Deputy Director, Administration
London Centre for Nanotechnology 17-19 Gordon Street London WC1H 0AH www.london-nano.com Job title: Business Development Manager Job reference number: 1374864 Grade: UCL Grade 8 Salary Range: 37,382-44,607
Head of CIO Office Information Services
Head of CIO Office Information Services Reporting to: Chief Information Officer Salary: Grade 6-47,787-57,031 per annum (pro rata) depending on skills and experience. Salary progression beyond this scale
Datacentre Studley. Dedicated managed environment for mission critical services. Six Degrees Group www.6dg.co.uk
Dedicated managed environment for mission critical services www.6dg.co.uk Our datacentres are the core of our business. At we own and manage 30,000 square feet of highly available, geographically diverse
Personalized Medicine and IT
Personalized Medicine and IT Data-driven Medicine in the Age of Genomics www.intel.com/healthcare/bigdata Ketan Paranjape General Manager, Life Sciences Intel Corp. @Portlandketan 1 The Central Dogma of
A Silver Lining in the Healthcare Cloud
A Silver Lining in the Healthcare Cloud Healthcare Innovations to prepare software solutions for Cloud Computing Author(s) John F. Ellingson, Partner, ProForma Healthcare Solutions, LLC Co-Sponsor Philip
HP S POINT OF VIEW TO CLOUD
HP S POINT OF VIEW TO CLOUD Frank Bloch Director Technology Consulting 2010 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice 3 GLOBAL MEGA
Data Science at the NIH Philip E. Bourne Ph.D. Associate Director for Data Science National Institutes of Health
Data Science at the NIH Philip E. Bourne Ph.D. Associate Director for Data Science National Institutes of Health Data Science Timeline 6/12 Findings: Sharing data & software through catalogs Support methods
Human Brain Project -
Human Brain Project - Scientific goals, Organization, Our role Wissenswerte, Bremen 26. Nov 2013 Prof. Sonja Grün Insitute of Neuroscience and Medicine (INM-6) & Institute for Advanced Simulations (IAS-6)
The HP IT Transformation Story
The HP IT Transformation Story Continued consolidation and infrastructure transformation impacts to the physical data center Dave Rotheroe, October, 2015 Why do data centers exist? Business Problem Application
BRISSkit: Biomedical Research Infrastructure Software Service kit. Jonathan Tedds. http://www.le.ac.uk/brisskit #brisskit #umfcloud
BRISSkit: Biomedical Research Infrastructure Software Service kit http://www.le.ac.uk/brisskit #brisskit #umfcloud Jonathan Tedds University of Leicester [email protected] @jtedds JISC University Modernisation
RED HAT CONTAINER STRATEGY
RED HAT CONTAINER STRATEGY An introduction to Atomic Enterprise Platform and OpenShift 3 Gavin McDougall Senior Solution Architect AGENDA Software disrupts business What are Containers? Misconceptions
The business owner s guide for replacing accounting software
The business owner s guide for replacing accounting software Replacing your accounting software is easier and more affordable than you may think. Use this guide to learn about the benefits of a modern
Big Workflow: More than Just Intelligent Workload Management for Big Data
Big Workflow: More than Just Intelligent Workload Management for Big Data Michael Feldman White Paper February 2014 EXECUTIVE SUMMARY Big data applications represent a fast-growing category of high-value
Expert Reference Series of White Papers. Understanding Data Centers and Cloud Computing
Expert Reference Series of White Papers Understanding Data Centers and Cloud Computing 1-800-COURSES www.globalknowledge.com Understanding Data Centers and Cloud Computing Paul Stryer, Global Knowledge
Why a Server Infrastructure Refresh Now and Why Dell?
Why a Server Infrastructure Refresh Now and Why Dell? In This Paper Outdated server infrastructure contributes to operating inefficiencies, lost productivity, and vulnerabilities Worse, existing infrastructure
ESG and Solvency II in the Cloud
Insights ESG and Solvency II in the Cloud In this article we look at how the model of cloud computing can be applied to high performance computing (HPC) applications. In particular it looks at economic
Objectives for today. Cloud Computing i det offentlige UK Public Sector G-Cloud, Applications Store & Data Centre Strategy
Cloud Computing i det offentlige UK Public Sector G-Cloud, Applications Store & Data Centre Strategy This is not just about technology. The main area of change, thus the major challenge, is how we as leaders
Sharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility
Sharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility Report of a meeting organized by the Wellcome Trust and held on 14 15 January 2003 at Fort Lauderdale,
The CRM 49 day success programme. Learn how to take control of your business destiny with our mentored CRM skills development service.
The CRM 49 day success programme Learn how to take control of your business destiny with our mentored CRM skills development service. Peter Clements, Microsoft Certified CRM Project Manager, Trainer &
Horizon 2020. Research e-infrastructures Excellence in Science Work Programme 2016-17. Wim Jansen. DG CONNECT European Commission
Horizon 2020 Research e-infrastructures Excellence in Science Work Programme 2016-17 Wim Jansen DG CONNECT European Commission 1 Before we start The material here presented has been compiled with great
3. Ensure the management of information is compliant with legislative requirements to maximise the benefits and minimise risks;
Enterprise Content Management (ECM) Policy Version Information A. Introduction Purpose 1. Outline and articulate the strategy for enterprise content management across Redland City Council (RCC). This document
GOVERNANCE MOVES BIG DATA FROM HYPE TO CONFIDENCE
GOVERNANCE MOVES BIG DATA FROM HYPE TO CONFIDENCE By Elliot King, Research Analyst Produced by Unisphere Research, a Division of Information Today, Inc. June 2014 Sponsored by 2 TABLE OF CONTENTS Introduction
Cloudbuz at Glance. How to take control of your File Transfers!
How to take control of your File Transfers! A MFT solution for ALL organisations! Cloudbuz is a MFT (Managed File Transfer) platform for organisations and businesses installed On-Premise or distributed
THe evolution of analytical lab InForMaTICs
Informatics Strategies for the Convergent Analytical Lab TECHNOLOGY REVIEW In many labs today, the drive to replace paper has begun pitting two systems against each other. The functionality in LIMS, which
A Guide to Horizon 2020 Funding for the Creative Industries
A Guide to Horizon 2020 Funding for the Creative Industries October 2014 Introduction This document is provided as a short guide to help you submit a proposal for the Horizon 2020 funding programme (H2020).
