Acceleration for Personalized Medicine Big Data Applications

Size: px
Start display at page:

Download "Acceleration for Personalized Medicine Big Data Applications"

Transcription

1 Acceleration for Personalized Medicine Big Data Applications Zaid Al-Ars Computer Engineering (CE) Lab Delft Data Science Delft University of Technology 1"

2 Introduction Definition & relevance Personalized medicine is the customization of healthcare - with medical decisions, practices, and products being tailored to the individual patient. Example of societally critical, highly-demanding big data application domains 2"

3 Introduction Scientific and societal challenges Exponentially growing data volumes Increasing complexity of analysis Both computational and data challenges 3"

4 Introduction Scientific and societal challenges Urgent clinical diagnostics, for example Targeted cancer & neo-natal diagnostics! We provide techniques to reduce compute time Cost prohibitive for society More patients & diseases to be treated! We provide techniques to reduce cost COMPUTE"COST " """"""""""""""""COMPUTE"TIME" 4"

5 Introduction Master class outline Introduction and background Field of personalized medicine Challenges and opportunities Relations to other big data fields Computational big data pipeline Stages of a typical personalized medicine pipeline Methods to reduce computation time Methods to reduce pipeline cost Solution demonstration 5"

6 Background Field of personalized medicine Vision: P4 medicine medicine that is predictive, preventive, personalized, and participatory 6"

7 Background Field of personalized medicine Sources of personalized information Measurements of vitals & body data Regular blood, spit, urine, etc. testing Genome data sequencing 7"

8 Background Field of personalized medicine Measurements of vitals & body data Pros Body is minutely and continuously monitored Corporate support from big industry Cons Use is not yet clear Health risks are not monitored! Not known if applications in health are possible 8"

9 Background Field of personalized medicine Regular blood, spit, urine, etc. testing Pros Measurement 100s of molecules in body Direct correlation to health risk Cons Still too expensive No specific health advice yet possible! Possible future use if cost becomes manageable 9"

10 Background Field of personalized medicine Genome data sequencing Pros Detailed knowledge genetic information Known markers to diagnose disease Cons Huge computational effort! Can be used today if computation effort becomes manageable 10"

11 Background DNA-based diagnostics 11"

12 Background DNA-based diagnostics DNA"muta<on"results"in" abnormal"cell"behavior" " Some"muta<ons"cause"cells" to"divide"without"control" causing"cancer" Cancer"can"be"diagnosed"by" iden<fying"which"muta<ons" are"in"the"dna"!cancer!diagnos-cs!is!main!use!for!dna!data!today! 12"

13 Background DNA-based diagnostics 13"

14 Big data pipeline Computational big data pipeline Three"main"stages" 1. Data"genera<on" Generate"and"store"DNA"data"using" specialized"compression"techniques" 2. Data"analysis" Accelerate"mapping"&"variant"calling" of"gene<c"algorithms"on"hardware" 3. Data"visualiza<on" Understand"the"analyzed"gene<c" data"to"make"clinical"decisions"for" the"pa<ent" GENERATE" Generate"and"store"DNA"data" using"specialized"compression" techniques" ANALYZE" Accelerate"mapping"&"variant" calling"of"gene<c"algorithms" on"hardware" INTERPRET" Understand"the"analyzed" gene<c"data"to"make"clinical" decisions"for"the"pa<ent" 14"

15 Big data pipeline Data generation " DNA processing passes in 3 stages # Sequence generation # Data analysis # Result interpretation " Sequence generation faces size bottlenecks 10^E7" 10^E6" GENERATE" Generate"and"store"DNA"data" using"specialized"compression" techniques" ANALYZE" Accelerate"mapping"&"variant" calling"of"gene<c"algorithms" on"hardware" 10^E5" 10^E4" 10^E3" 10^E2" 10^E1" 2003" 2004" 2005" 2006" 2007" 2008" 2009" 2010" 2011" Lincoln"D"Stein," The"case"for"cloud"compu<ng"in"genome" informa<cs,"genome"biology,"11:207,"2010." INTERPRET" Understand"the"analyzed" gene<c"data"to"make"clinical" decisions"for"the"pa<ent" 15"

16 Big data pipeline Data analysis " Growth of throughput of data generation is faster than growth in CPU processing capacity 10^E8" 10^E7" 10^E6" # Growth is exponential # Need for rapidly increasing processing capacity DNA"sequencing"(bp/day)" GENERATE" Generate"and"store"DNA"data" using"specialized"compression" techniques" ANALYZE" Accelerate"mapping"&"variant" calling"of"gene<c"algorithms" on"hardware" 10^E5" 10^E4" 10^E3" CPU"speed"(M"Inst./s)" INTERPRET" Understand"the"analyzed" gene<c"data"to"make"clinical" decisions"for"the"pa<ent" 10^E2" 2003" 2004" 2005" 2006" 2007" 2008" 2009" 2010" 2011" Po-Ru Loh, Michael Baym & Bonnie Berger, Compressive genomics, Nature Biotechnology, 30: , "

17 Big data pipeline Data interpretation " Relative cost of interpretation is increasing # Number of sequenced genomes increases # Cross referencing multiple genomes to identify correlations # Need for innovative DNA visualization " Sequence generation faces size bottlenecks 100%" 90%" 80%" 70%" 60%" 50%" 40%" 30%" 20%" 10%" 0%" 2012$ 2020$ Genotyping" 2012$ 2020$ Interpreta<on" Ingo"Helbig," Be"literate"when"the"exome"goes"clinical,"hcp:// channelopathist.net/,"june"6,"2012" GENERATE" Generate"and"store"DNA"data" using"specialized"compression" techniques" ANALYZE" Accelerate"mapping"&"variant" calling"of"gene<c"algorithms" on"hardware" INTERPRET" Understand"the"analyzed" gene<c"data"to"make"clinical" decisions"for"the"pa<ent" 17"

18 Big data pipeline 10^E8" 10^E7" 10^E6" Current solution " Current solution: increasing capacity in local or cloud clusters # Not always the best solution Growth"in"DNA"and"CPU"computa<onal"complexity" DNA"sequencing"(bp/day)" 10^E5" 10^E4" CPU"speed"(M"Inst./s)" 10^E3" 10^E2" 2003" 2004" 2005" 2006" 2007" 2008" 2009" 2010" 2011" Po-Ru Loh, Michael Baym & Bonnie Berger, Compressive genomics, Nature Biotechnology, 30: , "

19 Big data pipeline CE lab solution: compression " Domain specific compression # Enables high compression rate # Allows reduced infrastructure footprint " Possible transparent compression from and to file system 19"

20 Big data pipeline CE lab solution: acceleration " Hybrid core computing # Means using dedicated computing chips for specific algorithms # Next to traditional general-purpose CPUs (Intel processors) " Dedicated chips use FPGAs (field programmable gate arrays) like Xilinx " Recreate small compute elements on hardware " Can parallelize the computations tens of times " Becoming mainstream: used by Intel, IBM, Microsoft, Facebook, etc. 20"

21 Big data pipeline CE lab solution: acceleration " Compare and align nucleotide or protein sequences " Algorithm scores every possible alignment # Cell of matrix compares elements of query and database # Much parallelism, both within & between sequences 21"

22 Big data pipeline CE lab solution: distribution " Efficient utilization of available hardware resources # Less hardware is used for same algorithms " Tuning of hardware-software system to use case # More parallelism extracted from algorithms Task"P1" Task"S1" Task"S2" Task"S3" " Task"Sn" Task"S1" Task"S2" Task"S3" " Task"Sn" Task"S1" Task"S2" Task"S3" " Task"Sn" Task"P2" Task"P3" " Task"Pn" Task"P1" Task"P2" Task"P3" " Task"Pn" 22"

23 Big data pipeline CE lab solution: distribution " Higher Performance # 5x to 25x speed gains " Energy Saving # Up to 90% power reduction " Easy to use, program, manage # Standard Linux ecosystem # Transparent to the user " Well suited for Bioinformatics # Inherent parallelism exploited by pipelining # Small data types use logic efficiently 23"

24 Next steps Delft Data Science research agenda CE Lab provides a holistic approach to optimize big data infrastructure 1. Addressing big data storage limitations Effective compression techniques 2. Addressing big data computational time Acceleration of big data algorithms 3. Addressing big data system cost Effective utilization of system resources Storage"limita<ons" Computa<onal" boclenecks" Infrastructure"cost" op<miza<ons" 24"

25 Next steps Collaboration opportunities Collaborations on big data infrastructure Work together on industrially relevant challenges Transfer of expert knowledge to organizations CE Lab is leading research in Pipeline-wide performance optimization Integrated system cost optimization Large network of leading technology providers IBM, Intel, Altera, etc. 25"

26 Next steps Contact for further discussion Contact for further discussion on collaborations or question/feedback Zaid Al-Ars CE Lab / TUDelft Mekelweg 4, 2628 CD Delft Web: ce.ewi.tudelft.nl/zaid Tel: "

27 Next steps Future prospects Genetic analysis has significant potential Personalized medicine Preemptive intervention Trait selection and enhancement Etc. Early detection & cure of diabetes w/ ipop 60 TeraB data 27"

28 Next steps Solution demonstration 28"

DNA Mapping/Alignment. Team: I Thought You GNU? Lars Olsen, Venkata Aditya Kovuri, Nick Merowsky

DNA Mapping/Alignment. Team: I Thought You GNU? Lars Olsen, Venkata Aditya Kovuri, Nick Merowsky DNA Mapping/Alignment Team: I Thought You GNU? Lars Olsen, Venkata Aditya Kovuri, Nick Merowsky Overview Summary Research Paper 1 Research Paper 2 Research Paper 3 Current Progress Software Designs to

More information

P4 Medicine: Personalized, Predictive, Preventive, Participatory A Change of View that Changes Everything

P4 Medicine: Personalized, Predictive, Preventive, Participatory A Change of View that Changes Everything P4 Medicine: Personalized, Predictive, Preventive, Participatory A Change of View that Changes Everything Leroy E. Hood Institute for Systems Biology David J. Galas Battelle Memorial Institute Version

More information

Personalized medicine in China s healthcare system

Personalized medicine in China s healthcare system Personalized medicine in China s healthcare system Jingmin Kan, Sam Linsen Netherlands office for Science and Technology, Guangzhou and Shanghai, China Content PERSONALIZED MEDICINE 2 FOCUS AT THE INDIVIDUAL

More information

HIV NOMOGRAM USING BIG DATA ANALYTICS

HIV NOMOGRAM USING BIG DATA ANALYTICS HIV NOMOGRAM USING BIG DATA ANALYTICS S.Avudaiselvi and P.Tamizhchelvi Student Of Ayya Nadar Janaki Ammal College (Sivakasi) Head Of The Department Of Computer Science, Ayya Nadar Janaki Ammal College

More information

Infrastructure Matters: POWER8 vs. Xeon x86

Infrastructure Matters: POWER8 vs. Xeon x86 Advisory Infrastructure Matters: POWER8 vs. Xeon x86 Executive Summary This report compares IBM s new POWER8-based scale-out Power System to Intel E5 v2 x86- based scale-out systems. A follow-on report

More information

Personalized Medicine and IT

Personalized Medicine and IT Personalized Medicine and IT Data-driven Medicine in the Age of Genomics www.intel.com/healthcare/bigdata Ketan Paranjape General Manager, Life Sciences Intel Corp. @Portlandketan 1 The Central Dogma of

More information

Attacking the Biobank Bottleneck

Attacking the Biobank Bottleneck Attacking the Biobank Bottleneck Professor Jan-Eric Litton BBMRI-ERIC BBMRI-ERIC Big Data meets research biobanking Big data is high-volume, high-velocity and highvariety information assets that demand

More information

Moving Beyond CPUs in the Cloud: Will FPGAs Sink or Swim?

Moving Beyond CPUs in the Cloud: Will FPGAs Sink or Swim? Moving Beyond CPUs in the Cloud: Will FPGAs Sink or Swim? Successful FPGA datacenter usage at scale will require differentiated capability, programming ease, and scalable implementation models Executive

More information

Extending the Power of FPGAs. Salil Raje, Xilinx

Extending the Power of FPGAs. Salil Raje, Xilinx Extending the Power of FPGAs Salil Raje, Xilinx Extending the Power of FPGAs The Journey has Begun Salil Raje Xilinx Corporate Vice President Software and IP Products Development Agenda The Evolution of

More information

SAP HANA Enabling Genome Analysis

SAP HANA Enabling Genome Analysis SAP HANA Enabling Genome Analysis Joanna L. Kelley, PhD Postdoctoral Scholar, Stanford University Enakshi Singh, MSc HANA Product Management, SAP Labs LLC Outline Use cases Genomics review Challenges in

More information

14.3 Studying the Human Genome

14.3 Studying the Human Genome 14.3 Studying the Human Genome Lesson Objectives Summarize the methods of DNA analysis. State the goals of the Human Genome Project and explain what we have learned so far. Lesson Summary Manipulating

More information

Powering Cutting Edge Research in Life Sciences with High Performance Computing

Powering Cutting Edge Research in Life Sciences with High Performance Computing A Point of View Powering Cutting Edge Research in Life Sciences with High Performance Computing High performance computing (HPC) is the foundation of pioneering research in life sciences. HPC plays a vital

More information

Big Data Challenges in Bioinformatics

Big Data Challenges in Bioinformatics Big Data Challenges in Bioinformatics BARCELONA SUPERCOMPUTING CENTER COMPUTER SCIENCE DEPARTMENT Autonomic Systems and ebusiness Pla?orms Jordi Torres Jordi.Torres@bsc.es Talk outline! We talk about Petabyte?

More information

Big Data Science. Prof.dr.ir. Geert-Jan Houben. TU Delft Web Information Systems Delft Data Science KIVI chair Big Data Science

Big Data Science. Prof.dr.ir. Geert-Jan Houben. TU Delft Web Information Systems Delft Data Science KIVI chair Big Data Science Big Data Science Prof.dr.ir. Geert-Jan Houben TU Delft Web Information Systems Delft Data Science KIVI chair Big Data Science 1 big data: it s there, it s important it is interesting to study it, to understand

More information

3. NUMBER OF PARTICIPANTS TO BE ENROLLED

3. NUMBER OF PARTICIPANTS TO BE ENROLLED 3. COMPUTER 1. Purpose of the course Refer to each sub-course. 2. Training program (1)General Orientation and Japanese Language Program The General Orientation and Japanese Program are organized at the

More information

Genetic testing. The difference diagnostics can make. The British In Vitro Diagnostics Association

Genetic testing. The difference diagnostics can make. The British In Vitro Diagnostics Association 6 Genetic testing The difference diagnostics can make The British In Vitro Diagnostics Association Genetic INTRODUCTION testing The Department of Health published Our Inheritance, Our Future - Realising

More information

IBM Deep Computing Visualization Offering

IBM Deep Computing Visualization Offering P - 271 IBM Deep Computing Visualization Offering Parijat Sharma, Infrastructure Solution Architect, IBM India Pvt Ltd. email: parijatsharma@in.ibm.com Summary Deep Computing Visualization in Oil & Gas

More information

Outline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging

Outline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging Outline High Performance Computing (HPC) Towards exascale computing: a brief history Challenges in the exascale era Big Data meets HPC Some facts about Big Data Technologies HPC and Big Data converging

More information

Integrating Bioinformatics, Medical Sciences and Drug Discovery

Integrating Bioinformatics, Medical Sciences and Drug Discovery Integrating Bioinformatics, Medical Sciences and Drug Discovery M. Madan Babu Centre for Biotechnology, Anna University, Chennai - 600025 phone: 44-4332179 :: email: madanm1@rediffmail.com Bioinformatics

More information

Integration of Genetic and Familial Data into. Electronic Medical Records and Healthcare Processes

Integration of Genetic and Familial Data into. Electronic Medical Records and Healthcare Processes Integration of Genetic and Familial Data into Electronic Medical Records and Healthcare Processes By Thomas Kmiecik and Dale Sanders February 2, 2009 Introduction Although our health is certainly impacted

More information

Xeon+FPGA Platform for the Data Center

Xeon+FPGA Platform for the Data Center Xeon+FPGA Platform for the Data Center ISCA/CARL 2015 PK Gupta, Director of Cloud Platform Technology, DCG/CPG Overview Data Center and Workloads Xeon+FPGA Accelerator Platform Applications and Eco-system

More information

Big Data Challenges. technology basics for data scientists. Spring - 2014. Jordi Torres, UPC - BSC www.jorditorres.

Big Data Challenges. technology basics for data scientists. Spring - 2014. Jordi Torres, UPC - BSC www.jorditorres. Big Data Challenges technology basics for data scientists Spring - 2014 Jordi Torres, UPC - BSC www.jorditorres.eu @JordiTorresBCN Data Deluge: Due to the changes in big data generation Example: Biomedicine

More information

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16 Course Director: Dr. Barry Grant (DCM&B, bjgrant@med.umich.edu) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems

More information

UF EDGE brings the classroom to you with online, worldwide course delivery!

UF EDGE brings the classroom to you with online, worldwide course delivery! What is the University of Florida EDGE Program? EDGE enables engineering professional, military members, and students worldwide to participate in courses, certificates, and degree programs from the UF

More information

White Paper. Version 1.2 May 2015 RAID Incorporated

White Paper. Version 1.2 May 2015 RAID Incorporated White Paper Version 1.2 May 2015 RAID Incorporated Introduction The abundance of Big Data, structured, partially-structured and unstructured massive datasets, which are too large to be processed effectively

More information

2019 Healthcare That Works for All

2019 Healthcare That Works for All 2019 Healthcare That Works for All This paper is one of a series describing what a decade of successful change in healthcare could look like in 2019. Each paper focuses on one aspect of healthcare. To

More information

Analysis of GPU Parallel Computing based on Matlab

Analysis of GPU Parallel Computing based on Matlab Analysis of GPU Parallel Computing based on Matlab Mingzhe Wang, Bo Wang, Qiu He, Xiuxiu Liu, Kunshuai Zhu (School of Computer and Control Engineering, University of Chinese Academy of Sciences, Huairou,

More information

Big Data Analytics for Healthcare

Big Data Analytics for Healthcare Big Data Analytics for Healthcare Jimeng Sun Chandan K. Reddy Healthcare Analytics Department IBM TJ Watson Research Center Department of Computer Science Wayne State University 1 Healthcare Analytics

More information

GPU File System Encryption Kartik Kulkarni and Eugene Linkov

GPU File System Encryption Kartik Kulkarni and Eugene Linkov GPU File System Encryption Kartik Kulkarni and Eugene Linkov 5/10/2012 SUMMARY. We implemented a file system that encrypts and decrypts files. The implementation uses the AES algorithm computed through

More information

An FPGA Acceleration of Short Read Human Genome Mapping

An FPGA Acceleration of Short Read Human Genome Mapping An FPGA Acceleration of Short Read Human Genome Mapping Corey Bruce Olson A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Electrical Engineering University

More information

Cloud-Based Big Data Analytics in Bioinformatics

Cloud-Based Big Data Analytics in Bioinformatics Cloud-Based Big Data Analytics in Bioinformatics Presented By Cephas Mawere Harare Institute of Technology, Zimbabwe 1 Introduction 2 Big Data Analytics Big Data are a collection of data sets so large

More information

High Performance Compu2ng Facility

High Performance Compu2ng Facility High Performance Compu2ng Facility Center for Health Informa2cs and Bioinforma2cs Accelera2ng Scien2fic Discovery and Innova2on in Biomedical Research at NYULMC through Advanced Compu2ng Efstra'os Efstathiadis,

More information

DELL s Oracle Database Advisor

DELL s Oracle Database Advisor DELL s Oracle Database Advisor Underlying Methodology A Dell Technical White Paper Database Solutions Engineering By Roger Lopez Phani MV Dell Product Group January 2010 THIS WHITE PAPER IS FOR INFORMATIONAL

More information

Lesson 7: SYSTEM-ON. SoC) AND USE OF VLSI CIRCUIT DESIGN TECHNOLOGY. Chapter-1L07: "Embedded Systems - ", Raj Kamal, Publs.: McGraw-Hill Education

Lesson 7: SYSTEM-ON. SoC) AND USE OF VLSI CIRCUIT DESIGN TECHNOLOGY. Chapter-1L07: Embedded Systems - , Raj Kamal, Publs.: McGraw-Hill Education Lesson 7: SYSTEM-ON ON-CHIP (SoC( SoC) AND USE OF VLSI CIRCUIT DESIGN TECHNOLOGY 1 VLSI chip Integration of high-level components Possess gate-level sophistication in circuits above that of the counter,

More information

ebook Utilizing MapReduce to address Big Data Enterprise Needs Leveraging Big Data to shorten drug development cycles in Pharmaceutical industry.

ebook Utilizing MapReduce to address Big Data Enterprise Needs Leveraging Big Data to shorten drug development cycles in Pharmaceutical industry. Utilizing MapReduce to address Big Data Enterprise Needs Leveraging Big Data to shorten drug development cycles in Pharmaceutical industry. www.persistent.com 3 4 5 5 7 9 10 11 12 13 From the Vantage Point

More information

Data deluge (and it s applications) Gianluigi Zanetti. Data deluge. (and its applications) Gianluigi Zanetti

Data deluge (and it s applications) Gianluigi Zanetti. Data deluge. (and its applications) Gianluigi Zanetti Data deluge (and its applications) Prologue Data is becoming cheaper and cheaper to produce and store Driving mechanism is parallelism on sensors, storage, computing Data directly produced are complex

More information

Big Data Trends A Basis for Personalized Medicine

Big Data Trends A Basis for Personalized Medicine Big Data Trends A Basis for Personalized Medicine Dr. Hellmuth Broda, Principal Technology Architect emedikation: Verordnung, Support Prozesse & Logistik 5. Juni, 2013, Inselspital Bern Over 150,000 Employees

More information

Data Center and Cloud Computing Market Landscape and Challenges

Data Center and Cloud Computing Market Landscape and Challenges Data Center and Cloud Computing Market Landscape and Challenges Manoj Roge, Director Wired & Data Center Solutions Xilinx Inc. #OpenPOWERSummit 1 Outline Data Center Trends Technology Challenges Solution

More information

An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives

An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives Chalapathy Neti, Ph.D. Associate Director, Healthcare Transformation, Shahram Ebadollahi, Ph.D. Research Staff Memeber IBM Research,

More information

Manufacturing CUSTOM CHEMICALS AND SERVICES, SUPPORTING SCIENTIFIC ADVANCES FOR HUMAN HEALTH

Manufacturing CUSTOM CHEMICALS AND SERVICES, SUPPORTING SCIENTIFIC ADVANCES FOR HUMAN HEALTH Manufacturing CUSTOM CHEMICALS AND SERVICES, SUPPORTING SCIENTIFIC ADVANCES FOR HUMAN HEALTH VWR enables the advancement of science by providing high-quality chemicals and services, customized to your

More information

SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE

SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE AP Biology Date SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE LEARNING OBJECTIVES Students will gain an appreciation of the physical effects of sickle cell anemia, its prevalence in the population,

More information

THE NEXT FRONTIER IN COMPUTING QUANTUM OPTICAL COMPUTING. Presentation For Venture Capital Investment

THE NEXT FRONTIER IN COMPUTING QUANTUM OPTICAL COMPUTING. Presentation For Venture Capital Investment THE NEXT FRONTIER IN COMPUTING QUANTUM OPTICAL COMPUTING Presentation For Venture Capital Investment Dr. Brian Antao, CEO and Founder tundrasystems.eu 1 OPTICONDUCTORS: THE FUTURE OF SEMICONDUCTORS Mission:

More information

Health Informatics Research and Development in Europe

Health Informatics Research and Development in Europe Workshop on Health Informatics Research and Development, London, 18 July 2002 Health Informatics Research and Development in Europe Sofie Nørager European Commission Information Society Technologies Program

More information

The National Institute of Genomic Medicine (INMEGEN) was

The National Institute of Genomic Medicine (INMEGEN) was Genome is...... the complete set of genetic information contained within all of the chromosomes of an organism. It defines the particular phenotype of an individual. What is Genomics? The study of the

More information

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov

Search and Data Mining: Techniques. Applications Anya Yarygina Boris Novikov Search and Data Mining: Techniques Applications Anya Yarygina Boris Novikov Introduction Data mining applications Data mining system products and research prototypes Additional themes on data mining Social

More information

Impact of Big Data growth On Transparent Computing

Impact of Big Data growth On Transparent Computing Impact of Big Data growth On Transparent Computing Michael A. Greene Intel Vice President, Software and Services Group, General Manager, System Technologies and Optimization 1 Transparent Computing (TC)

More information

Big Data Analytics and Healthcare

Big Data Analytics and Healthcare Big Data Analytics and Healthcare Anup Kumar, Professor and Director of MINDS Lab Computer Engineering and Computer Science Department University of Louisville Road Map Introduction Data Sources Structured

More information

Reconfigurable Architecture Requirements for Co-Designed Virtual Machines

Reconfigurable Architecture Requirements for Co-Designed Virtual Machines Reconfigurable Architecture Requirements for Co-Designed Virtual Machines Kenneth B. Kent University of New Brunswick Faculty of Computer Science Fredericton, New Brunswick, Canada ken@unb.ca Micaela Serra

More information

FPGA Acceleration using OpenCL & PCIe Accelerators MEW 25

FPGA Acceleration using OpenCL & PCIe Accelerators MEW 25 FPGA Acceleration using OpenCL & PCIe Accelerators MEW 25 December 2014 FPGAs in the news» Catapult» Accelerate BING» 2x search acceleration:» ½ the number of servers»

More information

FPGA Accelerator Virtualization in an OpenPOWER cloud. Fei Chen, Yonghua Lin IBM China Research Lab

FPGA Accelerator Virtualization in an OpenPOWER cloud. Fei Chen, Yonghua Lin IBM China Research Lab FPGA Accelerator Virtualization in an OpenPOWER cloud Fei Chen, Yonghua Lin IBM China Research Lab Trend of Acceleration Technology Acceleration in Cloud is Taking Off Used FPGA to accelerate Bing search

More information

Hardware and Software

Hardware and Software Hardware and Software 1 Hardware and Software: A complete design Hardware and software support each other Sometimes it is necessary to shift functions from software to hardware or the other way around

More information

Big Data and the Data Lake. February 2015

Big Data and the Data Lake. February 2015 Big Data and the Data Lake February 2015 My Vision: Our Mission Data Intelligence is a broad term that describes the real, meaningful insights that can be extracted from your data truths that you can act

More information

Importance of Statistics in creating high dimensional data

Importance of Statistics in creating high dimensional data Importance of Statistics in creating high dimensional data Hemant K. Tiwari, PhD Section on Statistical Genetics Department of Biostatistics University of Alabama at Birmingham History of Genomic Data

More information

Seeking Opportunities for Hardware Acceleration in Big Data Analytics

Seeking Opportunities for Hardware Acceleration in Big Data Analytics Seeking Opportunities for Hardware Acceleration in Big Data Analytics Paul Chow High-Performance Reconfigurable Computing Group Department of Electrical and Computer Engineering University of Toronto Who

More information

Preparing the scenario for the use of patient s genome sequences in clinic. Joaquín Dopazo

Preparing the scenario for the use of patient s genome sequences in clinic. Joaquín Dopazo Preparing the scenario for the use of patient s genome sequences in clinic Joaquín Dopazo Computational Medicine Institute, Centro de Investigación Príncipe Felipe (CIPF), Functional Genomics Node, (INB),

More information

Medical Certification: Bringing genomic microcores to clinical use OI- VF- WP- 011

Medical Certification: Bringing genomic microcores to clinical use OI- VF- WP- 011 Medical Certification: Bringing genomic microcores to clinical use VoltedFlow GmbH Industriestrasse 23, 6055, Alpnach Dorf, Switzerland 1 Summary VoltedFlow has developed solutions to vastly speed up DNA

More information

Emerging storage and HPC technologies to accelerate big data analytics Jerome Gaysse JG Consulting

Emerging storage and HPC technologies to accelerate big data analytics Jerome Gaysse JG Consulting Emerging storage and HPC technologies to accelerate big data analytics Jerome Gaysse JG Consulting Introduction Big Data Analytics needs: Low latency data access Fast computing Power efficiency Latest

More information

Concept and Project Objectives

Concept and Project Objectives 3.1 Publishable summary Concept and Project Objectives Proactive and dynamic QoS management, network intrusion detection and early detection of network congestion problems among other applications in the

More information

Next Generation Sequencing: Technology, Mapping, and Analysis

Next Generation Sequencing: Technology, Mapping, and Analysis Next Generation Sequencing: Technology, Mapping, and Analysis Gary Benson Computer Science, Biology, Bioinformatics Boston University gbenson@bu.edu http://tandem.bu.edu/ The Human Genome Project took

More information

Managing and Conducting Biomedical Research on the Cloud Prasad Patil

Managing and Conducting Biomedical Research on the Cloud Prasad Patil Managing and Conducting Biomedical Research on the Cloud Prasad Patil Laboratory for Personalized Medicine Center for Biomedical Informatics Harvard Medical School SaaS & PaaS gmail google docs app engine

More information

Scientific and Technical Applications as a Service in the Cloud

Scientific and Technical Applications as a Service in the Cloud Scientific and Technical Applications as a Service in the Cloud University of Bern, 28.11.2011 adapted version Wibke Sudholt CloudBroker GmbH Technoparkstrasse 1, CH-8005 Zurich, Switzerland Phone: +41

More information

FPGA-based MapReduce Framework for Machine Learning

FPGA-based MapReduce Framework for Machine Learning FPGA-based MapReduce Framework for Machine Learning Bo WANG 1, Yi SHAN 1, Jing YAN 2, Yu WANG 1, Ningyi XU 2, Huangzhong YANG 1 1 Department of Electronic Engineering Tsinghua University, Beijing, China

More information

IOS110. Virtualization 5/27/2014 1

IOS110. Virtualization 5/27/2014 1 IOS110 Virtualization 5/27/2014 1 Agenda What is Virtualization? Types of Virtualization. Advantages and Disadvantages. Virtualization software Hyper V What is Virtualization? Virtualization Refers to

More information

Bio-Informatics Lectures. A Short Introduction

Bio-Informatics Lectures. A Short Introduction Bio-Informatics Lectures A Short Introduction The History of Bioinformatics Sanger Sequencing PCR in presence of fluorescent, chain-terminating dideoxynucleotides Massively Parallel Sequencing Massively

More information

Bricata Next Generation Intrusion Prevention System A New, Evolved Breed of Threat Mitigation

Bricata Next Generation Intrusion Prevention System A New, Evolved Breed of Threat Mitigation Bricata Next Generation Intrusion Prevention System A New, Evolved Breed of Threat Mitigation Iain Davison Chief Technology Officer Bricata, LLC WWW.BRICATA.COM The Need for Multi-Threaded, Multi-Core

More information

Bioinformatics Grid - Enabled Tools For Biologists.

Bioinformatics Grid - Enabled Tools For Biologists. Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis

More information

History of DNA Sequencing & Current Applications

History of DNA Sequencing & Current Applications History of DNA Sequencing & Current Applications Christopher McLeod President & CEO, 454 Life Sciences, A Roche Company IMPORTANT NOTICE Intended Use Unless explicitly stated otherwise, all Roche Applied

More information

Big Data Analytics Driving Healthcare Transformation

Big Data Analytics Driving Healthcare Transformation Big Data Analytics Driving Healthcare Transformation Greg Caressi SVP Healthcare & Life Sciences November, 2014 Six Big Themes for the New Healthcare Economy Themes Modernizing Care Delivery Clinical practice

More information

RevoScaleR Speed and Scalability

RevoScaleR Speed and Scalability EXECUTIVE WHITE PAPER RevoScaleR Speed and Scalability By Lee Edlefsen Ph.D., Chief Scientist, Revolution Analytics Abstract RevoScaleR, the Big Data predictive analytics library included with Revolution

More information

Stream Processing on GPUs Using Distributed Multimedia Middleware

Stream Processing on GPUs Using Distributed Multimedia Middleware Stream Processing on GPUs Using Distributed Multimedia Middleware Michael Repplinger 1,2, and Philipp Slusallek 1,2 1 Computer Graphics Lab, Saarland University, Saarbrücken, Germany 2 German Research

More information

Scheduling. Scheduling. Scheduling levels. Decision to switch the running process can take place under the following circumstances:

Scheduling. Scheduling. Scheduling levels. Decision to switch the running process can take place under the following circumstances: Scheduling Scheduling Scheduling levels Long-term scheduling. Selects which jobs shall be allowed to enter the system. Only used in batch systems. Medium-term scheduling. Performs swapin-swapout operations

More information

EMBL Identity & Access Management

EMBL Identity & Access Management EMBL Identity & Access Management Rupert Lück EMBL Heidelberg e IRG Workshop Zürich Apr 24th 2008 Outline EMBL Overview Identity & Access Management for EMBL IT Requirements & Strategy Project Goal and

More information

Genetic diagnostics the gateway to personalized medicine

Genetic diagnostics the gateway to personalized medicine Micronova 20.11.2012 Genetic diagnostics the gateway to personalized medicine Kristiina Assoc. professor, Director of Genetic Department HUSLAB, Helsinki University Central Hospital The Human Genome Packed

More information

SGI. High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems. January, 2012. Abstract. Haruna Cofer*, PhD

SGI. High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems. January, 2012. Abstract. Haruna Cofer*, PhD White Paper SGI High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems Haruna Cofer*, PhD January, 2012 Abstract The SGI High Throughput Computing (HTC) Wrapper

More information

Chapter 20: Biotechnology: DNA Technology & Genomics

Chapter 20: Biotechnology: DNA Technology & Genomics Biotechnology Chapter 20: Biotechnology: DNA Technology & Genomics The BIG Questions How can we use our knowledge of DNA to: o Diagnose disease or defect? o Cure disease or defect? o Change/improve organisms?

More information

The Scientific Data Mining Process

The Scientific Data Mining Process Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In

More information

Continuing the MDM journey

Continuing the MDM journey IBM Software White paper Information Management Continuing the MDM journey Extending from a virtual style to a physical style for master data management 2 Continuing the MDM journey Organizations implement

More information

BIOSCIENCES COURSE TITLE AWARD

BIOSCIENCES COURSE TITLE AWARD COURSE TITLE AWARD BIOSCIENCES As a Biosciences undergraduate student at the University of Westminster, you will benefit from some of the best teaching and facilities available. Our courses combine lecture,

More information

Model-based system-on-chip design on Altera and Xilinx platforms

Model-based system-on-chip design on Altera and Xilinx platforms CO-DEVELOPMENT MANUFACTURING INNOVATION & SUPPORT Model-based system-on-chip design on Altera and Xilinx platforms Ronald Grootelaar, System Architect RJA.Grootelaar@3t.nl Agenda 3T Company profile Technology

More information

CURRICULUM GUIDE 2007-2008 COMPUTER SCIENCE CERTIFICATES OF COMPLETION

CURRICULUM GUIDE 2007-2008 COMPUTER SCIENCE CERTIFICATES OF COMPLETION CURRICULUM GUIDE 2007-2008 COMPUTER SCIENCE CERTIFICATES OF COMPLETION Including Cisco Certified Network Associate; Computer and Information Literacy; Computer Applications in Biotechnology; Computer Programming;

More information

Putting IBM Watson to Work In Healthcare

Putting IBM Watson to Work In Healthcare Martin S. Kohn, MD, MS, FACEP, FACPE Chief Medical Scientist, Care Delivery Systems IBM Research marty.kohn@us.ibm.com Putting IBM Watson to Work In Healthcare 2 SB 1275 Medical data in an electronic or

More information

Regulatory Issues in Genetic Testing and Targeted Drug Development

Regulatory Issues in Genetic Testing and Targeted Drug Development Regulatory Issues in Genetic Testing and Targeted Drug Development Janet Woodcock, M.D. Deputy Commissioner for Operations Food and Drug Administration October 12, 2006 Genetic and Genomic Tests are Types

More information

Contents. Preface Acknowledgements. Chapter 1 Introduction 1.1

Contents. Preface Acknowledgements. Chapter 1 Introduction 1.1 Preface xi Acknowledgements xv Chapter 1 Introduction 1.1 1.1 Cloud Computing at a Glance 1.1 1.1.1 The Vision of Cloud Computing 1.2 1.1.2 Defining a Cloud 1.4 1.1.3 A Closer Look 1.6 1.1.4 Cloud Computing

More information

Embedded System Hardware - Processing (Part II)

Embedded System Hardware - Processing (Part II) 12 Embedded System Hardware - Processing (Part II) Jian-Jia Chen (Slides are based on Peter Marwedel) Informatik 12 TU Dortmund Germany Springer, 2010 2014 年 11 月 11 日 These slides use Microsoft clip arts.

More information

high-performance computing so you can move your enterprise forward

high-performance computing so you can move your enterprise forward Whether targeted to HPC or embedded applications, Pico Computing s modular and highly-scalable architecture, based on Field Programmable Gate Array (FPGA) technologies, brings orders-of-magnitude performance

More information

GYAN VIHAR SCHOOL OF ENGINEERING & TECHNOLOGY M. TECH. CSE (2 YEARS PROGRAM)

GYAN VIHAR SCHOOL OF ENGINEERING & TECHNOLOGY M. TECH. CSE (2 YEARS PROGRAM) GYAN VIHAR SCHOOL OF ENGINEERING & TECHNOLOGY M. TECH. CSE (2 YEARS PROGRAM) Need, objectives and main features of the Match. (CSE) Curriculum The main objective of the program is to develop manpower for

More information

Parallel Compression and Decompression of DNA Sequence Reads in FASTQ Format

Parallel Compression and Decompression of DNA Sequence Reads in FASTQ Format , pp.91-100 http://dx.doi.org/10.14257/ijhit.2014.7.4.09 Parallel Compression and Decompression of DNA Sequence Reads in FASTQ Format Jingjing Zheng 1,* and Ting Wang 1, 2 1,* Parallel Software and Computational

More information

IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS

IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS IMPLEMENTING BIG DATA IN TODAY S HEALTH CARE PRAXIS: A CONUNDRUM TO PATIENTS, CAREGIVERS AND OTHER STAKEHOLDERS - WHAT IS THE VALUE AND WHO PAYS 29 OCTOBER 2015 DR. DIRK J. EVERS BACKGROUND TreatmentMAP

More information

Digital Catapult. The impact of Big Data in a Connected Digital Economy Future of Healthcare. Mark Wall Big Data & Analytics Leader.

Digital Catapult. The impact of Big Data in a Connected Digital Economy Future of Healthcare. Mark Wall Big Data & Analytics Leader. 1 Digital Catapult The impact of Big Data in a Connected Digital Economy Future of Healthcare Mark Wall Big Data & Analytics Leader March 12 2014 Catapult is a Technology Strategy Board programme Agenda

More information

Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik

Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik Leading Genomics Diagnostic harma Discove Collab Shanghai Cambridge, MA Reykjavik Global leadership for using the genome to create better medicine WuXi NextCODE provides a uniquely proven and integrated

More information

DataSafe Solutions. Protect your valuable genomic data

DataSafe Solutions. Protect your valuable genomic data DataSafe Solutions Protect your valuable genomic data Central and secure storage of next-generation sequencing (NGS) data is critical to the success of your organization. The ability to store and protect

More information

Optimising the resource utilisation in high-speed network intrusion detection systems.

Optimising the resource utilisation in high-speed network intrusion detection systems. Optimising the resource utilisation in high-speed network intrusion detection systems. Gerald Tripp www.kent.ac.uk Network intrusion detection Network intrusion detection systems are provided to detect

More information

A Low Latency Library in FPGA Hardware for High Frequency Trading (HFT)

A Low Latency Library in FPGA Hardware for High Frequency Trading (HFT) A Low Latency Library in FPGA Hardware for High Frequency Trading (HFT) John W. Lockwood, Adwait Gupte, Nishit Mehta (Algo-Logic Systems) Michaela Blott, Tom English, Kees Vissers (Xilinx) August 22, 2012,

More information

Introduction to Cloud Computing

Introduction to Cloud Computing Introduction to Cloud Overview and Introduction 15 319, spring 2010 1 st Lecture, Jan 12 th Majd F. Sakr 15-319 Introduction to Cloud Why take 15 319? Because you re cool! Because we re cool! Gain real

More information

Globus Striped GridFTP Framework and Server. Raj Kettimuthu, ANL and U. Chicago

Globus Striped GridFTP Framework and Server. Raj Kettimuthu, ANL and U. Chicago Globus Striped GridFTP Framework and Server Raj Kettimuthu, ANL and U. Chicago Outline Introduction Features Motivation Architecture Globus XIO Experimental Results 3 August 2005 The Ohio State University

More information

REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf])

REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) 820 REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) (See also General Regulations) BMS1 Admission to the Degree To be eligible for admission to the degree of Bachelor

More information

Smarter Healthcare@IBM Research. Joseph M. Jasinski, Ph.D. Distinguished Engineer IBM Research

Smarter Healthcare@IBM Research. Joseph M. Jasinski, Ph.D. Distinguished Engineer IBM Research Smarter Healthcare@IBM Research Joseph M. Jasinski, Ph.D. Distinguished Engineer IBM Research Our researchers work on a wide spectrum of topics Basic Science Industry specific innovation Nanotechnology

More information

Novel Mining of Cancer via Mutation in Tumor Protein P53 using Quick Propagation Network

Novel Mining of Cancer via Mutation in Tumor Protein P53 using Quick Propagation Network Novel Mining of Cancer via Mutation in Tumor Protein P53 using Quick Propagation Network Ayad. Ghany Ismaeel, and Raghad. Zuhair Yousif Abstract There is multiple databases contain datasets of TP53 gene

More information