On the need for intelligent access to big data in life sciences
Transcription
1 On the need for intelligent access to big data in life sciences The experience Georgios Paliouras, NCSR Demokritos
2-3 Life sciences then and now
4 These data are too big for me! By Anderson for
5 Data growth is exponential. Number of articles indexed by MEDLINE (PubMed) per year. Source: Southan and Cameron, "Beyond the tsunami: developing the infrastructure to deal with life sciences data", The Fourth Paradigm: Data-Intensive Scientific Discovery, Microsoft Corp., 2009.
6 Data is a source of knowledge. Gillam et al., "The healthcare singularity and the age of semantic medicine", The Fourth Paradigm: Data-Intensive Scientific Discovery, Microsoft Corp., 2009.
7 One just needs to make the connection
The Swanson case: fish oil and Raynaud's syndrome.
Public knowledge since 1975: Raynaud's syndrome is associated with high blood viscosity, platelet aggregability and vasoconstriction.
Public knowledge since 1984: fish oil leads to reductions in blood lipids, platelet aggregability, blood viscosity and vascular reactivity.
Swanson puts the two together in 1986: can dietary fish oil ameliorate or prevent Raynaud's syndrome? He supports his hypothesis with relevant literature.
DiGiacomo subsequently confirms the hypothesis.
Vision: create big data machinery that helps produce and support more such cases.
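The connection Swanson made can be read as a set intersection: two bodies of literature that never cite each other still share intermediate concepts. The following is a minimal sketch of that idea, not the method Swanson or BioASQ actually used. The term sets are copied from the slide; the function name `shared_intermediates` and the assumption that the terms are already normalised to identical strings are illustrative only.

```python
# Findings as listed on the slide; real systems would extract and
# normalise these terms from the literature (assumed done here).
RAYNAUD_FACTORS = {
    "blood viscosity",
    "platelet aggregability",
    "vasoconstriction",
}
FISH_OIL_REDUCES = {
    "blood lipids",
    "platelet aggregability",
    "blood viscosity",
    "vascular reactivity",
}


def shared_intermediates(treatment_effects, disease_factors):
    """Return the concepts that appear in both sets, sorted by name.

    A non-empty result suggests a candidate hypothesis worth checking
    against the literature; it is not evidence on its own.
    """
    return sorted(treatment_effects & disease_factors)


links = shared_intermediates(FISH_OIL_REDUCES, RAYNAUD_FACTORS)
print(links)  # ['blood viscosity', 'platelet aggregability']
```

The overlap only proposes a hypothesis; as on the slide, it still has to be supported by literature and confirmed experimentally.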
8 BioASQ vision
2 articles are published in biomedical journals every minute!
Make sure this knowledge is used to the benefit of patients.
Need to make it accessible to biomedical experts.
Search is not effective enough.
Push research in automated answering of questions.
A challenge for such systems can achieve a multiplying effect.
9-12 BioASQ ecosystem
13-15 Talking to BioASQ experts
"As I'm growing older... I spend more time in front of the computer but I learn less... The complexity has increased, the variety has increased and my time has been reduced. When I do research I use IT stuff all the time, I'm looking for papers and data... I'm also doing statistical analysis."
"PubMed and all this of course, we really depend on that. We cannot work if we don't search in those. The bulk of information, that's the main problem... If someone has some extra time and starts reading the results of a search then this might never end! Sometimes you get irrelevant results. That's the main problem."
"There is an abundance of structured information... Unfortunately not all structured databases are included into one. I am looking in at least twenty different places for the same protein... Since I use a number of different programs I forget them by the time I want to use them again and I have to remember them once more."
16 Putting big data to work
(Vision) Information systems that act like peers to human experts: understand the information need of the expert; represent the need in machine-readable format; match it to the information and data available in various sources; provide a comprehensive and comprehensible response, with supporting material.
(Big data added value) Integration of information from many sources and large-scale semantic indexing.
(Outlook) Long way ahead, but the impact of even marginal progress on public health can be very significant!
17 Where do we stand?
Big data is getting linked.
We have a range of tools for analysing and indexing such data.
BDE is set to bring the pieces together.
Challenges push research further: NLM has improved their MeSH indexing engine by 5% in the first year of BioASQ!
IBM Watson is to be put to use by 14 US cancer research institutes.
Robotic science assistants are making their appearance: Adam is generating functional genomics hypotheses about the yeast Saccharomyces cerevisiae.
18 Further information participants-area.bioasq.org
