On the need for intelligent access to big data in life sciences

Similar documents
Abstract ( ) Introduction

LIBRARY GUIDE & SELECTED RESOURCES FOR NURSING. To find books about and by nursing theorists:

From Distributed Computing to Distributed Artificial Intelligence

Big Data a threat or a chance?

Manjula Ambur NASA Langley Research Center April 2014

Acceleration for Personalized Medicine Big Data Applications

Keynote Speaker. Jay Schnitzer, MD, PhD MITRE Corporation Director Biomedical Sciences

I N T E L L I G E N T S O L U T I O N S, I N C. DATA MINING IMPLEMENTING THE PARADIGM SHIFT IN ANALYSIS & MODELING OF THE OILFIELD

Preparing for Graduate School in Biology and Related Fields

Integrating Predictive Analytics Into Clinical Practice For Improved Outcomes & Financial Performance


Connecting Basic Research and Healthcare Big Data

Kimmo Rossi. European Commission DG CONNECT

A leader in the development and application of information technology to prevent and treat disease.

Industrial Roadmap for Connected Machines. Sal Spada Research Director ARC Advisory Group

Government Technology Trends to Watch in 2014: Big Data

Health Informatics Research and Development in Europe

WHITE PAPER. Caradigm Healthcare Analytics. Healthcare Analytics

Donna J. Dean, Ph.D. October 27, 2009 Brown University

IB Math Research Problem

BIG DATA AGGREGATOR STASINOS KONSTANTOPOULOS NCSR DEMOKRITOS, GREECE. Big Data Europe

Data UNC. Vinayak Deshpande

COVER FEATURE PUTTING BIG DATA TO WORK PUTTING BIG DATA TO WORK BY AHMED NOOR

CARADIGM HEALTHCARE ANALYTICS. By Hamid Al-Azzawe, Vice President of Engineering, Caradigm WHITEPAPER

Automated Software Testing by: Eli Janssen

Roadmap for Ph.D. Students Aiming for a Successful Career in Science

How To Change Medicine

The NEW POSSIBILITY. How the Data Center Helps Your Organization Excel in the Digital Services Economy

Are You Ready for Big Data?

Introduction to Information and Computer Science: Information Systems

A Strategic Approach to Unlock the Opportunities from Big Data

Data Science & Engineering Consortium Initiative

Symantec Enterprise Vault

Biomedical Informatics: Computer Applications in Health Care and Biomedicine

Machine Learning and Data Mining. Fundamentals, robotics, recognition

Putting IBM Watson to Work In Healthcare

Why data quality should be a central focus of your CRM initiative. An Experian Data Quality white paper

CASE STUDY. Leeds Beckett University. SirsiDynix Symphony integration with Blackboard Learn

RESPONSE FROM GBIF TO QUESTIONS FOR FURTHER CONSIDERATION

People are People & Ads are Ads. Perspectives on B2B Copy Testing March 19, 2007

How to stop looking in the wrong place? Use PubMed!

2019 Healthcare That Works for All

PONTE Presentation CETIC. EU Open Day, Cambridge, 31/01/2012. Philippe Massonet

Essays on Teaching Excellence. Using Rubrics to Teach Science Writing

Moodlerooms Features & Services

Web-Based Genomic Information Integration with Gene Ontology

Boom and Bust Cycles in Scientific Literature A Toolbased Big-Data Analysis

Structure of Presentation. The Role of Programming in Informatics Curricula. Concepts of Informatics 2. Concepts of Informatics 1

EXPERIMENT #1: MICROSCOPY

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Big Data Hope or Hype?

Business Model Analysis and Evaluation Framework for PQoS-aware VoIP and IPTV Services of Mobile Operators

Karuna P Joshi, PhD. Research Asst. Professor. karuna.joshi@umbc.edu

Patient Centricity and the Changing Landscape of Healthcare

IBM Internet of Things Point of View and Strategy.

Worldwide Business Rules Management Systems 2011 Vendor Shares

CDISC and Clinical Research Standards in the LHS

Big Data R&D Initiative

Exploring the Resurgence of Cocoa Antioxidants

BRIDGing CDASH to SAS: How Harmonizing Clinical Trial and Healthcare Standards May Impact SAS Users Clinton W. Brownley, Cupertino, CA

Master of Science in BIOINFORMATICS. > information. > insight. > innovation

Big Data and Healthcare

Game Changers for Researchers: Altmetrics, Big Data, Open Access What Might They Change? Kiki Forsythe, M.L.S.

Master of Science in Artificial Intelligence

OpenFlow/SDN for IaaS Providers

Making an Impact in a VUCA World. Health Data Summit NAHDO 28 th Annual Conference December 11, 2013

MEDICAL DATA MINING. Timothy Hays, PhD. Health IT Strategy Executive Dynamics Research Corporation (DRC) December 13, 2012

The Healthcare Singularity and the Age of Semantic Medicine

A.I. in health informatics lecture 1 introduction & stuff kevin small & byron wallace

Transcription:

On the need for intelligent access to big data in life sciences The experience 21-05-2015 Georgios Paliouras, NCSR Demokritos

Life sciences then and now

Life sciences then and now

These data are too big for me! By Anderson for

Data growth is exponential Number of articles indexed by MEDLINE (PUBMED) per year http://dan.corlan.net/medline-trend.html Southan and Cameron, Beyond the tsunami: developing the infrastructure to deal with life sciences data, The fourth Paradigm: Data-Intensive Scientific Discovery, Microsoft Corp., 2009.

Data is source of knowledge Gillam et al., The healthcare singularity and the age of semantic medicine, The fourth Paradigm: Data-Intensive Scientific Discovery, Microsoft Corp., 2009.

One just needs to make the connection The Swanson case: Fish oil and Raynaud s syndrome Public knowledge since 1975: Raynaud s syndrome is associated with high blood viscosity, platelet aggregability, vasoconstriction. Public knowledge since 1984: Fish oil leads to reductions in blood lipids, platelet aggregability, blood viscosity, and vascular reactivity. Swanson puts the two together in 1986: Can dietary fish oil ameliorate or prevent Raynaud s syndrome? He supports his evidence with relevant literature. DiGiacomo confirms the hypothesis in 1989. Vision: Create big data machinery that help produce and support more such cases.

BioASQ vision 2 articles published in biomedical journals every minute! Make sure this knowledge is used to the benefit of patients Need to make it accessible to biomedical experts Search is not effective enough Push research in automated answering of questions A challenge for such systems can achieve a multiplying effect

BioASQ ecosystem

BioASQ ecosystem

BioASQ ecosystem

BioASQ ecosystem

Talking to BioASQ experts as I m growing older... I spend more time in front of the computer but I learn less.... the complexity has increased, the variety has increased and my time has been reduced. When I do research I use IT stuff all the time, I m looking for papers and data...i m also doing statistical analysis

Talking to BioASQ experts as I m growing older... I spend more time in front of the computer but I learn less.... the complexity has increased, the variety has increased and my time has been reduced. When I do research I use IT stuff all the time, I m looking for papers and data...i m also doing statistical analysis PubMed and all this of course, we really depend on that. We cannot work if we don t search in those. The bulk of information, that s the main problem.... if someone has some extra time and starts reading the results of a search then this might never end! Sometimes you get irrelevant results. That s the main problem.

Talking to BioASQ experts as I m growing older... I spend more time in front of the computer but I learn less.... the complexity has increased, the variety has increased and my time has been reduced. When I do research I use IT stuff all the time, I m looking for papers and data...i m also doing statistical analysis PubMed and all this of course, we really depend on that. We cannot work if we don t search in those. The bulk of information, that s the main problem.... if someone has some extra time and starts reading the results of a search then this might never end! Sometimes you get irrelevant results. That s the main problem. There is abundance of structured information... Unfortunately not all structured databases are included into one. I am looking at least into twenty different places for the same protein.... since I use a number of different programs I forget them by the time I want to use them again and I have to remember them once more.

Putting big data to work (Vision) Information systems that act like peers to human experts: understand the information need of the expert represent the need in machine-readable format match it to the information and data available in various sources provide comprehensive and comprehensible response, with supporting material (Big data added value) Integration of information from many sources and large-scale semantic indexing. (Outlook) Long way ahead but the impact of even marginal progress on public health can be very significant!

Where do we stand? Big data is getting linked We have a range of tools for analysing and indexing such data BDE is set to bring the pieces together Challenges, such as push research further; NLM has improved their MeSH indexing engine by 5%, in the first year of BioASQ! IBM Watson to be put in use by 14 US cancer research institutes Robotic science assistants making their appearance; Adam generating functional genomics hypotheses about the yeast Saccharomyces cerevisiae

Further information www.big-data-europe.eu www.bioasq.org, participants-area.bioasq.org Follow @BigData Europe, @BioASQ