From Data to Foresight:




From Data to Foresight: Leveraging Data and Analytics for Materials Research

Laura Haas, IBM Fellow
IBM Research - Almaden

2011 IBM Corporation

The road from data to foresight is long

[Figure: "Consumer Reports"; "How can I reduce my ...?" (question truncated in the source)]

- Must acquire, integrate, enhance and align
- Must deal with missing and incomplete data
- Must store, protect, and manage
- Must create models and other analytics and test them
- Must run these analyses efficiently over large data volumes
- Must understand and share results

[Figure: dependency diagram of a rainfall-runoff model. Legend: flux computations (surface runoff, interflow, percolation, overland routing, baseflow, upper and lower layer evaporation, miscellaneous fluxes, rainfall error), state computations (solve state equations, update state for the upper and lower layers), and inputs and outputs (observed precipitation, potential evapotranspiration, routed runoff). Note: in addition to the dependencies shown, most flux calculations depend on the values of state variables at the previous timestep.]

Requires significant (and expensive) expertise in data management, systems, analytics, and the domain.
Takes time.

The 4 V's of data

- Volume (data at rest): terabytes to exabytes of existing data to process
- Velocity (data in motion): streaming data, milliseconds to seconds to respond
- Variety (data in many forms): structured, unstructured, text, multimedia
- Veracity* (data in doubt): uncertainty due to data inconsistency and incompleteness, ambiguities, latency, deception, model approximations

* Truthfulness, accuracy or precision, correctness

Valuable new insights are hidden in this wealth of data!

- Detect life-threatening conditions at hospitals in time to intervene
- Predict weather patterns to plan optimal wind turbine usage, and optimize capital expenditure on asset placement
- Make risk decisions based on real-time transactional data
- Identify criminals and threats from disparate video, audio, and data feeds
- Discover and optimize new materials by mining data in the patents and literature

Fortunately, new platforms can unlock the value of data

New analytic applications drive the requirements for a big data platform.

Analytic applications: BI / Reporting, Exploration / Visualization, Functional App, Industry App, Predictive Analytics, Content Analytics

Platform requirements:
- Integrate and manage the full variety, velocity and volume of data
- Apply advanced analytics to information in its native form
- Visualize all available data for ad hoc analysis
- Develop new analytic applications
- Optimize and control scheduling of many simultaneous analyses
- Protect data and applications from accidents, sabotage, and theft

[Figure: IBM Big Data Platform. Components: Visualization & Discovery, Application Development, Systems Management, Accelerators, Hadoop System, Stream Computing, Data Warehouse, Information Integration & Governance.]

Outcome-based medicine vision: Leverage public and private content, rich analytics to improve treatment outcomes

Pipeline stages: Target Identification, Target Selection, Lead Discovery, Candidate Selection, Preclinical Development, Development Selection, Clinical I, II, III, Launch, Medical Care, Patient Experience, Patient Outcome

Research & Development and Intellectual Property
- Target identification and validation
- Lead discovery and optimization
- Safety and efficacy
- Genomics, proteomics, metabolomics
- Chemical and biological extraction, profiling, analytics, and reasoning

Clinical Decision Support
- Patient similarity and segmentation
- Patient cohorts for clinical support
- Clinical genomics analysis
- Comparative effectiveness research
- Predictive modeling of outcome
- Disease progression analysis
- Treatment cost analysis
- Temporal analysis

Patient experience and social community support
- Patient first-hand experiences
- Social community development and support

Key analytics capabilities: BI, text analytics, NLP, network analysis, relationship discovery, ML, modeling

Data sources: scientific literature, patents, ontologies, pathways, curated data, high-throughput screening, pre-clinical, safety, DMPK, formulation, clinical trials, electronic medical records, web and claims data, social media

An Example: Leveraging data to accelerate life sciences R&D

The Situation
Highly volatile, increasingly complex environment. Traditional R&D is not delivering. New approaches are needed.
- Collaborative R&D models: the new normal, requiring open platforms, clear boundaries and protection
- Agile responses: vital to drive fast adaptation to a changing competitive IP landscape, including adjustments to strategy, portfolio investments and partnerships
- Effective IP portfolio management: delivering key value for out-licensing and monetizing of non-core IP
- Strategic ecosystem development: growth and competitive differentiation through aggressive collaboration and early identification of acquisition and recruitment targets

The Solution
IBM BAO strategic IP insight platform (SIIP): a unique and powerful data and analytics offering
- Aggregates and processes 30M+ patents and scientific literature from around the globe
- Automatically extracts chemical and biological entities (200M+ chemical compound instances to date)
- Generates chemical and biological entity profiles
- Searches and analyzes using natural language-based inputs for key relationship discovery and IP insights
- Reasons about causality among drugs, diseases, targets, efficacy and side effects
- Integrates and enhances existing data and applications

The Benefits
- Valuable insights into competitive landscape, white space, and IP portfolio
- High quality chemical extractions available hours after patents are available from patent authorities
- Previously unobtainable insights at the scientists' fingertips with the touch of a button
- Fast and easy search and analysis, drastically reducing search time from weeks and months to just minutes

Benefits by function
- R&D: find white space and gain insight into complex chemical and biological patents; gain early insights into a given target-compound match from past patents for better research target and compound selection decisions
- Legal: detect IP infringement earlier and increase the quality of patent filings
- Corporate Strategy / Business Dev: identify collaboration and acquisition targets for greater research value and effectiveness, and find patent in- and out-licensing candidates for efficient management and monetization of IP

A Smart Entity Profiling, Analytics and Reasoning Methodology

An integrated framework leveraging a broad set of data and many types of analytics:
- Hypothesis generation
- Entity extraction and profiling
- Relationship discovery and analytics
- Summarization
- Reasoning
- Scoring and ranking
- Predictive modeling

Key steps:
- Extract key entities
- Combine information from multiple sources
- Discover relationships among entities
- Reason about relationships

[Figure: entity profile diagram. Profile facets and their attributes:]
- IP: legal status, assignee, foreign filings, expiration date
- Drug: activity, half life, protein binding
- Disease
- Organisms: organism, organ, cell, tissue
- Physical: computational, molecular weight, MF, Bp, Mp
- Medicine
- Pathways: metabolic, genetic, environmental, cellular, organism
- Spectral: IR, NMR, mass spectra
- Patients: genetic ..., medical records ..., life styles ..., medical history ...
- Reactions: enzymes
- Toxicity: clinical trials, pre-clinical
- Screening: activity

Data sources: Patents, Literature, Experimental, HTS, Medical Records, Clinical, Business, Social
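The key steps on this slide can be illustrated with a small sketch. It is not the SIIP implementation: it assumes a tiny hand-written entity dictionary, three made-up documents, and plain co-occurrence counting as a stand-in for relationship discovery. All names (ENTITY_TYPES, DOCUMENTS, and the functions) are hypothetical, and the reasoning step is left out.

```python
from collections import Counter, defaultdict
from itertools import combinations

# Hypothetical entity dictionary: surface form -> entity type.
ENTITY_TYPES = {
    "aspirin": "drug",
    "ibuprofen": "drug",
    "cox-1": "target",
    "inflammation": "disease",
}

# Hypothetical documents from different source types.
DOCUMENTS = [
    {"source": "patent", "text": "Aspirin inhibits COX-1 and reduces inflammation."},
    {"source": "literature", "text": "Ibuprofen is used to treat inflammation."},
    {"source": "literature", "text": "Aspirin binds COX-1 irreversibly."},
]


def extract_entities(text):
    """Step 1: return the set of known entities mentioned in the text."""
    lowered = text.lower()
    return {name for name in ENTITY_TYPES if name in lowered}


def build_profiles(documents):
    """Step 2: combine mentions from all sources into one profile per entity."""
    profiles = defaultdict(lambda: {"type": None, "sources": Counter()})
    for doc in documents:
        for name in extract_entities(doc["text"]):
            profiles[name]["type"] = ENTITY_TYPES[name]
            profiles[name]["sources"][doc["source"]] += 1
    return dict(profiles)


def discover_relationships(documents):
    """Step 3: count how often each pair of entities co-occurs in a document."""
    pairs = Counter()
    for doc in documents:
        for a, b in combinations(sorted(extract_entities(doc["text"])), 2):
            pairs[(a, b)] += 1
    return pairs


def rank_relationships(pairs):
    """Scoring and ranking: order pairs by co-occurrence count, then by name."""
    return sorted(pairs.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    for name, profile in sorted(build_profiles(DOCUMENTS).items()):
        print(name, profile["type"], dict(profile["sources"]))
    for pair, count in rank_relationships(discover_relationships(DOCUMENTS)):
        print(pair, count)
```

A real system would replace the dictionary lookup with chemical and biological entity recognition and the co-occurrence count with a richer relationship model; the sketch only shows how the steps feed one another.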

Information and Governance for Big Data

Leverage private and public clouds to share data or keep it proprietary, as appropriate.

Summary

- There is much to be gained from leveraging available data and content
  - Accelerate discovery
  - Avoid repeating work
- Unlocking the value buried in that data is difficult
  - 4 V's: Volume, Velocity, Variety, Veracity
  - A long process requiring many types of expertise
- There are powerful platforms and tools that can help
  - Aid development of type-specific analytics
  - Enable fast and timely processing of large, diverse data sets
- Sharing, with appropriate data governance, can accelerate discovery
  - Controls for the entire data lifecycle
  - Many industry groups are finding leverage from shared investments