Data Virtualization
Paul Moxon, Denodo Technologies
Alberta Data Architecture Community, January 22nd, 2014
The Changing Speed of Business
Gartner The Nexus of Forces
Today's Data Realities
- Data volumes are growing and will continue to grow at an accelerating rate
- As new sources of data are exploited, data complexity is here to stay
- There is no such thing as a batch window: the business wants the data now
- Users want more than canned reports: they want self-service with access to data as resources
- Costs of traditional data solutions are escalating
Traditional Methods - Too Slow, Too Limited, Too Costly
The Data Warehousing Institute: organizations take on average
- 8 weeks to add a new data source to a data warehouse
- 7 weeks to build a complex dashboard or report
Forrester:
- IT spends on average 1% of revenue on storage
- 75% of the data stored is inactive and rarely accessed
- 90% of all data access requests in production OLTP are serviced by new data
Gartner Top Technology Trends 2013, Data Virtualization:
- Central to success is an enabling technology infrastructure that helps information producers and information consumers organize, share and exchange any type of data and content, anytime, anywhere
Business Need for Information Agility
- Business entity views
- Pre-integrated information
- Discovery, self-service
- Fast access, ~ real-time
- Trustworthy, accurate
- Flexible, mobile semantic relations
[Figure: progression from data to information to knowledge to wisdom along an "understanding" axis, through understanding relations, understanding patterns, and understanding principles]
Information Gap
What is Data Virtualization?
Data Virtualization combines disparate data sources into a single virtual data layer that provides unified access and integrated data services to consuming applications in real-time (right-time).
- Connect & Virtualize
- Combine & Integrate
- Publish as a Data Service
[Figure: three base views combined into one derived view]
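The connect, combine, and publish steps can be illustrated with a minimal sketch. It is not Denodo's API: the sources, the function names (`crm_base_view`, `billing_base_view`, `customer_totals_derived_view`, `customer_totals_service`) and the sample data are all hypothetical, chosen only to show base views wrapping sources, a derived view combining them at query time, and a service exposing the result.

```python
import csv
import io
import json

# Two hypothetical "sources": a CSV export and an in-memory record list.
CRM_CSV = "customer_id,name\n1,Acme\n2,Globex\n"
BILLING_ROWS = [
    {"customer_id": 1, "amount": 120.0},
    {"customer_id": 1, "amount": 30.0},
    {"customer_id": 2, "amount": 75.5},
]


def crm_base_view():
    """Base view: read the CRM source at query time and normalize types."""
    for row in csv.DictReader(io.StringIO(CRM_CSV)):
        yield {"customer_id": int(row["customer_id"]), "name": row["name"]}


def billing_base_view():
    """Base view: expose the billing rows unchanged."""
    yield from BILLING_ROWS


def customer_totals_derived_view():
    """Derived view: join both base views and aggregate billing per customer."""
    totals = {}
    for row in billing_base_view():
        key = row["customer_id"]
        totals[key] = totals.get(key, 0.0) + row["amount"]
    return [
        {
            "customer_id": c["customer_id"],
            "name": c["name"],
            "total_billed": totals.get(c["customer_id"], 0.0),
        }
        for c in crm_base_view()
    ]


def customer_totals_service():
    """Publish step: serialize the derived view as a JSON payload."""
    return json.dumps(customer_totals_derived_view())
```

Nothing is copied into a new store here: each call to `customer_totals_service()` re-reads the sources, which is the point of the virtual layer in this simplified picture.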
Data Virtualization Key Tenets
Realize Value from all Data
- Universal data access across internal, external, web, structured or unstructured
Virtualization Minimizes Replication
- Flexible integration options with fine control: virtual real-time, cached or scheduled batch
Abstracted and Unified Data Services
- Abstracted data delivered as reusable data services
- Managed access control, service levels
Enterprise Class, Powerful and Agile
- Performance and scalability
- Data governance, lineage, management
- Easily integrated into existing IT infrastructure
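The choice between virtual real-time and cached access can be sketched as a time-to-live setting on a view. This is an illustrative assumption, not how any particular product implements caching: the `VirtualView` class, its `ttl_seconds` parameter, and the injectable `clock` are hypothetical names.

```python
import time


class VirtualView:
    """Hypothetical view wrapper: ttl_seconds=0 means always query the source."""

    def __init__(self, fetch, ttl_seconds=0.0, clock=time.monotonic):
        self._fetch = fetch          # callable that reads the underlying source
        self._ttl = ttl_seconds      # how long a cached result stays valid
        self._clock = clock          # injectable so the behavior can be tested
        self._cached = None
        self._fetched_at = None

    def query(self):
        now = self._clock()
        fresh = (
            self._fetched_at is not None
            and now - self._fetched_at < self._ttl
        )
        if not fresh:
            # Real-time mode, or the cache has expired: go back to the source.
            self._cached = self._fetch()
            self._fetched_at = now
        return self._cached
```

With `ttl_seconds=0.0` every `query()` call reaches the source; with a positive value, repeated calls inside the window reuse the last result. A scheduled batch refresh would replace the expiry check with an external trigger.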
Core Data Virtualization Capabilities
- Unified Data Governance
- On-demand, real-time access to disparate data
- Agile Data Services Provisioning
- Logical Abstraction & Decoupling
- Semantic Integration of Structured, Web, Unstructured
Denodo Data Virtualization Platform
Denodo Data Virtualization Platform
- On-demand, real-time access to disparate data
- Unified Data Governance
- Agile Data Services Provisioning
- Semantic Integration of Structured, Web, Unstructured
- Logical Abstraction & Decoupling
Denodo Platform Demo
Broad Spectrum Data Virtualization Patterns
- Analytics / Informational
- Operational / Transactional
- Web, Cloud, and B2B Integration
- Data Management & Data Services Infrastructure
Broad Spectrum Data Virtualization Patterns: Analytical/Informational
Mainstream BI & DW
- Real Time Reporting
- Operational BI / Analytics
- Prototyping EDW
- Logical DW
- Virtual Data Marts
- Hybrid DV-ETL
Big Data Analytics
- NoSQL as Sandbox
- NoSQL for Cold Data Storage
- NoSQL Staging Area
- Hybrid Data Storage
- NoSQL for ETL
- Expose Big Data Results
Data Discovery / Self-Service
- Data Discovery, What If Analytics
- Self-Service BI and Reporting
Broad Spectrum Data Virtualization Patterns: Operational/Transactional
Data Services for Application Development
- Agile Mobile and Cloud App Development
- Agile SOA and BPM Development
- Agile Portal and Collaboration Development
Data Abstraction for Migration & Modernization
- Legacy Application Modernization
- Migration from Enterprise to Cloud/SaaS
- Mergers & Acquisitions Data Consolidation
B2B Data Services & Integration
- Data Services for Partners
- B2B Integration through Web Automation
Single View Applications
- Customer Service & Call Centers
- Products/Product Catalogs
- Vertical Specific (e.g. Well or Physician Data)
Broad Spectrum Data Virtualization Patterns: Web, Cloud, and B2B Integration
- Web Extraction (data.gov, public sources, etc.)
- Competitive BI
- Data Services in Cloud
- Social Media Integration
- Cloud/SaaS Application Integration
- B2B Integration through Web
Broad Spectrum Data Virtualization Patterns: Data Management & Data Services Infrastructure
- Canonical Views of Data Entities
- Enterprise Business Data Glossary
- Virtual MDM
- Integrated SOA Data, Logic, & Business Services
- Enterprise Data Services
Patterns: Analytical/Informational
- Logical Data Warehouse
- Hybrid DV-ETL
- Hybrid Data Storage
- NoSQL for ETL
Patterns: Operational/Transactional
- Agile SOA & BPM Development
- Migration - Enterprise to Cloud/SaaS
- Customer Call Center
- Mergers & Acquisitions
Biogen Idec - Agile BI
Real-time Sales Reporting Across 90 Countries
Biogen Idec: Key Benefits and ROI from Data Virtualization
Example: executive report on Global Drug Sales (units) across all products
Business Problem
- Manual process
- Data freshness and quality concerns
- Fragile and complex to change
Business Benefits
- Automated
- Validated and trusted data
- Self service enabled for changes
- Available on-demand
IT Challenges
- Diverse data sources
- Structured and semi-structured data
- Data structure complexity
- Constantly changing sources and data
IT Benefits
- Implementation: 4 weeks
- Changes: 2-4 days
- Low maintenance and support costs
Biogen Idec executives now have access to new partner and public data sources that they could not leverage before: 60% faster, with change requests met in just a few days and IT using 40% less analyst time to support.
The Climate Corporation Data Environment
The Climate Corporation Data Architecture
The Climate Corporation Data Virtualization
The Climate Corporation: Impact
- First use cases for Sales implemented
- 3X improvement in time-to-market with 1/3 the team
Other use cases:
- Risk reporting automations using Hive/Denodo
- Linked Data Services and BI Portal
- Operational Data Integration
R- Single View of Customer
R Services Architecture
Broad Spectrum Data Virtualization Webinar Series
Four sessions covering trends and a wide range of use case patterns of data virtualization for 2014 and beyond.
Data Virtualization is more than just Agile BI. Broad spectrum data virtualization is about building a common semantic data layer that serves both Informational and Operational/Transactional business needs. Join leading industry analysts, including Claudia Imhoff and Rick van der Lans, for a webinar series that examines the wide range of use cases for broad spectrum data virtualization.
Join the conversation on Twitter: #broadspectrumdv
Q & A
Data Virtualization: Fast, Flexible and Unified Data Access