Beyond von Neumann. Dr. Mark E. Dean, PhD Fisher Distinguished Professor UTK College of Eng. 2009 IBM Corporation



Similar documents
IBM Centennial Getting ready for a Smarter Planet & Big Data

Smarter Planet evolution

Von Social Media zum Social Business Ein Megatrend für die Geschäftswelt

BMW11: Dealing with the Massive Data Generated by Many-Core Systems. Dr Don Grice IBM Corporation

Data Centric Computing Revisited

Big Data, Integration and Governance: Ask the Experts

Big Data: Aspirations, Applications, and Analytics IBM Corporation

A New Era of Computing

A Strategic Approach to Unlock the Opportunities from Big Data

Beyond Watson: The Business Implications of Big Data

Data Centric Systems (DCS)

What s Behind Big Data and Behavorial Analytics

Big Data: Image & Video Analytics

A New Era Of Analytic

Big Data, Analytics, Intelligence: Potenziale und Nutzen

Big Data & Analytics for Semiconductor Manufacturing

Test Data Management in the New Era of Computing

How To Get A Computer Engineering Degree

Big Data and Big Data Modeling

Gi-Joon Nam, IBM Research - Austin Sani R. Nassif, Radyalis. Opportunities in Power Distribution Network System Optimization (from EDA Perspective)

Dr. John E. Kelly III Senior Vice President, Director of Research. Differentiating IBM: Research

Key Attributes for Analytics in an IBM i environment

DGE /DG Connect

Big Data: Study in Structured and Unstructured Data

Architecture 3.0 Landscape Analytics

Trends in High-Performance Computing for Power Grid Applications

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

IBM Data Warehousing and Analytics Portfolio Summary

Machina Research. Where is the value in IoT? IoT data and analytics may have an answer. Emil Berthelsen, Principal Analyst April 28, 2016

Microwatt to Megawatt - Transforming Edge to Data Centre Insights

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

The Flash-Transformed Financial Data Center. Jean S. Bozman Enterprise Solutions Manager, Enterprise Storage Solutions Corporation August 6, 2014

The 4 Pillars of Technosoft s Big Data Practice

Smarter Analytics. Barbara Cain. Driving Value from Big Data

Putting IBM Watson to Work In Healthcare

ELE 356 Computer Engineering II. Section 1 Foundations Class 6 Architecture

ANALYTICS BUILT FOR INTERNET OF THINGS

IBM Big Data Platform

Introducing Oracle Exalytics In-Memory Machine

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: Vol. 1, Issue 6, October Big Data and Hadoop

Akuda Labs. Leverages Peak Hosting s Operations-as-a-Service Managed Hosting Solution to Process Big Data Analytics 500 Faster without Big Costs

SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Strategic Decisions Supported by SAP Big Data Solutions. Angélica Bedoya / Strategic Solutions GTM Mar /2014

Is Big Data a Big Deal? What Big Data Does to Science

What happens when Big Data and Master Data come together?

CSC384 Intro to Artificial Intelligence

Exploiting Data at Rest and Data in Motion with a Big Data Platform

No Data Governance, No Actionable Insights

The Internet of Things

SQL Server 2012 Performance White Paper

SECURITY MEETS BIG DATA. Achieve Effectiveness And Efficiency. Copyright 2012 EMC Corporation. All rights reserved.

IBM Business Analytics software for Insurance

Big Data Are You Ready? Thomas Kyte

News and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren

Massive Scale Analytics for a Smarter Planet

Big Data overview. Livio Ventura. SICS Software week, Sept Cloud and Big Data Day

Statistical Challenges with Big Data in Management Science

How the oil and gas industry can gain value from Big Data?

Dell* In-Memory Appliance for Cloudera* Enterprise

Definition of Computers. INTRODUCTION to COMPUTERS. Historical Development ENIAC

Ramesh Bhashyam Teradata Fellow Teradata Corporation

Real-Time Big Data Analytics SAP HANA with the Intel Distribution for Apache Hadoop software

Understanding the Value of In-Memory in the IT Landscape


Big Data and Analytics 21 A Technical Perspective Abhishek Bhattacharya, Aditya Gandhi and Pankaj Jain November 2012

Understanding traffic flow

High Performance Computing. Course Notes HPC Fundamentals

Technology and Trends for Smarter Business Analytics

BIG DATA. - How big data transforms our world. Kim Escherich Executive Innovation Architect, IBM Global Business Services

YOU VS THE SENSORS. Six Requirements for Visualizing the Internet of Things. Dan Potter Chief Marketing Officer, Datawatch Corporation

CREATING & MANAGING A DYNAMIC INFRASTRUCTURE

IT Platforms for Utilization of Big Data

Chapter 2 Logic Gates and Introduction to Computer Architecture

Data Refinery with Big Data Aspects

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

Big Data Analytics in Space Exploration and Entrepreneurship

Software AG Fast Big Data Solutions. Come la gestione realtime dei dati abilita nuovi scenari di business per le Banche

Enabling the SmartGrid through Cloud Computing

Achieving Nanosecond Latency Between Applications with IPC Shared Memory Messaging

Big Data Challenges in Bioinformatics

Big Data in Transportation Engineering

Luncheon Webinar Series May 13, 2013

Logical Operations. Control Unit. Contents. Arithmetic Operations. Objectives. The Central Processing Unit: Arithmetic / Logic Unit.

2010 Ingres Corporation. Interactive BI for Large Data Volumes Silicon India BI Conference, 2011, Mumbai Vivek Bhatnagar, Ingres Corporation

Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database

Are You Ready for Big Data?

Intelligent Government From Data to Decision. Robert Lindsley Oracle, Public Sector Technology Group

Dr. Raju Namburu Computational Sciences Campaign U.S. Army Research Laboratory. The Nation s Premier Laboratory for Land Forces UNCLASSIFIED

Benchmarking Cassandra on Violin

Data-Driven Decisions: Role of Operations Research in Business Analytics

How To Build A Cloud Computer

Introduction to the Mathematics of Big Data. Philippe B. Laval

Demystifying Big Data Government Agencies & The Big Data Phenomenon

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

In-memory computing with SAP HANA

Hur hanterar vi utmaningar inom området - Big Data. Jan Östling Enterprise Technologies Intel Corporation, NER

Design Cycle for Microprocessors

Big Data & Analytics. The. Deal. About. Jacob Büchler jbuechler@dk.ibm.com Cand. Polit. IBM Denmark, Solution Exec IBM Corporation

Transcription:

Beyond von Neumann Dr. Mark E. Dean, PhD Fisher Distinguished Professor UTK College of Eng. 2009 IBM Corporation

Businesses are dying of thirst in an ocean of data 90% of the world s data was created in the last two years 80% of the world s data today is unstructured 20% is the amount of available data traditional systems leverages 2 1 in 2 business leaders don t have access to data they need Source: GigaOM, Software Group, IBM Institute for Business Value" 83% of CIOs cited BI and analytics as part of their visionary plan 2.2X more likely that top performers use business analytics

Our Connected World will Drive the Creation of Big/Fast Data Number of Connected Devices 50 50 Billion 40 30 20 15 Billion 10 7 Billion 2010 2015 2020 Multiple Sources: Intel, Ericsson, Gartner, etc.

New Big/Fast Data Brings New Challenges & Opportunities, Requires New Analytics Exa Homeland Security 600,000 records/sec, 50B/day 1-2 ms/decision 320TB for Deep Analytics Peta Up to 10,000 Times larger Telco Promotions 100,000 records/sec, 6B/day 10 ms/decision 270TB for Deep Analytics Data Scale Tera Giga Data at Rest DeepQA 100s GB for Deep Analytics 3 sec/decision Mega Traditional Data Warehouse and Business Intelligence Kilo Data in Motion Up to 10,000 times faster yr mo wk day hr min sec ms µs Occasional Frequent Real-time Smart Traffic 250K GPS probes/sec 630K segments/sec 2 ms/decision, 4K vehicles Decision Frequency

Analytics toolkits will be expanded to support ingestion and interpretation of unstructured data, and enable adaptation and learning New Data Traditional New Methods Adaptive Analysis Continual Analysis Optimization under Uncertainty Optimization Predictive Modeling Simulation Forecasting Alerts Query/Drill Down Ad hoc Reporting Standard Reporting Entity Resolution Relationship, Feature Extraction Annotation and Tokenization Responding to context Responding to local change/feedback Quantifying or mitigating risk Decision complexity, solution speed Causality, probabilistic, confidence levels High fidelity, games, data farming Larger data sets, nonlinear regression Rules/triggers, context sensitive, complex events In memory data, fuzzy search, geo spatial Query by example, user defined reports Real time, visualizations, user interaction People, roles, locations, things Rules, semantic inferencing, matching Automated, crowd sourced Ø Learn In the context of the decision process Ø Decide and Act Ø Understand and Predict Ø Report Ø Collect and Ingest/Interpret Decide what to count; enable accurate counting Extended from: Competing on Analytics, Davenport and Harris, 2007 5

The fourth dimension of Big Data: Veracity handling data in doubt Volume Velocity Variety Veracity* Data at Rest Data in Motion Data in Many Forms Data in Doubt Terabytes to exabytes of existing data to process Streaming data, milliseconds to seconds to respond Structured, unstructured, text, multimedia Uncertainty due to data inconsistency & incompleteness, ambiguities, latency, deception, model approximations * Truthfulness, accuracy or precision, correctness 6 6

By 2015, 80% of all available data will be uncertain Global Data Volume in Exabytes 9000 8000 7000 6000 5000 4000 3000 2000 1000 0 100 90 80 70 60 50 40 30 20 10 Aggregate Uncertainty % Data quality solutions exist for enterprise data like customer, product, and address data, but this is only a fraction of the total enterprise data. Multiple sources: IDC,Cisco By 2015 the number of networked devices will be double the entire global population. All sensor data has uncertainty. The total number of social media accounts exceeds the entire global population. This data is highly uncertain in both its expression and content. Enterprise Data 2005 2010 2015 7 7

Advances in computational systems 5-10 years Cognitive Computing Compute+ Natural Language+ Analytics Program Deep Q&A Computers 1 Big Data Synapse devices L e a rn BIG/Fast 00 X 0, 0 0 0, 1,000 à Data + analytics (zettabytes + milli / microseconds net Smarter Pla ple) ings + Peo h T f o t e rn (Inte Exascale (Datacenter-in-a-box) Massive parallelism Flexible system optimization Processing in-memory 1000X Workload Optimized Systems Nano Systems (Systems-on-a-chip) 1B Transistors Nano Devices (Power7 chip) 1000X 1T Devices Photonics DNA Transistor PhaseChangeMemory Carbon Nanotubes Memristor

Architectural Enhancements Relative Performance (log) Scalar processing RISC, CISC, Vector, (Frequency) Superscalar Out-of-Order Superpipeline Mutli-level Caches (Freq. & memory) Workload optimized - Petaflop - Deep Q/A - Storage - SSD Multicore SMT Processor Networks (Chip Density) 1970 2000 2018

Log of Compute Power 1E+12 Integrated Circuit Nanotechnology $1000 Buys: Computations per second 1E+9 1E+6 1E+3 1E+0 1E-3 Mechanical Electro- Mechanical Vacuum Tube Discrete Transistor 1E-5 1900 1920 1940 1960 1980 2000 2020 Source: Kurzweil 1999 Moravec 1998

Device Structure Research Pipeline C Electronics Fully Depleted Devices HfO2 ETSOI Si NW Deposited Si FINFET Conventional Planar Device 22/20 nm 15/11 nm 8 nm & Beyond Si Nano-Wire

The Charge to Exascale: Future Technologies 1 PetaFlop 72 BG/P Racks Overall Performance = 1000X Overall Accessable Data = 10^6X Performance / watt = 135X Performance / $ = 1000X Footprint = <2% Referenced to1pf system CPU Phase Change Memory Software Silicon Photonics 3D CNT Graphene The Next Ten Years 10 PetaFlop 100 P7IH Racks 1 PetaFlop = 1/3 rack

New Computing Architecture for Data-centric Environments Non Von Neumann Architectures New Architecture & Programming Model New Interconnect, Signaling & Encoding N N Ak New Components & Chips Presynaptic Postsynaptic New Switch & Devices

Data-centric model for computer structure Cognitive Computing

Computers and the Brain are Dramatically Different Separates memory and processor Sequential, centralized processing Ever increasing clock rates, high active power Huge passive power Programmed system, hard-wired, fault-prone Algorithms and analytics Integrates memory and processor Parallel, distributed processing Event-driven, low active power Does nothing better, low passive power Learning system, reconfigurable, fault-tolerant Substrate and pattern recognition

Von Neumann Computing: Left Brain Computing Cognitive Computing: Right Brain Computing Sequential, Analytical Text, numbers, symbolic Front-end, back-end intelligence Parallel, Synthetic Sense-act, sub-symbolic In-situ, physical intelligence Centralized Clock Distributed Event-driven Bus Logical communication by messages No Bus Local physical, global logical communication Cache Memory-inefficient for real-time Registers Overwritten No Cache Fundamentally memoryefficient Update when state changes Programming Hard-wired Fault-prone Algorithms Learning Reconfigurable Fault-tolerant Variable Precision Computation-driven so power-agnostic; Flops Energy-driven so poweraware; Flops / W; scales with activity

Example Program: SyNAPSE Multi-institutional, Multi-disciplinary, Vertically-integrated Approach Potential Applications: - Spatial navigation - Machine vision - Complex System Modeling/Analysis - Pattern recognition - Associative memory - Security $41M in funding from DARPA

Other Approaches to Cognitive Computing Artificial Neural Networks Attributes: Data-Centric New programming model Event-driven, low-power operatioons Integrated memory, computation, & communication Challenges Complexity, New architectures New Programming Model Use of existing CMOS structures & devices inefficient Integration with existing computing structures

Thank you.