Exploiting Data at Rest and Data in Motion with a Big Data Platform



Similar documents
Are You Ready for Big Data?

Industry Impact of Big Data in the Cloud: An IBM Perspective

Are You Ready for Big Data?

How the oil and gas industry can gain value from Big Data?

IBM Data Warehousing and Analytics Portfolio Summary

Deploying Big Data to the Cloud: Roadmap for Success

Addressing Open Source Big Data, Hadoop, and MapReduce limitations

BAO & Big Data Overview Applied to Real-time Campaign GSE. Joel Viale Telecom Solutions Lab Solution Architect. Telecom Solutions Lab

Beyond Watson: The Business Implications of Big Data

Big Data and Trusted Information

A New Era Of Analytic

Big Data & Analytics. The. Deal. About. Jacob Büchler jbuechler@dk.ibm.com Cand. Polit. IBM Denmark, Solution Exec IBM Corporation

Big Data and the new trends for BI and Analytics Juha Teljo Business Intelligence and Predictive Solutions Executive IBM Europe

Luncheon Webinar Series May 13, 2013

IBM Information Management Overview

Big Data Use Case Deep Dive 5 Game Changing Use Cases for Big Data

IBM Big Data Platform

YOU VS THE SENSORS. Six Requirements for Visualizing the Internet of Things. Dan Potter Chief Marketing Officer, Datawatch Corporation

Raul F. Chong Senior program manager Big data, DB2, and Cloud IM Cloud Computing Center of Competence - IBM Toronto Lab, Canada

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

IBM AND NEXT GENERATION ARCHITECTURE FOR BIG DATA & ANALYTICS!

Big Data in Telco & Banking Analytics. Benjamin Sznajder IBM Research Haifa

IBM Big Data in Government

BIG DATA. - How big data transforms our world. Kim Escherich Executive Innovation Architect, IBM Global Business Services

IBM Big Data Platform

Demystifying Big Data Government Agencies & The Big Data Phenomenon

The Future of Data Management

A Strategic Approach to Unlock the Opportunities from Big Data

Big Data, Integration and Governance: Ask the Experts

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

CSC590: Selected Topics BIG DATA & DATA MINING. Lecture 2 Feb 12, 2014 Dr. Esam A. Alwagait

Big Data & Analytics for Semiconductor Manufacturing

BIG Data Analytics Move to Competitive Advantage

Big Data overview. Livio Ventura. SICS Software week, Sept Cloud and Big Data Day

Big Data Use Cases Update

The Future of Business Analytics is Now! 2013 IBM Corporation

Smarter Analytics. Barbara Cain. Driving Value from Big Data

Addressing government challenges with big data analytics

A holistic approach to Big Data

Business Analytics In a Big Data World Ted Malone Solutions Architect Data Platform and Cloud Microsoft Federal

BEYOND BI: Big Data Analytic Use Cases

Applications for Big Data Analytics

IBM System x reference architecture solutions for big data

Sources: Summary Data is exploding in volume, variety and velocity timely

DATAMEER WHITE PAPER. Beyond BI. Big Data Analytic Use Cases

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

Smarter Analytics Leadership Summit Big Data. Real Solutions. Big Results.

W H I T E P A P E R. Architecting A Big Data Platform for Analytics INTELLIGENT BUSINESS STRATEGIES

Big Data and Your Data Warehouse Philip Russom

Getting Started Practical Input For Your Roadmap

Beyond the Single View with IBM InfoSphere

The Enterprise Data Hub and The Modern Information Architecture

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Big Data Strategies with IMS

Klarna Tech Talk: Mind the Data! Jeff Pollock InfoSphere Information Integration & Governance

Building Confidence in Big Data Innovations in Information Integration & Governance for Big Data

Big Data Analytic Solution Accelerators Kevin Foster IBM Big Data Solution Architect

Big Data Realities Hadoop in the Enterprise Architecture

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica

The Future of Data Management with Hadoop and the Enterprise Data Hub

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Apache Hadoop in the Enterprise. Dr. Amr Awadallah,

Big Data / FDAAWARE. Rafi Maslaton President, cresults the maker of Smart-QC/QA/QD & FDAAWARE 30-SEP-2015

Information systems architecture for the Oil and Gas industry

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Data Refinery with Big Data Aspects

Auto-Classification for Document Archiving and Records Declaration

BIG DATA FUNDAMENTALS

Data Centric Computing Revisited

The IBM Agile Information Governance Process

IBM BigInsights for Apache Hadoop

Optimized for the Industrial Internet: GE s Industrial Data Lake Platform

Mohan Sawhney Robert R. McCormick Tribune Foundation Clinical Professor of Technology Kellogg School of Management

Transforming the Telecoms Business using Big Data and Analytics

The Lab and The Factory

Turning Big Data into Big Decisions Delivering on the High Demand for Data

Fujitsu Big Data Software Use Cases

How To Understand The Benefits Of Big Data

BIG DATA : PAST, PRESENT AND FUTURE - AN ANALYST S PERSPECTIVE

Big Data Er Big Data bare en døgnflue? Lasse Bache-Mathiesen CTO BIM Norway

Data Lake In Action: Real-time, Closed Looped Analytics On Hadoop

The top five ways to get started with big data

Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, Viswa Sharma Solutions Architect Tata Consultancy Services

How To Make Data Streaming A Real Time Intelligence

Transcription:

Exploiting Data at Rest and Data in Motion with a Big Data Platform Sarah Brader, sarah_brader@uk.ibm.com

What is Big Data? Where does it come from? 12+ TBs of tweet data every day 30 billion RFID tags today (1.3B in 2005) 4.6 billion camera phones world wide? TBs of data every day 25+ TBs of log data every day 80% Of world s data is unstructured 76 million smart meters in 2009 200M by 2014 100s of millions of GPS enabled devices sold annually 2+ billion people on the Web by end 2011

Characteristics of Big Data Extracting insight from an immense volume, variety and velocity of data, in context, beyond what was previously possible. Cost efficiently processing the growing Volume 2010 50x 35 ZB 2020 Responding to the increasing Velocity 30 Billion RFID sensors and counting Collectively Analyzing the broadening Variety 80% of the worlds data is unstructured Establishing the Veracity of big data sources 1 in 3 business leaders don t trust the information they use to make decisions

Variety Positioning Big Data Operational systems Data Warehousing Density of Data Value X Operational Analytics Ad-hoc Deep Analytics The New Data Data Volume - Terabytes

A New Approach for Big Data Traditional Approach Structured, analytical, logical New Approach Creative, holistic thought, intuition Transaction Data Data Warehouse Hadoop Streams Web Logs Internal App Data Structured Repeatable Mainframe Data Linear Monthly sales reports Profitability analysis Customer surveys OLTP System Data Structured Repeatable Linear Enterprise Integration Unstructured Exploratory Iterative Social Data Unstructured Exploratory Iterative Text Data: emails Brand sentiment Product strategy Maximum asset utilization Sensor data: images ERP data Traditional Sources New Sources RFID

IBM s Next Generation Big Data Architecture Data types Real-time analytics Actionable insight Machine and sensor data Decision management Image and video Enterprise content + + Deep analytics & modeling Predictive analytics and modeling Transaction and application data Operational information Enterprise warehouse Reporting & interactive analysis Reporting, analysis, content analytics Social data Third-party data Exploration, landing and archive Discovery and exploration Information Integration Data Quality Data Matching & MDM Metadata & Lineage Security & Privacy Lifecycle Management

Hadoop Technology Volume Scales to many petabytes Scales to thousands of nodes Variety Structured Semi-structured Unstructured Experimental analytics Apply structure on read Rapid exploitation of new data Structured Semi-structured Unstructured

Hadoop ++ InfoSphere BigInsights Based on Apache Open Source Integrates with OEM tech Applications developed elsewhere run without change Ease of use Skills in SQL, R etc are instantly usable Complexities of MR are hidden Enterprise capabilities Security Disaster Recovery Supports workload isolation Logs email GPFS quotes Client db phone clickstream trades weblog Posix

IBM InfoSphere Streams A platform for real-time analytics on BIG data Volume Gigabytes per second or more Terabyte per day or more Variety All kinds of data All kinds of analytics Velocity Insights in microseconds Agility Dynamically responsive Rapid application development Millions of events per second Just-in-time decisions Powerful Analytics Sensor, video, audio, text, and relational data sources Microsecond Latency

What kinds of analysis is Streams used for? Telephony CDR processing Social analysis Churn prediction Geomapping Transportation Intelligent traffic management Smart Grid & Energy Transactive control Phasor Monitoring Unit Health & Life Sciences Neonatal ICU monitoring Epidemic early warning system Remote healthcare monitoring Stock market Impact of weather on securities prices Analyze market data at ultra-low latencies Natural Systems Wildfire management Water management Law Enforcement, Defense & Cyber Security Real-time multimodal surveillance Situational awareness Cyber security detection Fraud prevention Detecting multi-party fraud Real time fraud prevention e-science Space weather prediction Detection of transient events Synchrotron atomic research Other Manufacturing Text Analysis

How Streams Works continuous ingestion Continuous ingestion Continuous analysis

How Streams Works Continuous ingestion Continuous analysis Filter / Sample Infrastructure provides services for Scheduling analytics across hardware hosts, Establishing streaming connectivity Transform Annotate Correlate Classify Achieve scale: By partitioning applications into software components By distributing across stream-connected hardware hosts Where appropriate: Elements can be fused together for lower communication latency

Further Information Resources Visit IBMBigDataHub for conversations www.ibmbigdatahub.com Visit BigDataUniversity for technical education www.bigdatauniversity.com InfoSphere BigInsights 2.1.2 Knowledge Center http://www- 01.ibm.com/support/knowledgecenter/SSPT3X_2.1.2/com.ibm.swg.im.infosphere.biginsights.welcome.doc/doc/welcome.html InfoSphere Streams Knowledge Center http://www- 01.ibm.com/support/knowledgecenter/SSCRJU/SSCRJU_welcome.html?lang=en Watson Explorer Knowledge Center http://www- 01.ibm.com/support/knowledgecenter/SS8NLW/SS8NLW_welcome.html?lang=en

Try it now! InfoSphere for BigInsights Quick Start Free, no limit, non-production version of BigInsights Features Big SQL, BigSheets, Text Analytics, Big R, management console, development tools Tutorials and education available http://www.ibm.com/developerworks/downloads/im/biginsightsquick/

How Can You Get Quick Start? Ibm.co/streamsqs http://www.ibm.com/software/data/infosphere/streams/quick-start/ 2 download options!