Big Data Application Examples from Industry and Research




Dr. Patrick Traxler
+43 7236 3343 898
Patrick.traxler@scch.at
www.scch.at
SCCH is an initiative of ...
SCCH is located in ...

Organizational Frame
- Non-profit organization constituted as a limited company (Ltd)
- Owners: Johannes Kepler University Linz; Upper Austrian Research GmbH; Association of Company Partners of SCCH
- ~65 employees (>100 with partners)
- EUR 5.6 million income incl. subsidies in business year 2012
- Founded in July 1999 within the K plus Program
- COMET competence center since January 2008
2007 Software Competence Center Hagenberg GmbH

Research Topics
- Process and Quality Engineering: software engineering, software quality, process and approaches
- Rigorous Methods in Software Engineering: formal methods, modeling, critical software components
- Software Analytics and Evolution: software architecture, model-based development, integration of architecture in development
- Data Analysis Systems: automated and intelligent data analysis, prediction, knowledge discovery
- Knowledge-Based Vision Systems: machine vision, object recognition, object tracking

Data Analysis Systems
[Diagram: application domains and topics]
Topics:
- Computational Models
- Semantic Knowledge Models
- Knowledge Discovery
- Machine Learning
- Stream Data Analysis
- Data Warehousing
- Data Management

Overview
- Internet of Things in industry
  - Industrial production
  - Machines & devices
  - A pattern for processing and analyzing industrial big data
- Disaster management NoSQL DWH integration
- Research in computer science
- Summary

Internet of Things in Industry
- Coined as the "Industrial Internet" by Evans & Annunziata, 2012 (General Electric)

Industrial Production
[Diagram labels: Subsystem 1, Subsystem 2, Subsystem 3, Subsystem i, Subsystem n, PIMS]
- Subsystems generate streams of sensor data
- Complex interaction of subsystems
- Data is stored in a production information management system (PIMS)
- Analysis tasks: quality assurance, process optimization, fault detection, fault diagnosis

Machines & Devices
[Diagram label: Big data storage]
- Machines at different locations generate streams of sensor data
- Many machines or devices (spatial-temporal context)
- Data is stored in a big data storage
- Analysis tasks: usage monitoring, condition monitoring, fault detection, fault diagnosis, ...

A Pattern for Processing and Analyzing Industrial Big Data
- Domains: machines & devices, industrial production, building automation, smart grid (renewable energy)
- Data = # Machines, devices, (sub)systems * # Time points * # Features
- # Time points = Time period * Frequency
General setting
- Units generate streams of sensor data (time, value)
- Central storage of data for analysis tasks, e.g. model learning once a week
- Near real-time processing of data [using a learned model], e.g. immediately check for faults, or optimize control every second
[Diagram labels: unit 1, unit 2, unit i, unit n, central storage]
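To give a feel for the size formula on this slide, the following minimal sketch plugs in numbers. The fleet size, sampling rate, feature count, and the 8 bytes per value are purely illustrative assumptions, not figures from the talk.

```python
def data_points(units: int, period_s: float, frequency_hz: float, features: int) -> int:
    """Data = # units * # time points * # features."""
    # Time points = time period * frequency
    time_points = round(period_s * frequency_hz)
    return units * time_points * features


# Hypothetical fleet: 1,000 machines, 50 sensor features each,
# sampled once per second (1 Hz) for one week.
WEEK_S = 7 * 24 * 3600
values = data_points(units=1000, period_s=WEEK_S, frequency_hz=1.0, features=50)

# Assumption: each value is stored as an 8-byte float, with no
# compression and no timestamp or key overhead.
raw_bytes = values * 8

print(f"{values:,} values, about {raw_bytes / 1e9:.1f} GB raw per week")
```

Even this modest hypothetical fleet produces tens of billions of values per week, which is why the product of units, time points, and features matters more than any single factor.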

A Pattern for Processing and Analyzing Industrial Big Data
Combining Big Data Storages (BDS) and Stream Processing Engines (SPE)
- BDS: for offline data processing and analysis
- SPE: for online, near real-time data processing and analysis
[Diagram labels: unit 1, unit 2, unit i, unit n, MUX, SPE, BDS (MapReduce), REPLAY, Model, "Read e.g. from RDBMS"]
- Can be implemented with current technology without much effort
- REPLAY partially solves the problem of different programming paradigms for SPEs (CQL) vs. BDSs (MapReduce)
- Open problem: usage of multiple SPEs per machine or combiner
- Open problem: integration of existing incremental learning tools such as MOA
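The BDS plus SPE pattern can be sketched in a few lines of plain Python. This is only one possible reading of the diagram labels: the class and function names are hypothetical, the in-memory list stands in for a real big data storage, and the three-sigma rule stands in for whatever model is actually learned offline.

```python
from statistics import mean, stdev
from typing import Callable, Iterable, NamedTuple


class Reading(NamedTuple):
    unit: str
    time: int
    value: float


class BigDataStorage:
    """Stand-in for the BDS: an append-only in-memory log (assumption)."""

    def __init__(self) -> None:
        self.log: list[Reading] = []

    def append(self, reading: Reading) -> None:
        self.log.append(reading)

    def replay(self) -> list[Reading]:
        # REPLAY: hand stored readings back in time order, so the same
        # stream logic can be run over historical data.
        return sorted(self.log, key=lambda r: r.time)


def learn_model(readings: Iterable[Reading]) -> tuple[float, float]:
    """Offline step (e.g. once a week): learn mean and standard deviation."""
    values = [r.value for r in readings]
    return mean(values), stdev(values)


def make_fault_check(model: tuple[float, float], k: float = 3.0) -> Callable[[Reading], bool]:
    """Online step: flag readings more than k standard deviations from the mean."""
    mu, sigma = model
    return lambda r: abs(r.value - mu) > k * sigma


def mux(stream: Iterable[Reading], storage: BigDataStorage,
        is_fault: Callable[[Reading], bool]) -> list[Reading]:
    """MUX: send every reading both to storage and to the stream check."""
    faults = []
    for reading in stream:
        storage.append(reading)
        if is_fault(reading):
            faults.append(reading)
    return faults


# Illustrative data: 30 historical readings, then two live readings.
bds = BigDataStorage()
for t in range(30):
    bds.append(Reading("unit1", t, 20.0 + (t % 3) * 0.5))

check = make_fault_check(learn_model(bds.replay()))
live = [Reading("unit2", 30, 20.4), Reading("unit2", 31, 35.0)]
live_faults = mux(live, bds, check)

# Replaying the stored data through the same check finds the same fault.
replayed_faults = [r for r in bds.replay() if check(r)]
print(live_faults == replayed_faults)
```

The point of the sketch is that the fault check is written once and used both on the live stream and on replayed history, which is the sense in which REPLAY sidesteps the need for a second, batch-style implementation.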

Disaster Management NoSQL DWH Integration
Use Case: INDYCO
- National research project (FFG)
- Development of a dynamic disaster management system
- Situations are derived from incoming sensor data
- Dynamic workflows are started depending on the current situation

[Figure: Use Case INDYCO]

Disaster Management NoSQL DWH Integration
Motivation (technology)
- Increasing popularity of NoSQL (big) data storages
  - Tabular/columnar data storages
  - Document storages
  - Graph storages
  - Key/value storages
Problem
- Fast reaction to live data
- Wish to integrate into traditional business intelligence (BI) environments

Disaster Management NoSQL DWH Integration
Use Case: INDYCO
- Complex Event Processing (CEP) engine (Drools) to interpret (live) sensor data
- Aggregated sensor data, detected situations, and executed workflows are loaded into the data warehouse (DWH) for ad-hoc analysis and possibly for long-term learning
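To illustrate what deriving a situation from live sensor data and starting a workflow can look like, here is a minimal sketch in plain Python. It does not use Drools or its rule language; the rule class, the "high_water" situation, the threshold, and the workflow name are all hypothetical.

```python
from collections import deque
from typing import Optional


class ConsecutiveThresholdRule:
    """Report a situation when the last `count` values all exceed `threshold`."""

    def __init__(self, situation: str, threshold: float, count: int) -> None:
        self.situation = situation
        self.threshold = threshold
        self.window: deque = deque(maxlen=count)  # sliding window of recent values

    def on_event(self, value: float) -> Optional[str]:
        self.window.append(value)
        window_full = len(self.window) == self.window.maxlen
        if window_full and all(v > self.threshold for v in self.window):
            return self.situation
        return None


# Hypothetical mapping from detected situation to the workflow to start.
WORKFLOWS = {"high_water": "notify_response_team"}

rule = ConsecutiveThresholdRule("high_water", threshold=4.0, count=3)
levels = [3.2, 4.1, 4.3, 3.9, 4.2, 4.4, 4.6]  # illustrative water levels

started = []
for t, level in enumerate(levels):
    situation = rule.on_event(level)
    if situation is not None:
        started.append((t, WORKFLOWS[situation]))

print(started)
```

Requiring several consecutive readings above the threshold is a simple way to avoid starting a workflow on a single noisy value; a real CEP engine expresses such temporal conditions declaratively.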

Disaster Management NoSQL DWH Integration
Use Case: INDYCO
- NoSQL database (MongoDB) to store live sensor data (and situations & workflows)
- MapReduce to build aggregates
- Aggregated sensor data, situations, and workflows are transferred to the DWH (SQL Server Integration Services)
- Basis for traditional BI (Analysis/Reporting Services) and advanced analytics (RapidMiner, KNIME)
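The aggregation step can be sketched as a map and a reduce function in plain Python. This is not MongoDB's API and not the project's actual job; the document fields (`sensor`, `ts`, `value`) and the hourly grouping are assumptions chosen for illustration.

```python
def map_reading(doc: dict) -> tuple:
    """Map: emit ((sensor, hour), (sum, count, min, max)) for one reading."""
    hour = doc["ts"] // 3600  # ts is assumed to be seconds since some epoch
    v = doc["value"]
    return (doc["sensor"], hour), (v, 1, v, v)


def reduce_partials(a: tuple, b: tuple) -> tuple:
    """Reduce: merge two partial aggregates (associative and commutative)."""
    return (a[0] + b[0], a[1] + b[1], min(a[2], b[2]), max(a[3], b[3]))


def aggregate(docs: list) -> list:
    """Run map and reduce locally and shape the result as flat rows."""
    groups: dict = {}
    for doc in docs:
        key, partial = map_reading(doc)
        groups[key] = reduce_partials(groups[key], partial) if key in groups else partial
    return [
        {"sensor": s, "hour": h, "mean": total / n, "min": lo, "max": hi, "count": n}
        for (s, h), (total, n, lo, hi) in sorted(groups.items())
    ]


# Illustrative sensor documents.
docs = [
    {"sensor": "s1", "ts": 10, "value": 2.0},
    {"sensor": "s1", "ts": 1800, "value": 4.0},
    {"sensor": "s1", "ts": 3700, "value": 6.0},
    {"sensor": "s2", "ts": 20, "value": 1.0},
]
rows = aggregate(docs)
for row in rows:
    print(row)
```

Because the partial aggregates carry sum and count rather than a mean, they can be merged in any order, and the resulting flat rows have the tabular shape that a relational DWH load expects.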

[Figure: Use Case INDYCO]

Research in Computer Science
- High redundancy: conditional mean & median. Major progress in MapReduce algorithms.
- Almost no redundancy: web, knowledge, social graph. Very limited MapReduce algorithms.
- Moderate redundancy: flow & sensor networks. Mixed results???

Summary
- Big data in industrial applications
  - Many machines or devices
  - Complex facilities
  - Temporal context (sensor data)
  - Complex feature set
- Data = # Machines, devices, systems * # Time points * # Features
- Industrial big data processing: usually achievable with current big data technologies.
- Industrial big data analysis: depends strongly on the application. Does an efficient MapReduce algorithm exist?

Contact
- Dr. Reinhard Stumptner, Reinhard.stumptner@scch.at, www.scch.at
- Dr. Thomas Natschläger, Thomas.natschlaeger@scch.at, www.scch.at
- Dr. Patrick Traxler, Patrick.traxler@scch.at, www.scch.at