Industry 4.0 and Big Data

Similar documents
COMP9321 Web Application Engineering

Standards for Big Data in the Cloud

Big Data and Analytics: Challenges and Opportunities

Volume 3, Issue 6, June 2015 International Journal of Advance Research in Computer Science and Management Studies

Big Data. Fast Forward. Putting data to productive use

Big Data and Industrial Internet

Chapter 7. Using Hadoop Cluster and MapReduce

The Rise of Industrial Big Data

Big Data on Microsoft Platform

INTERSEC BENCHMARK. High Performance for Fast Data & Real-Time Analytics Part I: Vs Hadoop

Big Data a threat or a chance?

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce

Analyzing Big Data with AWS

Secure and Semantic Web of Automation

Big Data Explained. An introduction to Big Data Science.

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Semantic Heterogeneity Reduction for Big Data in Industrial Automation

BIG. Big Data Analysis John Domingue (STI International and The Open University) Big Data Public Private Forum

Addressing Open Source Big Data, Hadoop, and MapReduce limitations

The Rise of Industrial Big Data. Brian Courtney General Manager Industrial Data Intelligence

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

Transforming the Telecoms Business using Big Data and Analytics

Cloud Search Based Applications for Big Data - Challenges and Methodologies for Acceleration

Where is... How do I get to...

BIG DATA TOOLS. Top 10 open source technologies for Big Data

How To Make Sense Of Data With Altilia

Open source Google-style large scale data analysis with Hadoop

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

SIMATIC IT Historian. Increase your efficiency. SIMATIC IT Historian. Answers for industry.

bigdata Managing Scale in Ontological Systems

How To Help Your Business With Big Data Analytics

Zero-in on business decisions through innovation solutions for smart big data management. How to turn volume, variety and velocity into value

Leveraging Big Data Technologies to Support Research in Unstructured Data Analytics

Big Data Framework for u-healthcare System. Tae-Woong Kim 1, Jai-Hyun Seu 2.

Mag. Vikash Kumar, Dr. Anna Fensel SEMANTIC DATA ANALYTICS AS A BASIS FOR ENERGY EFFICIENCY SERVICES

Copyright 2013 Splunk Inc. Introducing Splunk 6

Machina Research. Where is the value in IoT? IoT data and analytics may have an answer. Emil Berthelsen, Principal Analyst April 28, 2016

Real Time Big Data Processing

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

So What s the Big Deal?

Big Data and Your Data Warehouse Philip Russom

Storage and Retrieval of Large RDF Graph Using Hadoop and MapReduce

Big Data. White Paper. Big Data Executive Overview WP-BD Jafar Shunnar & Dan Raver. Page 1 Last Updated

Publishing Linked Data Requires More than Just Using a Tool

Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013

Enabling Self Organising Logistics on the Web of Things

Improving Data Processing Speed in Big Data Analytics Using. HDFS Method

UNIVERSITY OF INFINITE AMBITIONS. MASTER OF SCIENCE COMPUTER SCIENCE DATA SCIENCE AND SMART SERVICES

Data collection architecture for Big Data

Big Data Technology ดร.ช ชาต หฤไชยะศ กด. Choochart Haruechaiyasak, Ph.D.

Shared Infrastructure: What and Where is Collaboration Needed to Build the SM Platform?

BIG DATA & ANALYTICS. Transforming the business and driving revenue through big data and analytics

Big Data: Opportunities & Challenges, Myths & Truths 資 料 來 源 : 台 大 廖 世 偉 教 授 課 程 資 料

Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems

Outline. What is Big data and where they come from? How we deal with Big data?

Challenges for Data Driven Systems

De la Business Intelligence aux Big Data. Marie- Aude AUFAURE Head of the Business Intelligence team Ecole Centrale Paris. 22/01/14 Séminaire Big Data

SURVEY REPORT DATA SCIENCE SOCIETY 2014

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Data Virtualization A Potential Antidote for Big Data Growing Pains

Copyright 2014, Neudesic. All rights reserved.

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

HPC technology and future architecture

LDIF - Linked Data Integration Framework

EL Program: Smart Manufacturing Systems Design and Analysis

HadoopRDF : A Scalable RDF Data Analysis System

How To Handle Big Data With A Data Scientist

Big Data, Fast Data, Complex Data. Jans Aasman Franz Inc

E6895 Advanced Big Data Analytics Lecture 4:! Data Store

Big Data Challenges and Success Factors. Deloitte Analytics Your data, inside out

Big Data: Overview and Roadmap eglobaltech. All rights reserved.

BIG DATA TRENDS AND TECHNOLOGIES

The 3 questions to ask yourself about BIG DATA

Big Data Technologies Compared June 2014

Intel HPC Distribution for Apache Hadoop* Software including Intel Enterprise Edition for Lustre* Software. SC13, November, 2013

Monitis Project Proposals for AUA. September 2014, Yerevan, Armenia

Big Data: Study in Structured and Unstructured Data

Big Data Executive Survey

The Lab and The Factory

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

CAP4773/CIS6930 Projects in Data Science, Fall 2014 [Review] Overview of Data Science

Expanding Uniformance. Driving Digital Intelligence through Unified Data, Analytics, and Visualization

Big Data Analytics - Accelerated. stream-horizon.com

Vision Paper Distributed Data Mining and Big Data

Transcription:

Industry 4.0 and Big Data Marek Obitko, mobitko@ra.rockwell.com Senior Research Engineer 03/25/2015 PUBLIC PUBLIC - 5058-CO900H

2 Background Joint work with Czech Institute of Informatics, Robotics and Cybernetics Big Data related topics investigated in RA-DIC laboratory within CIIRC Goal of the effort: Semantic Big Data Historian

3 Agenda Overview of related trends Industry 4.0 Big Data Semantics Semantic Big Data Historian Architecture Use Case Outlook Conclusion

4 Agenda Overview of related trends Industry 4.0 Big Data Semantics Semantic Big Data Historian Architecture Use Case Outlook Conclusion

5 Industry 4.0 Fourth Industrial Revolution Predicted a-priori, not observed ex-post Economic impact predicted to be huge Operational effectiveness, new business models, services and products Clear definition not provided Usually: vision, basic technologies, selected scenarios Design principles Interoperability, virtualization, decentralization, real-time capability, service orientation, and modularity

6 Industry 4.0 Components Primary components Cyber-physical systems Fusion of physical and virtual world integration of computation and physical processes Features: unique identification RFID tags, centralized storage and analytics, multiple sensors and actuators, network compatible Example: virtual battery a battery in electric car has its virtual counterpart updated in real time, which allows diagnostics, simulation, prediction etc. for better customer experience Internet of Things Network of physical systems that are uniquely identified and can interact to reach common goals Example: Smart Homes connected devices (temperature sensor, heating, mobile phone) Internet of Services Offering services via Internet so that they can be offered and combined into value-added services by various suppliers Example: forming virtual production technologies and capabilities Smart Factory often mentioned as a key feature of Industry 4.0 Information coming from physical and virtual world used to provide context and assistance for people and machines to execute their tasks in a better way Example: demand driven production, intelligent work piece carriers Other also related components: Smart product, Machine to machine (M2M), Big Data, Cloud

7 Agenda Overview of related trends Industry 4.0 Big Data Semantics Semantic Big Data Historian Architecture Use Case Outlook Conclusion

8 Big Data Motivation A CPG (consumer packaged goods) company generates 5,000 data samples every 33 milliseconds This corresponds to 70TB per year Can we meaningfully use such amount of data? Big Data dataset that is growing so that it becomes difficult to manage it using existing database management concepts and tools 3Vs Volume, Velocity, Variety

9 Big Data Volume data will grow 50 times by 2020 FB 50PB Velocity storing and getting data fraud detection Variety unstructured, 90% of new data videos Applications Online marketing targeting products based on user clickstream (Google, Amazon, Netflix ) Medicine, biology, chemistry data analysis Technologies Map-Reduce framework, introduced by Google Running on cheap machines in parallel in clusters (splitting data) implemented in e.g. Apache Hadoop It s about variety, not volume The Big is not the main problem, focus on heterogeneous data integration new analytic applications based on data that were not tracked so far

10 Agenda Overview of related trends Industry 4.0 Big Data Semantics Semantic Big Data Historian Architecture Use Case Outlook Conclusion

11 Semantics Linked Data / Semantic Web (machine processable data) Tens of RDF Gtriples on web Resource Description Framework Resources uniquely identified by URI Triples subject property object In fact relations between objects, values of properties Together forming RDF graph(s) Web Ontology Language Ontology specifies the conceptualization In fact description of vocabulary, constraints, attaches meaning to identifiers Designed for internet and web And so also usable for Internet of Things, Internet of Services etc. Inherently distributed approach, integration of data from heterogeneous and unreliable data sources

12 Agenda Overview of related trends Industry 4.0 Big Data Semantics Semantic Big Data Historian Architecture Use Case Outlook Conclusion

13 Plant Data Processing Traditional Historian Time series data collection, focus on fast scan rate Analyzing data What the ph was at 2:34:56 PM March 15, 2015 Not a problem, single retrieval, unless there is a problem with volume What the ph trend was from 1 to 7 PM of March 15, 2013, plus compare it to previous similar weekdays, holidays, after it rained, when different suppliers were used etc. Not easily possible in historians available today, especially for large scale data Samples of needed data processing Pattern recognition, pattern matching Predictive maintenance Benchmarking of KPIs Clustering similar machines Real time statistics / analytics / reporting

14 Semantic Big Data Historian Vision, currently being implemented to verify the technologies Collecting data from sensors Architecture based on OPC UA Sensors semantically described All data processed using Semantic Web languages and technologies allows linking data together Data stored in Hadoop Analyzing data Querying using SPARQL (RDF querying language) More complex queries implemented directly in Map-Reduce framework

15 Description of sensors and data Ontology building on top of SSN Semantic Sensor Network Ontology (W3C effort) Ontology describes Sensors Observations, including physical units, time, data quality etc. Data expressed using the ontology Particular observations All data linked together Directly stored as RDF triples

16 Agenda Overview of related trends Industry 4.0 Big Data Semantics Semantic Big Data Historian Architecture Use Case Outlook Conclusion

17 Case study data from passive house Our goal: evaluate the suitability of proposed technologies, scalability etc. Data focus: indoor air quality Environmental parameters: Temperature, Carbon dioxide concentration, Relative humidity, Air pressure Sample analysis tasks Relaxation time of the house Impact of sunlight on indoor temperature Detection of people inside

18 Case study data from passive house Raw data conversion to RDF to be stored to triple store

19 Case study data from passive house Sample task detection of people inside Time series processing of CO 2 data Values in sliding window, comparing with threshold Verified the results by comparing with people occupancy list Main result Data not really very big, however, reaching the limits of MATLAB package Map-Reduce implementation in Hadoop (both pre-processing and detection) much faster than in MATLAB The task proved the advantage of Hadoop implementation scalability

20 Agenda Overview of related trends Industry 4.0 Big Data Semantics Semantic Big Data Historian Architecture Use Case Outlook Conclusion

21 Outlook Semantic Big Data Historian overall goal: Semantic: connect data together Provide semantic description in the endpoints, connect to OPC UA and let the Historian to connect the data appropriately Big Data: be able to work with larger volume of data Historian Using Map-Reduce and similar frameworks to store, retrieve and analyze larger volume of heterogeneous data Focus on time-series data, however be able to also include other types of data E.g., information about suppliers, orders, shifts, various annotations etc. Achieve analytics that was not possible without current technologies Also connect to actions in physical world, not only ad-hoc analysis

22 Agenda Overview of related trends Industry 4.0 Big Data Semantics Semantic Big Data Historian Architecture Use Case Outlook Conclusion

23 Conclusion Industry 4.0 fusion of physical and virtual world, network of physical systems that interact to reach common goals, integration of services, smart devices, homes, factories, Big Data and Semantics prerequisite for processing large volume of heterogeneous data Semantic Big Data Historian The goal is to provide advanced analytics on plant heterogeneous data, in the scale that was not possible until now Demonstrated the Hadoop scalability Demonstrated Semantic Web suitability for data integration Next steps include advanced data analysis Industry 4.0 both distributed and centralized approaches needed Small scale (M2M) versus large scale (cloud) data processing

Thank you! Questions? Contact: mobitko@ra.rockwell.com PUBLIC www.rockwellautomation.com PUBLIC - 5058-CO900H