Anomaly Detection and Predictive Maintenance



Similar documents
Anomaly Detection in Predictive Maintenance

Anomaly Detection in Predictive Maintenance Time Alignment and Visualization

Environmental Remote Sensing GEOG 2021

Data and Machine Architecture for the Data Science Lab Workflow Development, Testing, and Production for Model Training, Evaluation, and Deployment

Creating Usable Customer Intelligence from Social Media Data:

PdM Overview. Predictive Maintenance Services

Using multiple models: Bagging, Boosting, Ensembles, Forests

Big Data, Smart Energy, and Predictive Analytics Time Series Prediction of Smart Energy Data

Taming the Internet of Things with KNIME Data Enrichement, Visualization, Time Series Analysis, and Optimization

Geo-Localization of KNIME Downloads

KNIME TUTORIAL. Anna Monreale KDD-Lab, University of Pisa

OMBU ENTERPRISES, LLC. Process Metrics. 3 Forest Ave. Swanzey, NH Phone: Fax: OmbuEnterprises@msn.

Is a Data Scientist the New Quant? Stuart Kozola MathWorks

Azure Machine Learning, SQL Data Mining and R

Ensembles and PMML in KNIME

What s Cooking in KNIME

SureSense Software Suite Overview

TIETS34 Seminar: Data Mining on Biometric identification

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

The Scientific Data Mining Process

Using Data Mining for Mobile Communication Clustering and Characterization

Calculation of Minimum Distances. Minimum Distance to Means. Σi i = 1

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Time Domain and Frequency Domain Techniques For Multi Shaker Time Waveform Replication

: Introduction to Machine Learning Dr. Rita Osadchy

Machine Learning with MATLAB David Willingham Application Engineer

AT&T Global Network Client for Windows Product Support Matrix January 29, 2015

Hadoop s Advantages for! Machine! Learning and. Predictive! Analytics. Webinar will begin shortly. Presented by Hortonworks & Zementis

SPC Data Visualization of Seasonal and Financial Data Using JMP WHITE PAPER

Technical Information

Data Mining and Visualization

Predictive Analytics for Logistics to Increase Man & Machine Efficiency

Dan French Founder & CEO, Consider Solutions

COC131 Data Mining - Clustering

Predictive Modeling Techniques in Insurance

Introduction to Exploratory Data Analysis

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

A Regime-Switching Model for Electricity Spot Prices. Gero Schindlmayr EnBW Trading GmbH

A comparison of radio direction-finding technologies. Paul Denisowski, Applications Engineer Rohde & Schwarz

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

The accurate calibration of all detectors is crucial for the subsequent data

CI6227: Data Mining. Lesson 11b: Ensemble Learning. Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore.

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Machine Learning and Data Mining. Regression Problem. (adapted from) Prof. Alexander Ihler

Based on Chapter 11, Excel 2007 Dashboards & Reports (Alexander) and Create Dynamic Charts in Microsoft Office Excel 2007 and Beyond (Scheck)

PeakVue Analysis for Antifriction Bearing Fault Detection

ENERGY STAR for Data Centers

Anomaly detection. Problem motivation. Machine Learning

Introduction to Data Mining

The Big Data Paradigm Shift. Insight Through Automation

Big Data: Rethinking Text Visualization

Classification and Prediction

A Case of Study on Hadoop Benchmark Behavior Modeling Using ALOJA-ML

Adaptive feature selection for rolling bearing condition monitoring

BIDM Project. Predicting the contract type for IT/ITES outsourcing contracts

Statistical Distributions in Astronomy

WHITE PAPER ON. Operational Analytics. HTC Global Services Inc. Do not copy or distribute.

Data Mining & Data Stream Mining Open Source Tools

Intelligent Process Management & Process Visualization. TAProViz 2014 workshop. Presenter: Dafna Levy

Automatic Labeling of Lane Markings for Autonomous Vehicles

CS Data Science and Visualization Spring 2016

CHAPTER 6 TEXTURE ANIMATION

Decision Trees from large Databases: SLIQ

Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs

Visualization methods for patent data

MACHINE LEARNING IN HIGH ENERGY PHYSICS

Credit Card Fraud Detection Using Self Organised Map

Hierarchical Clustering Analysis

Location matters. 3 techniques to incorporate geo-spatial effects in one's predictive model

Pallas Ludens. We inject human intelligence precisely where automation fails. Jonas Andrulis, Daniel Kondermann

Technical Report. The KNIME Text Processing Feature:

Comparison of Non-linear Dimensionality Reduction Techniques for Classification with Gene Expression Microarray Data

Qi Liu Rutgers Business School ISACA New York 2013

Stuart Gillen. Principal Marketing Manger. National Instruments ni.com

ANALYTICS IN BIG DATA ERA

Non-negative Matrix Factorization (NMF) in Semi-supervised Learning Reducing Dimension and Maintaining Meaning

THIS paper reports some results of a research, which aims to investigate the

Advanced analytics at your hands

OUTLIER ANALYSIS. Data Mining 1

Visual-based ID Verification by Signature Tracking

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

Active Vibration Isolation of an Unbalanced Machine Spindle

Supervised Classification workflow in ENVI 4.8 using WorldView-2 imagery

Visualization and Big Data in Official Statistics

Vibration Analysis Services. FES Systems Inc. 1

Sense Making in an IOT World: Sensor Data Analysis with Deep Learning

Text Analytics using High Performance SAS Text Miner

Framing Business Problems as Data Mining Problems

Easily Identify Your Best Customers

Learning Example. Machine learning and our focus. Another Example. An example: data (loan application) The data and the goal

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

How To Perform An Ensemble Analysis

Vision based Vehicle Tracking using a high angle camera

Application of Predictive Analytics for Better Alignment of Business and IT

A Demonstration of a Robust Context Classification System (CCS) and its Context ToolChain (CTC)

Transcription:

Anomaly Detection and Predictive Maintenance Rosaria Silipo Iris Adae Christian Dietz Phil Winters Rosaria.Silipo@knime.com Iris.Adae@uni-konstanz.de Christian.Dietz@uni-konstanz.de Phil.Winters@knime.com KNIME.com AG, Zurich, Switzerland www.knime.com @KNIME Copyright 20145KNIME.com AG

KNIME UGM 2012 Dr. Killian Thiel Dr. Tobias Kötter Phil Winters Text Mining Meets Network Mining Text Mining for Sentiment Network Mining for + Relevance = 2

KNIME UGM 2013 Time Series + Machine Learning + Big Data Rosaria Silipo + Phil Winters Manufacturing Chemical Life Science Transportation Utilities Automotive Cyber Security The Irish Energy Trials 3

KNIME UGM 2014: The Internet of Things 16 th and Harvard Bike sources Bike Sinks Union Station Topology / Elevations Street Maps Commuters Weather Holiday Schedules Tourists Rosaria Silipo Aaron Hart Phil Winters

White Papers, Public Data and the KNIME Workflows! 5

KNIME UGM 2015: The Hot Topics. Intrusion Detection / Prevention Fraud Detection / Prevention Fault Detection / Prevention Failure Detection / Prevention Health Monitoring / Early Warning Eco-System Disturbances / Prediction Preventive Maintenance Predictive Maintenance Reliability Analysis Anticipatory Failure Determination Guided Error Prevention 6

Anomaly Detection Determination Early Reliability Warning Failure Prevention Error Maintenance Preventive Anticipatory Analysis Health EcoSystem Fault Fraud Guided Predictive Monitoring Prediction Intrusion Disturbances The ideal prevention would predict all failure before it occurs 7

Anomaly Detection: The Approaches 1. Supervised Anomaly Detection. A group of patterns are labelled as anomalies and we need to find them. This is just a classification problem where one of the classes is named anomaly. 2. Static Unsupervised Anomaly Detection. There are a number of labelled pattern classes and suddenly a weird unrecognized outlier pattern shape shows up. Like an unknown heartbeat during an ECG. Either benign or worrisome, an alarm must be triggered.

Anomaly Detection: The Approach 3. Dynamic Unsupervised Anomaly Detection. Here some measures change over time till their values are not normal anymore. For example, while a motor is slowly deteriorating, one of the measurements might change till it gets out of control and the motor breaks. We want to stop the motor before it completely breaks producing even more damages. This problem is similar to number 2 but slightly more challenging because it is not pattern based and it changes slowly over time.

Dynamic Unsupervised Anomaly Detection Applications 10

The Fourier Transform Every signal can be filtered into a series of circular paths: How Big? (amplitude) How Fast? (Frequency) Where to Start (Phase Angle) 11

The Real Challenge.

Until Now! A Motor and its Sensors 28 time series from 28 sensors on 8 different parts of a mechanical engine. A1 (input shaft vertical) A2 (second shaft horizontal upper bearing) A3 (third shaft horizontal lower bearing) A4 (internal gear 275 degrees) A5 (internal gear 190,5 degree) A6 (input shaft bearing 150) A7 (input shaft bearing 151) M1 (torque KnM)

KNIME in Action 14

The Data Time Series are FFT-derived Spectral Amplitudes There is only one motor breakdown episodes on July 21, 2008 The breakdown is visible only from some sensors and only in some frequency bands The engine was substituted with a new one after the breakdown

Align Time Series Copyright 2014 KNIME.com AG 16

Combine all Spectral Time Series Data Cleaning and DateTime Conversion Frequency Binning Average Spectral Amplitude on Frequency Bins vs. Date

Pivoting Average Spectral Amplitude by Day and Frequency Bin 01.01.2007 02.01.2007 time [0-100] [100-200] [200-300] [1100-1200] [> 1200] Hz 18

Time Alignment: Column Merger Node Outer join of all column values Union of datetime values in one single column

Data Visualization Copyright 2014 KNIME.com AG 20

Data Visualization: Time Plots by Frequency Bands A1-SV3 [0, 100] Hz Breaking point July 21, 2008 A1-SV3 [500, 600] Hz Old motor piece New motor piece Only some Spectral Time Series shows the break down

Data Visualization: HeatMap Heatmap of spectrum amplitudes by frequency bands for A1 SV3 sensor signal Frequency bands [200,300] Hz and [500,600] Hz show the breaking point in red

Scatter Matrix Jan.01.2007 Aug.31.2007 Sep.01.2008 Jul.21.2008 Jul.22.2008 Apr.20.2009 23

Data Visualization: Auto-Correlation Map A1-SV3 [300-400]Hz signal 01.Jan.2007 31.Aug.2007 A1-SV3 [300-400]Hz signal 01.Sep.2007 20.Jul.2008 Correlation with past values changes in Time!

Data Visualization: Correlation Map A1-SV3 signal 01.Jan.2007 31.Aug.2007 A1-SV3 signal 01.Sep.2007 20.Jul.2008 Correlation across Frequency Bands changes in Time!

Data Analytics Copyright 2014 KNIME.com AG 26

The Approach Read all data Spectral amplitudes on frequency bins vs. time Define normal Learn normal values and patterns Detect alarming situations Export models as PMML

Learn normal to recognize what is not We only have normal measures/patterns available till the system starts failing We need to train a machine learning algorithm to predict/classify those normal patterns/measures We then use the machine learning algorithm s uncertainty to fire an alarm on what might not be normal

Learn normal : Training Set A1-SV3 [0, 100] Hz 31 August 2007 Predictive Maintenance Breaking point July 21, 2008 A1-SV3 [500, 600] Hz Training Set Only some Spectral Time Series shows the break down

Time Series Prediction Copyright 2014 KNIME.com AG 30

Learn normal : Input and Target Features input Time series prediction target

Time Series Training: Learn what is normal On each Frequency Band, on each Sensor Signal, on each Motor Part: Train a Linear Auto-Regressive Model (Lag = 10, no seasonality) 595 AR Models

Time Series Training: Learn what is normal 10 past values Last value for missing values Train 595 AR Models And Write to PMML files 33

Time Series Production: Linear Auto-Regression Prediction 10 past values Last value for missing values Prediction, Prediction errors, First level alarms A7, SA1 [200,300]Hz Sep 2007 Original signal Apr 15 2008 Jul 21 2008 Prediction Apr 2009 34

KNIME in Action 35

Signal Execution: Part 1 Copyright 2014 KNIME.com AG 36

Time Series Production: Recognize what is not normal On each Frequency Band, Sensor Motor part: Prediction Error PE(t) as: PE(t) = abs(prediction(t) x(t)) First Level Alarm IF PE(t) > Mean(PE(t)) + K*stddev(MA(t)) => alm(t) = PE(t) ELSE alm(t) = 0 MA(t) = Moving Average(alm(t), N=21, backward window) Second Level Alarm IF MA(t) > 0.01 => Alarm

MA(alm(t)) Time Series Production: Second Level Alarms If alarm => send email May 05 2008 A7, SA1 [200,300]Hz Mar 06 2008 Jul 2008 Sep 2007 A7, SA1 [400,500]Hz A7, SA1 [300,400]Hz

MA(alm(t)) Time Series Production: Second Level Alarms R Stacked Plot May 05 2008 A7, SA1 [200,300]Hz Mar 06 2008 Sep 2007 Jul 2008

Signal Execution: Part 2 Copyright 2014 KNIME.com AG 40

Time Series Production: Use KNIME to Send Alarms!!! Using RESTful Services: 41

KNIME in Action 42

But we can go Further! Pattern Recognition input Time series prediction target Input/target Pattern Recognition

KNIME 44

Pattern Recognition Iris Adae Copyright 2014 KNIME.com AG 45

Recognize what is not normal Pattern Recognition Output Probability / Confidence < threshold => Alarm Auto-associator pattern reproduction error > threshold => Alarm

Pattern Recognition: General Windowing Scheme Model the system with data from the past Use the system with most recent data point Time Single observation on multiple frequency bands 01.Jan.2007 31.Aug.2007 (as normality) 01.Sep.2007 20.Jul.2008 (check for anomaly) 47

Pattern Recognition: Cluster Based Fuzzy c-means -> clusters Fuzzy c-means also generates an outlier cluster If recent data point is in outlier cluster -> Anomaly 48

Pattern Recognition: Cluster Based Results Results: Fuzzy Cluster Apr 14 2008 Jul 21 2008 49

Pattern Recognition: Similarity Based Similarity Search -> nearest neighbor to current pattern If no nearest neighbor -> Anomaly 50

Pattern Recognition: Similarity Based Apr 14 2008 Jul 21 2008 51

KNIME in Action 52

But KNIME can do even more!!! Slow motion cameras can now record 4,400,000,000,000 frames per Second We don t have images of the motor turning but what if we did? 53

Image Processing Christian Dietz Copyright 2014 KNIME.com AG 54

The Dataset Data from DAGM 2007 Competition (data sponsored by Robert Bosch Corporate Research department, Schwieberdingen, Germany) Goal: Find defects in images with minimal user-interaction Dataset contains about 1100 images. 55

What s a Defect? 56

How to detect them in KNIME? 57

Reading Images 58

Calculating Features For each pixel we calculate features in the neighborhood of the pixel. Examples: Intensity Statistics (Mean, Variance, StdDev, etc), Texture, Computational expensive step: We are working on a faster version. 59

Annotate Training Data Defect Not a defect 60

Learn a Model Given the training data we can train a model to distinguish defect pixels from not defect pixels. Model: Random Forest Additional: We determine the average size of the defects for pruning too small and too big defects 61

Detecting Defects Defect Defect Defect 62

Detecting Defects: pass the new column back to learning!

KNIME in Action 64

Possible Next Steps.. Random Forrest and Threshold Detection Combining the Models Text Processing Big Data 65

Thank You We will announce White Paper availability! Questions: http://tech.knime.org/forum Rosaria.Silipo@knime.com Phil.Winters@knime.com 60k Follow 40k us: Twitter: @KNIME 20k LinkedIn: https://www.linkedin.com/groups?gid=2212172 KNIME Blog: http://www.knime.org/blog 66