Big Data within Automotive Testing



Similar documents
Analysis of Large Scale Data Volumes

Product Verification & Validation Management

ASAM ODS, Peak ODS Server and openmdm as a company-wide information hub for test and simulation data. Peak Solution GmbH, Nuremberg

ASAM ODS Workflow in the area of vehicle safety with openmdm

Avalon Anwendertreffen 2015

openmdm an Open Source Platform for Measured Data Management Dr. Dietmar Rapf, Michael Schwarzbach

Mastering the flood of NVH data through ASAM-ODS and MDM

openmdm an Open Source Platform for Measured Data Management Dr. Dietmar Rapf, Michael Schwarzbach

The MDM (Measurement Data Management) system environment

Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2

Bringing Big Data Modelling into the Hands of Domain Experts

A Service for Data-Intensive Computations on Virtual Clusters

Chapter 7. Using Hadoop Cluster and MapReduce

A Distributed Approach to Business Intelligence Systems Synchronization

Backing up to the Cloud

MDM as Service Component

MeDaMAk. A measurement data management system based on the openmdm framework. Christian Rechner EPOS CAT GmbH

NETWRIX EVENT LOG MANAGER

SC-T35/SC-T45/SC-T46/SC-T47 ViewSonic Device Manager User Guide

Scenario of using workflows in Measurement Data Management (MDM) that are based on the ASAM ODS Workflow Application Model

Content Management Playout Encryption Broadcast Internet. Content Management Services

SIPAC. Signals and Data Identification, Processing, Analysis, and Classification

Powerful Management of Financial Big Data

Leveraging the Power of SOLR with SPARK. Johannes Weigend QAware GmbH Germany pache Big Data Europe September 2015

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Implementing a Digital Video Archive Based on XenData Software

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Leveraging BlobSeer to boost up the deployment and execution of Hadoop applications in Nimbus cloud environments on Grid 5000

Oracle Big Data SQL Technical Update

R12 Oracle Purchasing Fundamentals

Business Process Management

Data processing goes big

NETWRIX CHANGE NOTIFIER

Implementing an Automated Digital Video Archive Based on the Video Edition of XenData Software

Hadoop IST 734 SS CHUNG

MaDaM the web-based Measurement Data Management for Big Data. by Dr. Bernhard Sünder, Managing Director, AMS GmbH

Microsoft Windows Server Update Services Questions & Answers About The Product

Active Directory Management. Agent Deployment Guide

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

2012 LABVANTAGE Solutions, Inc. All Rights Reserved.

Big Data - Infrastructure Considerations

Running a typical ROOT HEP analysis on Hadoop/MapReduce. Stefano Alberto Russo Michele Pinamonti Marina Cobal

A new dimension of sound and vibration analysis

How to Setup SQL Server Replication

Testing 3Vs (Volume, Variety and Velocity) of Big Data

Company wide Testdata Postprocessing Solutions

Hadoop on Windows Azure: Hive vs. JavaScript for Processing Big Data

Data Refinery with Big Data Aspects

International Journal of Advancements in Research & Technology, Volume 3, Issue 2, February ISSN

Cloud Computing at Google. Architecture

Ignify ecommerce. Item Requirements Notes

HDFS Cluster Installation Automation for TupleWare

Hadoop and Map-Reduce. Swati Gore

Architecting for the next generation of Big Data Hortonworks HDP 2.0 on Red Hat Enterprise Linux 6 with OpenJDK 7

SharePoint Training. Yes-M Systems LLC. Length: Hours Course

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

PINsafe Multifactor Authentication Solution. Technical White Paper

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

Introducing Oracle Exalytics In-Memory Machine

A new innovation to protect, share, and distribute healthcare data

Sector vs. Hadoop. A Brief Comparison Between the Two Systems

ORACLE BUSINESS INTELLIGENCE WORKSHOP. Prerequisites for Oracle BI Workshop

FTP-Stream Data Sheet

Conversion of Measurement Data from MDF to ATFX

Getting Started with Hadoop. Raanan Dagan Paul Tibaldi

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

Fax and Oracle Collaboration Suite. An Oracle White Paper August 2005

Managing Large Imagery Databases via the Web

Regular data supply and partner data management in the cloud

2015 The MathWorks, Inc. 1

Sharepoint vs. inforouter

Implementing a Digital Video Archive Using XenData Software and a Spectra Logic Archive

Hadoop s Entry into the Traditional Analytical DBMS Market. Daniel Abadi Yale University August 3 rd, 2010

Endeca Introduction to Big Data Analytics

BIG DATA USING HADOOP

The Access Engine. Building Integration System - The Access Engine. Security Systems

THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES

Big Data Systems CS 5965/6965 FALL 2014

ACCESS INTELLIGENCE. an intelligent step beyond Access Management. White Paper

NoSQL for SQL Professionals William McKnight

BIG DATA HADOOP TRAINING

wu.cloud: Insights Gained from Operating a Private Cloud System

Manage Oracle Database Users and Roles Centrally in Active Directory or Sun Directory. Overview August 2008

Simulation Data Management with Interoperability across domains

Tackling Big Data with MATLAB Adam Filion Application Engineer MathWorks, Inc.

Can High-Performance Interconnects Benefit Memcached and Hadoop?

HADOOP ADMINISTATION AND DEVELOPMENT TRAINING CURRICULUM

Oracle Platform as a Service and Infrastructure as a Service Public Cloud Service Descriptions-Metered & Non-Metered.

Big Data on Microsoft Platform

Local server VS Cloud service A real scenario

An Oracle White Paper January Using Oracle's StorageTek Search Accelerator

Microsoft Big Data Solutions. Anar Taghiyev P-TSP

ISTEC.MIP Measurement Data Integration Platform

Data Domain Profiling and Data Masking for Hadoop

Transcription:

Big Data within Automotive Testing Bringing Analysis to the Data ASAM US Workshop, Novi Dr. Ralf Nörenberg ralf.noerenberg@highqsoft.de

Bringing Analysis to the data Content 1 2 3 4 5 Disclaimer and Big Data ODS (Open Data Service) Overview Prerequisites to ODS State-of-Today: Big ODS Projects (Large Scale Data) State-of-Tomorrow: Integration of Big Data into ODS (BIG ODS)

Disclaimer What is this presentation about? An abstract view on big data technologies & objectives A view on core objectives and limitations within automotive testing industries Approaches on technology transfer into the domain Current big-data industry use-cases

Big Data Overview What are the objectives of Big Data? Objective: Capacities Objectives: On-Demand Results Data size [mb] Current capacities Big Data capacities (Access) / Analysis rate [mb/s] Big Data 1. Integration of various data (types) 2. Handling of variable data and content 3. Quick adaption to new scenarios and individual case analysis 4. Iterative and explorative analyses in reaction to missing or changing requirements 5. Optimized for flexibility Big Data (Storage) enables Data Mining (Access and Analysis) Explore huge amounts of data from various systems and domains to derive models / patterns / results. We speak of a transformation of data into information / knowledge.

Big Data Overview What s the challenge with Big Data? Huge amount of data to be stored Exponential Growth Rate Challenges in Access and Analysis Volume (Scale of Data) Velocity (Analysis of Data) Variety (Different forms of data) Veracity (Uncertainty of data) How do we store the data (data structure / semantics)? How do we provide (enterprise) scalability? We ve got dependencies to specific Use-Cases!

Bringing Analysis to the data Content 1 2 3 4 5 Disclaimer and Big Data ODS (Open Data Service) Overview Prerequisites to ODS State-of-Today: Big ODS Projects State-of-Tomorrow: Integration of Big Data into ODS (BIG ODS)

ODS (Open Data Service) Overview What is ODS? The Open Data Service (ODS). is a generic definition of measurement related objects and its relations covers all domains Noise Vibration Harshness Engine Road Load Crash 1. Base Model: Standardized data 2. Application Model: specific derivations of the base model are allowed 3. ATFx Files: exchange data format for data exchange Organization / Description Measurement Physics Flexibility Application Model Test Test Object Test Environment Test Sequence Measurement MeasurementQuantity Submatrix LocalColumn ExternalComponent AoFile Units Physical Dimensions Quantities Any ParameterSet Parameter

ODS (Open Data Service) Overview The challenges in ODS [risks & costs] today automated analysis / Large Scale Data & Big Data 3 rd -party tool integration / analyses frameworks / web integration availability of data / long-term storage and archiving dependencies on tool suppliers / formats number of suppliers and tools [year] 1990 2000 2010 2020 Single test stands with proprietary data formats Locally networked and automated test stands Globally networked and automated test stands Locally networked, virtual and reality test stands

Big Data vs. ODS A comparison Big Data ODS 1. Un- / Semi-structured Data 2. Handling of variable data and content 3. Quick adaption to new scenarios and individual case analysis 4. Iterative and explorative analyses in reaction to missing or changing requirements 5. Optimized for flexibility 1. Structured Data 2. Focus on high value data 3. Identification of repeatable operation and processes 4. Definite requirements of specialists 5. Optimized for quick access and given analyses (which depend on data structure) Find a BIG ODS solution technology wise: Big Data technologies are the wheels, we need a tires (includes use cases).

Bringing Analysis to the data Content 1 2 3 4 5 Disclaimer and Big Data ODS (Open Data Service) Overview Prerequisites to ODS State-of-Today: Big ODS Projects (Large Scale Data) State-of-Tomorrow: Integration of Big Data into ODS (BIG ODS)

ODS System Solutions General use-cases and facts (keep in mind) 1. ODS data is only beginning to have distributed locations One location, (hopefully) minimal data redundancy 2. Test stands generate same data for a long period of time Changes are of structural nature (new format) Flexibility within data schema rarely required (e.g. new channels) 3. Established file formats like MDF4.x for storage and analysis Data is itemized by ODS for access & search (meta data) File is generally required in whole (third party tool) 4. Analysis is done locally by local experts very specific and complex (vs. string based or analysis on semistructured data) 5. ODS needs a single point of access web access primarily targets at meta data in Oracle v4 mdf v5

ODS System Solutions Definining the use-case is essential STEP 1: Business and Data Understanding What do I want to do with my data? Derive business cases and identify data available. STEP 2: Data Organization & Preparation How do I store my data? Data Semantics, Validation, Data Organization / Structure Iterative Process and an estimated 80% of the whole project effort. STEP 3: Data Analysis Creation of models and analysis programs and its configuration. STEP 4: Evaluation of Analysis Results Evaluation, Comparison and Optimization of Analysis Results STEP 5: Integration of all steps into an environment Evaluation, Comparison and Optimization of Analysis Results

ODS System Solutions Data Semantics A broad structure in the application model allows specific result-sets for searching (bound on use-case) allows all-in-depth analysis functionality This topic is controversially discussed in the Europe / US. Data models here tend to store more information (than needed) into AoAny / AoParameterSets / AoParameter. This results in a loss of data semantics and access capabilities (leads to the necessity of an indexer).

Bringing Analysis to the data Content 1 2 3 4 5 Disclaimer and Big Data ODS (Open Data Service) Overview Prerequisites to ODS State-of-Today: Big ODS Projects (Large Scale Data) State-of-Tomorrow: Integration of Big Data into ODS (BIG ODS)

State-of-Today: Big ODS Projects (I) Integration of an Indexer (Elastic Search) The project stores data of a number of test stands. Web-Server ODS API ODS Server Oracle The project is based on a regular ODS setup: centralized Web-Server access for users Oracle and ODS Server on same hardware File Servers linked by FILE_MODE=MULTI_VOLUME A web interface allows the user to look up measurements and add information to ODS according to the project use-cases. Big Data Use-Case (Access): On-the-fly meta data search (auto-complete) Full-text meta data search ( Live Mass-Data Search") File Server

State-of-Today: Big ODS Projects (I) Integration of an Indexer (Elastic Search) Web-Server: Decided which calls are directed to the server (write and mass data access) and which calls go to the indexer (meta data read access) and handles security for the latter. security Web-Server ODS API use-case Elastic Search Indexer Elastic Search Indexer: The indexer allows on demand response for the structure used (company use-cases). The indexer is synchronized with ODS. ODS Server synchronization Event Processor Notification Server Event Processor: The event processor acknowledges notifications, schedules and executes the task of applying the change to Elastic Search. Oracle File Server Notification Server: A Notification is generated based on import / any server event / user action /, therefore covers any event of change.

State-of-Today: Big ODS Projects (I) In Numbers Project Data (processed) Oracle Files Data-Root Mixed-Mode (BTF) Files on File Server (last 1-2 years) Files Volume Files on tape archive (5 years before that) Files Volume ~10 1.216.610 1.700 43.000.000 4.000 60.000.000 6.000 GB Files GB Files GB Files GB Data in Oracle rather small, mass-data is stored externally Data Volume strongly increasing 1 ODS server instance and adequate IT infrastructure (file server/network) Analysis is mostly done manually (per file download) Currently the project will be enhanced to automated analysis functionality.

State-of-Today: Big ODS Projects (II) Vehicle Fleet Testing (planned tests) This road-load data project is set up in an environment of a car manufacture who has all vehicle fleet testing data (and most activities) centralized within one project. The scope of this projects contains many customers (all internal development departments) all passenger vehicle types and almost all ECUs available all domains and data types over 500 testing vehicles to be managed Within ODS, this is the upmost state-of-the-art system to manage the test execution flow and standardized measurement data.

State-of-Today: Big ODS Projects (II) Integration of Automated Analysis Web-Server: The web server allows access to the data and analysis results based on the role of the employee. Web-Server ODS API Analysis Server Analysis Server: The analysis server is a framework to execute a measurement with an analysis program (JAVA, Matlab, DIAdem, ) to produce a result (measurement, report, email, ). The analysis programs are defined by specialists and may be put into an order. ODS Server Notification Server Oracle File Server Notification Server: A Notification is generated based on import event / user action and reports measurements/tests based on pre-configured MIME-types and analysis programs.

State-of-Today: Big ODS Projects (II) In Numbers Data Base Size (Oracle) Files on File Server(s) Files Volume Absolute growth Files in tape archive Files Volume Analysis Servers No. of Servers No. of Analyses ODS Servers Productive ODS Servers 900 39.346.529 5.400 1.5-1.8 58.462.941 7.700 1 210.000 3 GB Files GB GB per day Files GB in total per month in total Data volume in Oracle and on file server is very extensive. In terms of data volume, analysis rate and enterprise setup, this is one of the most beautiful ODS projects Project is state-of-the-art and meets performance requirements

Bringing Analysis to the data Content 1 2 3 4 5 Disclaimer and Big Data ODS (Open Data Service) Overview Prerequisites to ODS State-of-Today: Big ODS Projects (Large Scale Data) State-of-Tomorrow: Integration of Big Data into ODS (BIG ODS)

Big ODS Systems State-of-Tomorrow 1 2 Current ODS projects are manageable and performant and there s still room for improvement! 1. Integration of Web-Services for single-point of access (generic, specific work-flows) 2. Integration of Indexer Technology for quick data access (RESTfull user interface) 3 4 Web-Server ODS API Indexer 5 3. Integration of Hadoop database interface substitute transport protocols (Corba) 4. Integration of ODS server distributor ODS Server Analysis Server 5. Integration of Analysis Server Frameworks (distributor?) 6 Oracle 6. Integration of Big Data Storage (e.g. Hadoop) File Server

Big ODS Systems State-of-tomorrow: Conclusion 1 Current ODS projects are manageable and performant and there s still room for improvement! 3 4 1 2 Web-Server ODS API All Indexer this is great, but we do need: 1 5 1. Integration of Web-Services for single-point of access (generic, specific work-flows) 2. Integration of Indexer Technology for quick data access (RESTfull user interface) 3. Integration of Hadoop database interface BIG ODS use-cases Substitute needed! transport protocols (Corba) 4. Integration of ODS server distributor 6 Oracle ODS Server Analysis Server 5. Integration of (multiple) Analysis Server Frameworks (distributor?) 6. Oracle Adapter with Mass Data in Hadoop / Hadoop File Server

Big ODS Systems: A vision Road-Load Data from Vehicle Field Testing (unplanned) Car drives full day with random scenarios. ODS based (meta data) HDFS (mass data) Data is kept in ODS or ODS based big data technology as focus is on analysis. Everything is recorded (time and senor) and directly stored in binary data. Map-Reduce Jobs as analyses to allow Fleet Analysis, e.g. none-periodic events (particle filter burst) Identification of planned driving scenarios described and tested before???

Thank you! Any questions, suggestions and ideas? HighQSoft GmbH ralf.noerenberg@highqsoft.de