Search-based business intelligence and reverse data engineering with Apache Solr

Size: px
Start display at page:

Download "Search-based business intelligence and reverse data engineering with Apache Solr"

Transcription

1 Search-based business intelligence and reverse data engineering with Apache Solr M a r i o - L e a n d e r R e i m e r C h i e f T e c h n o l o g i s t

2 This talk will o Give a brief overview of the AIR system s architecture o Show reverse data engineering using Solr and MIR o Talk about the fight for our right to Solr o Describe solutions for the problem of combinatorial explosion o Outline a flexible and lightweight ETL approach for Solr Mario-Leander Reimer 2

3 A <<System>> AIR Central I <<Subsystem>> Spring Framework I <<Subsystem>> JEE 5 WS Clients A <<Ext. System>> AIR Bus I <<Ext. System>> Backend Systems Vehicles Service Bulletins Independent Workshop A <<Client>> Browser Search and Display Retrofits JSF Web UI REST API Protocoll Watchlist File Storage AIR DB Document Storage Mechanic A <<Anwendungscluster>> A <<Client>> Masterdata Documents AIR Repository AIR Client I <<Subsystem>> I <<Subsystem>> Measures Solr Access Apache Solr.NET WPF Solr Extensions Defects Maintenance Launch Parts Flat Rates 20 Languages 800 GB Solr Index Service Technician A <<Ext. System>> 3rd Party Application AIR Fork DLL AIR Call DLL Query A <<Application Cluster>> AIR Loader I <<Subsystem>> Spring Framework Vehicles Service Bulletins Load Execute A <<System>> AIR Control I <<Subsystem>> Jenkins Maintenance Defects A <<Ext. System>> 3rd Party ios App Flat Rates Documents AIR ios Lib Repair Overview... Backend Databases and Systems Mario-Leander Reimer 3

4 A <<Anwendungscluster>> AIR Repository 20 Languages 800 GB Solr Index I <<Subsystem>> Apache Solr Slave Solr Extensions Replicate I <<Subsystem>> Apache Solr Master Solr Extensions 20 Languages 800 GB Solr Index A <<Application Cluster>> AIR Loader A <<System>> AIR Control I <<Subsystem>> Spring Framework Load I <<Subsystem>> Jenkins Vehicles Service Bulletins Execute Maintenance Defects Flat Rates Repair Overview Documents... Backend Databases and Systems Search A <<System>> MIR Mario-Leander Reimer 4

5 Let s go back to when it all began Source: 5

6 The project vision: find the right information in less than 3 clicks. The situation: o Users had to use up to 7 different applications for their daily work. o Systems were not really integrated nicely. o Finding the correct information was laborious and error prone. The idea: o Combine the data into a consistent information network. o Make the information network and its data searchable and navigable. o Replace existing application with one easy to use application. Mario-Leander Reimer 6

7 But how do we find the originating system for the desired data? System A System B System C System D Where to find the vehicle data? Vehicle data 60 potential systems and 5000 entities. Other data Mario-Leander Reimer 7

8 And how do we find the hidden relations between the systems and their data? System A System B System C System D How is the data linked to each other? Customer Vehicle potential relations. Documents Other data Mario-Leander Reimer 8

9 Meta Information Research (MIR) Source: 9

10 MIR is a simple and lightweight data reverse engineering and analysis tool based on Solr. o MIR manages meta information about the source systems (the data models and record descriptions) o MIR allows to navigate and search in the metadata, you can drill into the metadata using facets o MIR also manages the target data model and Solr schema description A <<System>> Meta Information Research 25MB MIR User Interface Metadata Index I <<Subsystem>> Apache Solr MIR Loader Read Backend Databases and Systems Sources (Java, XML) Magic Draw MIR Generators Mario-Leander Reimer 10

11 Facetted drill down Wildcard queries Search results Tree view of systems, tables and attributes Found potential synonyms for the chassis number 11

12 Solr entities for each release Solr schema attributes EAT YOUR OWN DOG FOOD. The AIR Solr schema definition is modelled and defined within MIR. 12

13 def sourcegenerator = MIR + Solr + Maven; Mario-Leander Reimer 13

14 But Solr is a full text search engine. You have to use an Oracle DB for your application data! NO! 14

15 Some of the AIR requirements were... o Focus is on search. Transactions are not required. o High demands on request volume and performance. o Free navigation on data model and content. o Support for full text search and facetted search. o Offline capabilities. o Scalability from low-end device to server to cloud. Mario-Leander Reimer 15

16 Apache Solr outperformed Oracle significantly in query time as well as index size. SELECT * FROM VEHICLE WHERE VIN='V%' INFO_TYPE:VEHICLE AND VIN:V* 383 ms 384 ms 383 ms 038 ms 000 ms 000 ms SELECT * FROM VEHICLE WHERE VIN='%X%' INFO_TYPE:VEHICLE AND VIN:*X* 389 ms 387 ms 386 ms 092 ms 000 ms 000 ms SELECT * FROM MEASURE WHERE TEXT='engine' INFO_TYPE:MEASURE AND TEXT:engine 859 ms 379 ms 383 ms 039 ms 000 ms 000 ms Test data set: records Disk space: 132 MB Solr vs. 385 MB Oracle Mario-Leander Reimer 16

17 Dirt Race Use Case: o Low-end devices o No Internet 28. Source: September

18 Running Solr and AIR-2-Go on Raspberry Pi Model B worked like a charm. Running Debian Linux + JDK8 Jetty Container with the Solr and AIR WARs deployed Reduced Solr data set with approx ~1.5 Mio documents Model B Hardware Specs: o ARMv6 CPU at 700Mhz o 512MB RAM o 32GB SD-Card Mario-Leander Reimer 18

19 YOU GOTTA FIGHT FOR YOUR RIGHT TO SOLR! 19

20 No silver bullet. A careful schema design is crucial for your Solr performance. Mario-Leander Reimer 20

21 Naive data denormalization can quickly lead to combinatorial explosion. m FRU Groups Flat Rate Units m Packages n n n Measures Repair Instructions Vehicles m m n n Types m n n Technical Documents Parts n Fault Indications Num Docs: Relationship Navigation Mario-Leander Reimer 21

22 Multi-valued fields can efficiently store 1..n relations but may result in false positives. { } "INFO_TYPE":"AWPOS_GROUP", "NUMMER" :[ " ", " " ] "BAUSTAND" :[" T23:00:00Z"," T23:00:00Z"] "E_SERIES" :[ "F10", "E30" ] Index 0 Index 1 q=info_type:awpos_group AND NUMMER: AND E_SERIES:F10 q=info_type:awpos_group AND NUMMER: AND E_SERIES:E30 In case this doesn t matter, perform a post filtering in your application. Note: latest Solr versions support nested child documents. Use instead. Mario-Leander Reimer 22

23 Technical documents and their validity were expressed in a binary representation. o Validity expressions may have up to 46 characteristics. o Validity expressions use 5 different boolean operators (AND, NOT, ) o Validity expessions can be nested and complex. o Some characteristics are dynamic and not even known at index time. Solution: transform the validity expressions into the equivalent JavaScript terms and evaluate these terms at query time using a custom function query filter. Mario-Leander Reimer 23

24 Binary validity expression example. Type( ) = Brand, Value( ) = BMW PKW Type( ) = E-Series, Value( ) = F10 Type( ) = Transmission, Value( ) = MECH 24

25 Transformation of binary validity terms into their JavaScript equivalent at index time. AND(Brand='BMW PKW', E-Series='F10' Transmission='MECH') ((BRAND=='BMW PKW')&&(E_SERIES=='F10')&&(TRANSMISSION=='MECH')) { "INFO_TYPE": "TECHNISCHES_DOKUMENT", "DOKUMENT_TITEL": "Getriebe aus- und einbauen", "DOKUMENT_ART": " reparaturanleitung", "VALIDITY": "((BRAND=='BMW PKW')&&((E_SERIES=='F10')&&(...))", BRAND": [ BMW PKW"],... } Mario-Leander Reimer 25

26 Base64 decode The JavaScript validity term is evaluated at query time using a custom function query. &fq=info_type:technisches_dokument &fq=dokument_art:reparaturanleitung &fq={!frange l=1 u=1 incl=true incu=true cache=false cost=500} jsterm(validity,eyjnt1rpul9lukfgvfnut0zgqvjux01pve9sqvjcruluu 1ZFUkZBSFJFTiI6IkIiLCJFX01BU0NISU5FX0tSQUZUU1RPRkZBUlQiOm51bG wsilnjq0hfukhfsvrtrkfiulpfvucioiiwiiwiqu5uuklfqii6ikfxrcisikv kjbvvjfsuhfijoiwccifq==) { BRAND":"BMW PKW", "E_SERIES":"F10", "TRANSMISSION":"MECH" } Mario-Leander Reimer 26

27 How often do we load data? How do we ensure data consistency? 27

28 A traditional approach using a DWH and ETL: too inflexible, heavy weight and expensive. System A File DB ETL ETL jobs would usually be implemented with Informatica System B ETL ETL Data Warehouse ETL AIR Solr ETL System C ETL File DB Significant business logic required depending on the source database Mario-Leander Reimer 28

29 Start Build Run Flexible and lightweight ETL combined with Continuous Delivery and DevOps. H <<System>> AIR Loader Slave H <<System>> AIR Search Developer Solr Index Solr Index Build & Deploy I <<System>> Apache Solr A <<System>> AIR Loader Load Replicate I <<System>> Apache Solr I <<System>> Nexus Repository I <<System>> Apache Maven Extract I <<System>> Jenkins Slave Data Source A Operations I <<System>> Jenkins Master Data Source n Mario-Leander Reimer 29

30 30

31 Apache Solr has become a powerful tool for data analytics applications. Be creative. Our next big project using Apache Solr is already on its way. High performance application to predict and calculate the bill of materials for all required parts and orders. Apache Solr as a compressed, scalable and high performance time series database. FOSDEM 15 Florian Lautenschlager, QAware GmbH Leveraging the Power of SOLR and SPARK Apache: Big Data 2015 Johannes Weigend, QAware GmbH Mario-Leander Reimer 31

32 Business intelligence is about asking the right questions about your data. 32

33 And with Apache Solr you can search and find the answers you are looking for. 33

34 & Mario-Leander Reimer Chief Technologist, QAware GmbH

Leveraging the Power of SOLR with SPARK. Johannes Weigend QAware GmbH Germany pache Big Data Europe September 2015

Leveraging the Power of SOLR with SPARK. Johannes Weigend QAware GmbH Germany pache Big Data Europe September 2015 Leveraging the Power of SOLR with SPARK Johannes Weigend QAware GmbH Germany pache Big Data Europe September 2015 Welcome Johannes Weigend - CTO QAware GmbH - Software architect / developer - 25 years

More information

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco Decoding the Big Data Deluge a Virtual Approach Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco High-volume, velocity and variety information assets that demand

More information

Search and Real-Time Analytics on Big Data

Search and Real-Time Analytics on Big Data Search and Real-Time Analytics on Big Data Sewook Wee, Ryan Tabora, Jason Rutherglen Accenture & Think Big Analytics Strata New York October, 2012 Big Data: data becomes your core asset. It realizes its

More information

College of Engineering, Technology, and Computer Science

College of Engineering, Technology, and Computer Science College of Engineering, Technology, and Computer Science Design and Implementation of Cloud-based Data Warehousing In partial fulfillment of the requirements for the Degree of Master of Science in Technology

More information

XpoLog Competitive Comparison Sheet

XpoLog Competitive Comparison Sheet XpoLog Competitive Comparison Sheet New frontier in big log data analysis and application intelligence Technical white paper May 2015 XpoLog, a data analysis and management platform for applications' IT

More information

System Requirements Table of contents

System Requirements Table of contents Table of contents 1 Introduction... 2 2 Knoa Agent... 2 2.1 System Requirements...2 2.2 Environment Requirements...4 3 Knoa Server Architecture...4 3.1 Knoa Server Components... 4 3.2 Server Hardware Setup...5

More information

Informatica Data Director Performance

Informatica Data Director Performance Informatica Data Director Performance 2011 Informatica Abstract A variety of performance and stress tests are run on the Informatica Data Director to ensure performance and scalability for a wide variety

More information

Sisense. Product Highlights. www.sisense.com

Sisense. Product Highlights. www.sisense.com Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze

More information

How To Retire A Legacy System From Healthcare With A Flatirons Eas Application Retirement Solution

How To Retire A Legacy System From Healthcare With A Flatirons Eas Application Retirement Solution EAS Application Retirement Case Study: Health Insurance Introduction A major health insurance organization contracted with Flatirons Solutions to assist them in retiring a number of aged applications that

More information

Oracle Identity Analytics Architecture. An Oracle White Paper July 2010

Oracle Identity Analytics Architecture. An Oracle White Paper July 2010 Oracle Identity Analytics Architecture An Oracle White Paper July 2010 Disclaimer The following is intended to outline our general product direction. It is intended for information purposes only, and may

More information

Real-World Analytics with Solr Cloud and Spark

Real-World Analytics with Solr Cloud and Spark Real-World Analytics with Solr Cloud and Spark Solving Analytic Problems for Billions of Records Within Seconds Johannes Weigend Apache Big Data North America 2016 May 2016 Vancouver, May 2016 Johannes

More information

Tips and tricks for using SAP BusinessObjects Web Intelligence with SAP BW

Tips and tricks for using SAP BusinessObjects Web Intelligence with SAP BW Orange County Convention Center Orlando, Florida May 15-18, 2011 Tips and tricks for using SAP BusinessObjects Web Intelligence with SAP BW Deepu Sasidharan ] [ Agenda Introduction System Landscape Advanced

More information

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise

<Insert Picture Here> Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise Oracle BI Standard Edition One The Right BI Foundation for the Emerging Enterprise Business Intelligence is the #1 Priority the most important technology in 2007 is business intelligence

More information

applications. JBoss Enterprise Application Platform

applications. JBoss Enterprise Application Platform JBoss Enterprise Application Platform What is it? JBoss Enterprise Application Platform is the industryleading platform for next-generation enterprise Java applications. It provides a stable, open source

More information

An Oracle White Paper July 2011. Oracle Primavera Contract Management, Business Intelligence Publisher Edition-Sizing Guide

An Oracle White Paper July 2011. Oracle Primavera Contract Management, Business Intelligence Publisher Edition-Sizing Guide Oracle Primavera Contract Management, Business Intelligence Publisher Edition-Sizing Guide An Oracle White Paper July 2011 1 Disclaimer The following is intended to outline our general product direction.

More information

Architecting for the next generation of Big Data Hortonworks HDP 2.0 on Red Hat Enterprise Linux 6 with OpenJDK 7

Architecting for the next generation of Big Data Hortonworks HDP 2.0 on Red Hat Enterprise Linux 6 with OpenJDK 7 Architecting for the next generation of Big Data Hortonworks HDP 2.0 on Red Hat Enterprise Linux 6 with OpenJDK 7 Yan Fisher Senior Principal Product Marketing Manager, Red Hat Rohit Bakhshi Product Manager,

More information

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence

Introduction to Oracle Business Intelligence Standard Edition One. Mike Donohue Senior Manager, Product Management Oracle Business Intelligence Introduction to Oracle Business Intelligence Standard Edition One Mike Donohue Senior Manager, Product Management Oracle Business Intelligence The following is intended to outline our general product direction.

More information

Load and Performance Load Testing. RadView Software October 2015 www.radview.com

Load and Performance Load Testing. RadView Software October 2015 www.radview.com Load and Performance Load Testing RadView Software October 2015 www.radview.com Contents Introduction... 3 Key Components and Architecture... 4 Creating Load Tests... 5 Mobile Load Testing... 9 Test Execution...

More information

Oracle Data Integrator Overview and Demo. Maria Mundrova ETL Developer/DW Consultant

Oracle Data Integrator Overview and Demo. Maria Mundrova ETL Developer/DW Consultant Oracle Data Integrator Overview and Demo Maria Mundrova ETL Developer/DW Consultant BGOUG Spring Conference, 2009 Presentation Agenda Oracle Data Integrator What is ODI? ELT Approach Architecture Oracle

More information

Take full advantage of IBM s IDEs for end- to- end mobile development

Take full advantage of IBM s IDEs for end- to- end mobile development Take full advantage of IBM s IDEs for end- to- end mobile development ABSTRACT Mobile development with Rational Application Developer 8.5, Rational Software Architect 8.5, Rational Developer for zenterprise

More information

Performance Management Platform

Performance Management Platform Open EMS Suite by Nokia Performance Management Platform Functional Overview Version 1.4 Nokia Siemens Networks 1 (16) Performance Management Platform The information in this document is subject to change

More information

OBIEE 11g Scaleout & Clustering

OBIEE 11g Scaleout & Clustering OBIEE 11g Scaleout & Clustering Borkur Steingrimsson, Rittman Mead Consulting Collaborate, Orlando, April 2011 Agenda Review OBIEE Architecture Installation Scenarios : Desktop, Departmental, Enterprise

More information

Oracle Primavera P6 Enterprise Project Portfolio Management Performance and Sizing Guide. An Oracle White Paper October 2010

Oracle Primavera P6 Enterprise Project Portfolio Management Performance and Sizing Guide. An Oracle White Paper October 2010 Oracle Primavera P6 Enterprise Project Portfolio Management Performance and Sizing Guide An Oracle White Paper October 2010 Disclaimer The following is intended to outline our general product direction.

More information

Inside the Digital Commerce Engine. The architecture and deployment of the Elastic Path Digital Commerce Engine

Inside the Digital Commerce Engine. The architecture and deployment of the Elastic Path Digital Commerce Engine Inside the Digital Commerce Engine The architecture and deployment of the Elastic Path Digital Commerce Engine Contents Executive Summary... 3 Introduction... 4 What is the Digital Commerce Engine?...

More information

Big Data for Investment Research Management

Big Data for Investment Research Management IDT Partners www.idtpartners.com Big Data for Investment Research Management Discover how IDT Partners helps Financial Services, Market Research, and Investment Management firms turn big data into actionable

More information

SharePoint 2010 vs. SharePoint 2013 Feature Comparison

SharePoint 2010 vs. SharePoint 2013 Feature Comparison SharePoint 2010 vs. SharePoint 2013 Feature Comparison March 2013 2013 SUSAN HANLEY LLC SharePoint 2010 vs. 2013 From a document collaboration perspective, the structures of both versions are the same

More information

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013 Big Data Use Case How Rackspace is using Private Cloud for Big Data Bryan Thompson May 8th, 2013 Our Big Data Problem Consolidate all monitoring data for reporting and analytical purposes. Every device

More information

XpoLog Center Suite Data Sheet

XpoLog Center Suite Data Sheet XpoLog Center Suite Data Sheet General XpoLog is a data analysis and management platform for Applications IT data. Business applications rely on a dynamic heterogeneous applications infrastructure, such

More information

2012 LABVANTAGE Solutions, Inc. All Rights Reserved.

2012 LABVANTAGE Solutions, Inc. All Rights Reserved. LABVANTAGE Architecture 2012 LABVANTAGE Solutions, Inc. All Rights Reserved. DOCUMENT PURPOSE AND SCOPE This document provides an overview of the LABVANTAGE hardware and software architecture. It is written

More information

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc.

Oracle BI EE Implementation on Netezza. Prepared by SureShot Strategies, Inc. Oracle BI EE Implementation on Netezza Prepared by SureShot Strategies, Inc. The goal of this paper is to give an insight to Netezza architecture and implementation experience to strategize Oracle BI EE

More information

European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project

European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project Janet Delve, University of Portsmouth Kuldar Aas, National Archives of Estonia Rainer Schmidt, Austrian Institute

More information

Liferay Portal s Document Library: Architectural Overview, Performance and Scalability

Liferay Portal s Document Library: Architectural Overview, Performance and Scalability Liferay Portal s Document Library: Architectural Overview, Performance and Scalability Table of Contents EXECUTIVE SUMMARY... 1 HIGH LEVEL ARCHITECTURE... 2 User Interface Layer... 2 Service Layer....

More information

Cost-Effective Business Intelligence with Red Hat and Open Source

Cost-Effective Business Intelligence with Red Hat and Open Source Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,

More information

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Introduction to Hadoop HDFS and Ecosystems ANSHUL MITTAL Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Topics The goal of this presentation is to give

More information

SAP Crystal Reports & SAP HANA: Integration & Roadmap Kenneth Li SAP SESSION CODE: 0401

SAP Crystal Reports & SAP HANA: Integration & Roadmap Kenneth Li SAP SESSION CODE: 0401 SAP Crystal Reports & SAP HANA: Integration & Roadmap Kenneth Li SAP SESSION CODE: 0401 LEARNING POINTS Learn about Crystal Reports for HANA Glance at the road map for the product Overview of deploying

More information

The Rise of Industrial Big Data. Brian Courtney General Manager Industrial Data Intelligence

The Rise of Industrial Big Data. Brian Courtney General Manager Industrial Data Intelligence The Rise of Industrial Big Data Brian Courtney General Manager Industrial Data Intelligence Agenda Introduction Big Data for the industrial sector Case in point: Big data saves millions at GE Energy Seeking

More information

Agile Business Intelligence Data Lake Architecture

Agile Business Intelligence Data Lake Architecture Agile Business Intelligence Data Lake Architecture TABLE OF CONTENTS Introduction... 2 Data Lake Architecture... 2 Step 1 Extract From Source Data... 5 Step 2 Register And Catalogue Data Sets... 5 Step

More information

Search Big Data with MySQL and Sphinx. Mindaugas Žukas www.ivinco.com

Search Big Data with MySQL and Sphinx. Mindaugas Žukas www.ivinco.com Search Big Data with MySQL and Sphinx Mindaugas Žukas www.ivinco.com Agenda Big Data Architecture Factors and Technologies MySQL and Big Data Sphinx Search Server overview Case study: building a Big Data

More information

Beyond Conventional Data Warehousing. Florian Waas Greenplum Inc.

Beyond Conventional Data Warehousing. Florian Waas Greenplum Inc. Beyond Conventional Data Warehousing Florian Waas Greenplum Inc. Takeaways The basics Who is Greenplum? What is Greenplum Database? The problem Data growth and other recent trends in DWH A look at different

More information

Software. PowerExplorer. Information Management and Platform DATA SHEET

Software. PowerExplorer. Information Management and Platform DATA SHEET DATA SHEET PowerExplorer Software Information Management and Platform KEY FEATURES Web-enabled Advanced, ad-hoc query capabilities Spatial E&P data presentation ZGF file import/export Spatializer Tabular

More information

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Siva Ravada Senior Director of Development Oracle Spatial and MapViewer 2 Evolving Technology Platforms

More information

Using Linux Clusters as VoD Servers

Using Linux Clusters as VoD Servers HAC LUCE Using Linux Clusters as VoD Servers Víctor M. Guĺıas Fernández [email protected] Computer Science Department University of A Corunha funded by: Outline Background: The Borg Cluster Video on Demand.

More information

Overview Western 12.-13.9.2012 Mariusz Gieparda

Overview Western 12.-13.9.2012 Mariusz Gieparda Overview Western 12.-13.9.2012 Mariusz Gieparda 1 Corporate Overview Company Global Leader in Business Continuity Easy. Affordable. Innovative. Technology Protection Operational Excellence Compliance Customer

More information

Nuxeo, an open source platform for content-centric business applications. Stéfane Fermigier, Nuxeo Laurent Doguin, Nuxeo

Nuxeo, an open source platform for content-centric business applications. Stéfane Fermigier, Nuxeo Laurent Doguin, Nuxeo Nuxeo, an open source platform for content-centric business applications Stéfane Fermigier, Nuxeo Laurent Doguin, Nuxeo Nuxeo, the Company Providing an Open Source Content Management Platform for Business

More information

Oracle Architecture, Concepts & Facilities

Oracle Architecture, Concepts & Facilities COURSE CODE: COURSE TITLE: CURRENCY: AUDIENCE: ORAACF Oracle Architecture, Concepts & Facilities 10g & 11g Database administrators, system administrators and developers PREREQUISITES: At least 1 year of

More information

The Future of IoT. Zach Shelby VP Marketing, IoT Feb 3 rd, 2015

The Future of IoT. Zach Shelby VP Marketing, IoT Feb 3 rd, 2015 The Future of IoT Zach Shelby VP Marketing, IoT Feb 3 rd, 2015 1 Internet of (really nerdy) People 1980s 2 Internet of (content silo) People 1990s 3 Internet of (Web) People 2000s 4 Internet of (really

More information

http://support.oracle.com/

http://support.oracle.com/ Oracle Primavera Contract Management 14.0 Sizing Guide October 2012 Legal Notices Oracle Primavera Oracle Primavera Contract Management 14.0 Sizing Guide Copyright 1997, 2012, Oracle and/or its affiliates.

More information

Databricks. A Primer

Databricks. A Primer Databricks A Primer Who is Databricks? Databricks vision is to empower anyone to easily build and deploy advanced analytics solutions. The company was founded by the team who created Apache Spark, a powerful

More information

Oracle Data Integrator 11g: Integration and Administration

Oracle Data Integrator 11g: Integration and Administration Oracle University Contact Us: Local: 1800 103 4775 Intl: +91 80 4108 4709 Oracle Data Integrator 11g: Integration and Administration Duration: 5 Days What you will learn Oracle Data Integrator is a comprehensive

More information

Oracle Big Data SQL Technical Update

Oracle Big Data SQL Technical Update Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical

More information

Contents Introduction... 5 Deployment Considerations... 9 Deployment Architectures... 11

Contents Introduction... 5 Deployment Considerations... 9 Deployment Architectures... 11 Oracle Primavera Contract Management 14.1 Sizing Guide July 2014 Contents Introduction... 5 Contract Management Database Server... 5 Requirements of the Contract Management Web and Application Servers...

More information

Enterprise Service Bus

Enterprise Service Bus We tested: Talend ESB 5.2.1 Enterprise Service Bus Dr. Götz Güttich Talend Enterprise Service Bus 5.2.1 is an open source, modular solution that allows enterprises to integrate existing or new applications

More information

Phire Architect Hardware and Software Requirements

Phire Architect Hardware and Software Requirements Phire Architect Hardware and Software Requirements Copyright 2014, Phire. All rights reserved. The Programs (which include both the software and documentation) contain proprietary information; they are

More information

Apache HBase. Crazy dances on the elephant back

Apache HBase. Crazy dances on the elephant back Apache HBase Crazy dances on the elephant back Roman Nikitchenko, 16.10.2014 YARN 2 FIRST EVER DATA OS 10.000 nodes computer Recent technology changes are focused on higher scale. Better resource usage

More information

Patent Public Advisory Committee Meeting PE2E Status. David Landrith Patents Portfolio Manager July 14, 2011

Patent Public Advisory Committee Meeting PE2E Status. David Landrith Patents Portfolio Manager July 14, 2011 Patent Public Advisory Committee Meeting PE2E Status David Landrith Patents Portfolio Manager July 14, 2011 PE2E Timeline 2 Front-End Summary Integrated front-end design & implementation Initial front-end

More information

Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III

Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III White Paper Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III Performance of Microsoft SQL Server 2008 BI and D/W Solutions on Dell PowerEdge

More information

Department of Defense. Enterprise Information Warehouse/Web (EIW) Using standards to Federate and Integrate Domains at DOD

Department of Defense. Enterprise Information Warehouse/Web (EIW) Using standards to Federate and Integrate Domains at DOD Department of Defense Human Resources - Enterprise Information Warehouse/Web (EIW) Using standards to Federate and Integrate Domains at DOD Federation Defined Members of a federation agree to certain standards

More information

Oracle Business Activity Monitoring 11g New Features

Oracle Business Activity Monitoring 11g New Features Oracle Business Activity Monitoring 11g New Features Gert Schüßler Principal Sales Consultant Oracle Deutschland GmbH Agenda Overview Architecture Enterprise Integration Framework

More information

Fusion Applications Overview of Business Intelligence and Reporting components

Fusion Applications Overview of Business Intelligence and Reporting components Fusion Applications Overview of Business Intelligence and Reporting components This document briefly lists the components, their common acronyms and the functionality that they bring to Fusion Applications.

More information

Databricks. A Primer

Databricks. A Primer Databricks A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to dramatically

More information

Hitachi Data Center Analytics

Hitachi Data Center Analytics Hitachi Data Center Analytics Agenda Storage analytics challenges Introducing Hitachi Data Center Analytics Storage analytics use cases and solutions Q&A Storage Analytics Challenges Storage Pain Points

More information

HDFS. Hadoop Distributed File System

HDFS. Hadoop Distributed File System HDFS Kevin Swingler Hadoop Distributed File System File system designed to store VERY large files Streaming data access Running across clusters of commodity hardware Resilient to node failure 1 Large files

More information

Tool-Assisted Knowledge to HL7 v3 Message Translation (TAMMP) Installation Guide December 23, 2009

Tool-Assisted Knowledge to HL7 v3 Message Translation (TAMMP) Installation Guide December 23, 2009 Tool-Assisted Knowledge to HL7 v3 Message Translation (TAMMP) Installation Guide December 23, 2009 Richard Lyn [email protected] Jianwei Yang [email protected] Document Revision History Rev. Level Date

More information

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014 BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014 Ralph Kimball Associates 2014 The Data Warehouse Mission Identify all possible enterprise data assets Select those assets

More information

Building Your Big Data Team

Building Your Big Data Team Building Your Big Data Team With all the buzz around Big Data, many companies have decided they need some sort of Big Data initiative in place to stay current with modern data management requirements.

More information

Case Study. Web Application for Financial & Economic Data Analysis. www.brainvire.com 2013 Brainvire Infotech Pvt. Ltd Page 1 of 1

Case Study. Web Application for Financial & Economic Data Analysis. www.brainvire.com 2013 Brainvire Infotech Pvt. Ltd Page 1 of 1 Case Study Web Application for Financial & Economic Data Analysis www.brainvire.com 2013 Brainvire Infotech Pvt. Ltd Page 1 of 1 Client Requirement This is a highly customized application for financial

More information

SAP Database Strategy Overview. Uwe Grigoleit September 2013

SAP Database Strategy Overview. Uwe Grigoleit September 2013 SAP base Strategy Overview Uwe Grigoleit September 2013 SAP s In-Memory and management Strategy Big- in Business-Context: Are you harnessing the opportunity? Mobile Transactions Things Things Instant Messages

More information

What's New in SAS Data Management

What's New in SAS Data Management Paper SAS034-2014 What's New in SAS Data Management Nancy Rausch, SAS Institute Inc., Cary, NC; Mike Frost, SAS Institute Inc., Cary, NC, Mike Ames, SAS Institute Inc., Cary ABSTRACT The latest releases

More information

PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP

PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP PLATFORA INTERACTIVE, IN-MEMORY BUSINESS INTELLIGENCE FOR HADOOP Your business is swimming in data, and your business analysts want to use it to answer the questions of today and tomorrow. YOU LOOK TO

More information

IBM Rational Asset Manager

IBM Rational Asset Manager Providing business intelligence for your software assets IBM Rational Asset Manager Highlights A collaborative software development asset management solution, IBM Enabling effective asset management Rational

More information

ORACLE TAX ANALYTICS. The Solution. Oracle Tax Data Model KEY FEATURES

ORACLE TAX ANALYTICS. The Solution. Oracle Tax Data Model KEY FEATURES ORACLE TAX ANALYTICS KEY FEATURES A set of comprehensive and compatible BI Applications. Advanced insight into tax performance Built on World Class Oracle s Database and BI Technology Design after the

More information

Enterprise GIS Architecture Deployment Options. Andrew Sakowicz

Enterprise GIS Architecture Deployment Options. Andrew Sakowicz Enterprise GIS Architecture Deployment Options Andrew Sakowicz Audience Audience - Architects - Developers - Administrators - Project Managers Level: - Beginner / Intermediate Introduction Andrew Sakowicz

More information

How to make a good Software Requirement Specification(SRS)

How to make a good Software Requirement Specification(SRS) Information Management Software Information Management Software How to make a good Software Requirement Specification(SRS) Click to add text TGMC 2011 Phases Registration SRS Submission Project Submission

More information

Ganzheitliches Datenmanagement

Ganzheitliches Datenmanagement Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist

More information

Jenkins User Conference Herzelia, July 5 2012 #jenkinsconf. Testing a Large Support Matrix Using Jenkins. Amir Kibbar HP http://hp.

Jenkins User Conference Herzelia, July 5 2012 #jenkinsconf. Testing a Large Support Matrix Using Jenkins. Amir Kibbar HP http://hp. Testing a Large Support Matrix Using Jenkins Amir Kibbar HP http://hp.com/go/oo About Me! 4.5 years with HP! Almost 3 years System Architect! Out of which 1.5 HP OO s SA! Before that a Java consultant

More information

IT Business Management System Requirements Guide

IT Business Management System Requirements Guide IT Business Management System Requirements Guide IT Business Management 8.0 This document supports the version of each product listed and supports all subsequent versions until the document is replaced

More information

Test Data Management Concepts

Test Data Management Concepts Test Data Management Concepts BIZDATAX IS AN EKOBIT BRAND Executive Summary Test Data Management (TDM), as a part of the quality assurance (QA) process is more than ever in the focus among IT organizations

More information

POTENTIAL DHH TECHNICAL ARCHITECTURE

POTENTIAL DHH TECHNICAL ARCHITECTURE POTENTIAL DHH TECHNICAL ARCHITECTURE SPECIFICALLY FOR CONSIDERATION REGARDING IMPLEMENTATIONS OF MEDS AND MMIS.NET, Java, Dynamics, SharePoint, ColdFusion, etc. BUSINESS APPLICATIONS PLATFORM GOVERNANCE

More information

Q: Which versions of Oracle BI does Primavera P6 Analytics support? A: Oracle Business Intelligence 10g

Q: Which versions of Oracle BI does Primavera P6 Analytics support? A: Oracle Business Intelligence 10g FAQ: Primavera P6 Analytics Q: Is Primavera P6 Analytics an Oracle BI Application like Oracle Project Analytics? A: Primavera P6 Analytics is very similar to other Oracle business intelligence applications

More information

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition

What s New with Informatica Data Services & PowerCenter Data Virtualization Edition 1 What s New with Informatica Data Services & PowerCenter Data Virtualization Edition Kevin Brady, Integration Team Lead Bonneville Power Wei Zheng, Product Management Informatica Ash Parikh, Product Marketing

More information

a division of Technical Overview Xenos Enterprise Server 2.0

a division of Technical Overview Xenos Enterprise Server 2.0 Technical Overview Enterprise Server 2.0 Enterprise Server Architecture The Enterprise Server (ES) platform addresses the HVTO business challenges facing today s enterprise. It provides robust, flexible

More information

Big Data Approaches. Making Sense of Big Data. Ian Crosland. Jan 2016

Big Data Approaches. Making Sense of Big Data. Ian Crosland. Jan 2016 Big Data Approaches Making Sense of Big Data Ian Crosland Jan 2016 Accelerate Big Data ROI Even firms that are investing in Big Data are still struggling to get the most from it. Make Big Data Accessible

More information

Constructing a Data Lake: Hadoop and Oracle Database United!

Constructing a Data Lake: Hadoop and Oracle Database United! Constructing a Data Lake: Hadoop and Oracle Database United! Sharon Sophia Stephen Big Data PreSales Consultant February 21, 2015 Safe Harbor The following is intended to outline our general product direction.

More information

Integrating data in the Information System An Open Source approach

Integrating data in the Information System An Open Source approach WHITE PAPER Integrating data in the Information System An Open Source approach Table of Contents Most IT Deployments Require Integration... 3 Scenario 1: Data Migration... 4 Scenario 2: e-business Application

More information

Qlik Sense Enabling the New Enterprise

Qlik Sense Enabling the New Enterprise Technical Brief Qlik Sense Enabling the New Enterprise Generations of Business Intelligence The evolution of the BI market can be described as a series of disruptions. Each change occurred when a technology

More information

100% NO CODING NO DEVELOPING IMMEDIATE BUSINESS -25% -70% UNLIMITED SCALABILITY DEVELOPMENT TIME SOFTWARE STABILITY

100% NO CODING NO DEVELOPING IMMEDIATE BUSINESS -25% -70% UNLIMITED SCALABILITY DEVELOPMENT TIME SOFTWARE STABILITY 100% UNLIMITED SCALABILITY TOTAL COST OF OWNERSHIP -25% +50% EFFICENCY INCREASE -70% +65% DEVELOPMENT TIME SOFTWARE STABILITY NO CODING NO DEVELOPING IMMEDIATE BUSINESS FlexyGo Rapid Application Builder

More information

8. Business Intelligence Reference Architectures and Patterns

8. Business Intelligence Reference Architectures and Patterns 8. Business Intelligence Reference Architectures and Patterns Winter Semester 2008 / 2009 Prof. Dr. Bernhard Humm Darmstadt University of Applied Sciences Department of Computer Science 1 Prof. Dr. Bernhard

More information

Planning the Installation and Installing SQL Server

Planning the Installation and Installing SQL Server Chapter 2 Planning the Installation and Installing SQL Server In This Chapter c SQL Server Editions c Planning Phase c Installing SQL Server 22 Microsoft SQL Server 2012: A Beginner s Guide This chapter

More information

POINT-TO-POINT vs. MEAP THE RIGHT APPROACH FOR AN INTEGRATED MOBILITY SOLUTION

POINT-TO-POINT vs. MEAP THE RIGHT APPROACH FOR AN INTEGRATED MOBILITY SOLUTION POINT-TO-POINT vs. MEAP THE RIGHT APPROACH FOR AN INTEGRATED MOBILITY SOLUTION Executive Summary Enterprise mobility has transformed the way businesses engage with customers, partners and staff while exchanging

More information

Analytics Software for Energy Management and Building Systems Optimization and Equipment Fault Detection

Analytics Software for Energy Management and Building Systems Optimization and Equipment Fault Detection Analytics Software for Energy Management and Building Systems Optimization and Equipment Fault Detection Data is the Driver We now have access to data lots of it! Building automation system data Utility

More information

<Insert Picture Here> Michael Hichwa VP Database Development Tools [email protected] Stuttgart September 18, 2007 Hamburg September 20, 2007

<Insert Picture Here> Michael Hichwa VP Database Development Tools michael.hichwa@oracle.com Stuttgart September 18, 2007 Hamburg September 20, 2007 Michael Hichwa VP Database Development Tools [email protected] Stuttgart September 18, 2007 Hamburg September 20, 2007 Oracle Application Express Introduction Architecture

More information

OWB Users, Enter The New ODI World

OWB Users, Enter The New ODI World OWB Users, Enter The New ODI World Kulvinder Hari Oracle Introduction Oracle Data Integrator (ODI) is a best-of-breed data integration platform focused on fast bulk data movement and handling complex data

More information

Cross Platform Applications with IBM Worklight

Cross Platform Applications with IBM Worklight IJCSNS International Journal of Computer Science and Network Security, VOL.15 No.11, November 2015 101 Cross Platform Applications with IBM Worklight P.S.S.Vara Prasad and Mrs.S.Durga Devi Dept. of IT

More information

Integrating Ingres in the Information System: An Open Source Approach

Integrating Ingres in the Information System: An Open Source Approach Integrating Ingres in the Information System: WHITE PAPER Table of Contents Ingres, a Business Open Source Database that needs Integration... 3 Scenario 1: Data Migration... 4 Scenario 2: e-business Application

More information